📘 Non-Parametric Tests: Introduction & Applications

Non-parametric tests are used when the data doesn't meet the assumptions required for parametric tests (e.g., normal distribution, equal variance).

These tests are often used when dealing with ordinal data, non-normally distributed data, or small sample sizes.

🎯 What are Non-Parametric Tests?

Non-parametric tests are statistical tests that do not rely on assumptions about the parameters of the population distribution, unlike parametric tests (such as the t-test or ANOVA) that assume normality and other conditions.

They are also known as distribution-free tests because they do not assume any specific distribution of the data.

Common uses of non-parametric tests:

  • When the data is ordinal (ranks, categories).
  • When the data is not normally distributed.
  • When sample sizes are small.
  • When dealing with skewed distributions or outliers.

⚖️ Key Features of Non-Parametric Tests

  • No assumption about data distribution (non-normality is acceptable).
  • Suitable for ordinal data or non-numeric scales.
  • More flexible in handling small or skewed datasets.
  • Not as powerful as parametric tests when the assumptions of parametric tests are met.

Non-parametric tests are often the go-to choice when the data does not meet the requirements of parametric methods.

📊 Common Non-Parametric Tests

  • Chi-Square Test: Used for categorical data, to determine if there is an association between two variables (covered previously in our course).
  • Mann-Whitney U Test: Used to compare two independent groups when the dependent variable is ordinal or not normally distributed.
  • Wilcoxon Signed-Rank Test: Used to compare two related samples when the dependent variable is ordinal or not normally distributed (paired test).
  • Kruskal-Wallis H Test: Used to compare more than two independent groups, similar to one-way ANOVA, when assumptions for ANOVA are not met.
  • Spearman's Rank Correlation: Used to measure the strength and direction of association between two ranked variables.

🔹 Mann-Whitney U Test

The Mann-Whitney U test is used to compare two independent groups when the dependent variable is ordinal or not normally distributed.

Example: Testing whether men and women have different levels of job satisfaction. If satisfaction scores are ordinal (e.g., low, medium, high), the Mann-Whitney U test can be used.

Test Logic:

  • Ranks all data from both groups together.
  • Calculates a U statistic based on the ranks.
  • Compares the U statistic to the distribution to find the p-value.

Null Hypothesis (H₀): The distributions of both groups are the same.

Alternative Hypothesis (H₁): The distributions of both groups are different.

🔹 Wilcoxon Signed-Rank Test

The Wilcoxon Signed-Rank test is used to compare two related or paired samples when the data is ordinal or not normally distributed.

Example: Comparing the pre- and post-treatment scores of patients on a scale where the differences are not normally distributed.

Test Logic:

  • For each pair of values, calculate the difference.
  • Rank the absolute values of the differences.
  • Sum the ranks for positive and negative differences separately.
  • Calculate the test statistic and compare it to the critical value.

Null Hypothesis (H₀): There is no difference between the two related samples.

Alternative Hypothesis (H₁): There is a significant difference between the two related samples.

🔹 Kruskal-Wallis H Test

The Kruskal-Wallis H test is used to compare more than two independent groups, particularly when the assumptions for ANOVA are not met (e.g., non-normal distribution).

Example: Comparing the satisfaction levels of customers across three different stores where the satisfaction scores are ordinal.

Test Logic:

  • Ranks all data across all groups.
  • Calculates the H statistic based on the ranks and the sample sizes.
  • Compares the H statistic to the Chi-Square distribution to find the p-value.

Null Hypothesis (H₀): All group distributions are the same.

Alternative Hypothesis (H₁): At least one group distribution differs significantly.

🔹 Spearman's Rank Correlation

Spearman's Rank Correlation is used to measure the strength and direction of the relationship between two ranked variables.

Example: Testing if there is a relationship between employee satisfaction and performance ranking.

Test Logic:

  • Ranks the values of both variables.
  • Calculates the difference in ranks for each pair of values.
  • Calculates the correlation coefficient (ρ), which ranges from -1 to 1.
  • ρ > 0 indicates a positive correlation, ρ < 0 indicates a negative correlation, and ρ = 0 indicates no correlation.

Null Hypothesis (H₀): There is no relationship between the two ranked variables.

Alternative Hypothesis (H₁): There is a significant relationship between the two ranked variables.

📊 Non-Parametric vs Parametric Tests

Test Type When to Use Example Assumptions
Parametric Tests When data is normally distributed and variances are known/assumed equal t-test, ANOVA Normal distribution, equal variance
Non-Parametric Tests When data is not normally distributed, ordinal, or small sample size Mann-Whitney U test, Wilcoxon signed-rank test No assumption about the population distribution

🧠 Key Insights

  • Non-parametric tests do not assume normality and are used for ordinal or non-normally distributed data.
  • Common non-parametric tests include the Mann-Whitney U test, Wilcoxon signed-rank test, and Kruskal-Wallis H test.
  • Non-parametric tests are useful for small sample sizes or data with outliers.
  • Although non-parametric tests are more flexible, they may be less powerful than parametric tests when the assumptions for the latter are met.