Permutation Tests

September 30 + October 2, 2024

Jo Hardin

Agenda 9/30/24

Review: logic of hypothesis testing
Logic of permutation tests
Examples - 2 samples and beyond

Agenda 10/2/24

Conditions, exchangeability, random structure
Different structures and statistics

Permuting MacNell data

Conceptually, there are two levels of randomization:

$N_{m}$ students are randomly assigned to the male instructor and $N_{f}$ are assigned to the female instructor.
Of the $N_{j}$ assigned to instructor $j$ , $N_{j m}$ are told that the instructor is male, and $N_{j f}$ are told that the instructor is female for $j = m, f$ .

macnell  |>
  group_by(tagender, taidgender)  |>
  summarize(n())

# A tibble: 4 × 3
# Groups:   tagender [2]
  tagender taidgender `n()`
     <dbl>      <dbl> <int>
1        0          0    11
2        0          1    12
3        1          0    13
4        1          1    11

Other Test Statistics

Data	Hypothesis Question	Statistic
2 categorical	diff in prop	${\hat{p}}_{1} - {\hat{p}}_{2}$ or $χ^{2}$
variables	ratio of prop	${\hat{p}}_{1} / {\hat{p}}_{2}$
1 numeric	diff in means	${\overset{―}{X}}_{1} - {\overset{―}{X}}_{2}$
1 binary	ratio of means	${\overset{―}{X}}_{1} / {\overset{―}{X}}_{2}$
	diff in medians	${median}_{1} - {median}_{2}$
	ratio of medians	${median}_{1} / {median}_{2}$
	diff in SD	$s_{1} - s_{2}$
	diff in var	$s_{1}^{2} - s_{2}^{2}$
	ratio of SD or VAR	$s_{1} / s_{2}$
1 numeric	diff in means	$\sum n_{i} ({\overset{―}{X}}_{i} - \overset{―}{X})^{2}$ or
k groups		F stat
paired or	(permute within row)	${\overset{―}{X}}_{1} - {\overset{―}{X}}_{2}$
repeated measures
regression	correlation	least sq slope
time series	no serial corr	lag 1 autocross

1 / 43

Permutation Tests September 30 + October 2, 2024 Jo Hardin

Permutation Tests
Agenda 9/30/24
Statistics Without the Agonizing Pain
Logic of hypothesis tests
Logic of permutation tests
Consider the NHANES dataset.
Summary of the variables of interest
Mean Income broken down by Health
Income and Health
Differences in Income ($)
Overall difference
Creating a test statistic
Creating a test statistic
Creating a test statistic
Creating a test statistic
Permuting the data
Permuting the data & a new test statistic
Lots of times…
Compared to the real data
Compared to the observed test statistic
Agenda 10/2/24
Exchangeability
Exchangeability
Probability as measured by what?
Permuting independent observations
Permuting homogenous cluster
Permuting herterogenous cluster
Gender bias in teaching evaluations
Gender bias in teaching evaluations
Gender bias in teaching evaluations
Gender bias in teaching evaluations
Gender bias: MacNell data
Analysis goal
MacNell Data without permutation
Permuting MacNell data
Stratified two-sample test:
MacNell Data with permutation
MacNell Data with permutation
MacNell Data with permutation
Observed vs. Permuted statistic
MacNell Data with permutation
MacNell results
Other Test Statistics