05. Permutation Tests

Simulating scenarios, simulating datasets, simulating random variables.

Author

Published

September 30, 2024

Agenda

Class notes: Permutation Tests
Baumer, Horton, and Kaplan (2021), Simulation (Chp 13) in Modern Data Science for R.

What is a test statistic?
What is a p-value?
Why for a two sample comparison (treatment A vs treatment B) is it okay to use ${\overset{―}{X}}_{A} - {\overset{―}{X}}_{B}$ for a test statistic in a permutation test, but for a t-test the test statistic is necessarily $t^{*} = \frac{{\overset{―}{X}}_{A} - {\overset{―}{X}}_{B}}{\sqrt{s_{A}^{2} / n_{A} + s_{B}^{2} / n_{B}}}$ (that is, divided by a measure of variability)?
How do you know what to permute in order to create a null sampling distribution?
What does “exchangeability” mean (as a technical condition) when discussing permutation tests?
What is the difference between a permutation test and a randomization test? Are there times when doing a randomization test is possible?
What is power? What are type I and type II errors?

In a permutation test, sometimes there are many test statistics to choose from (which address the same hypotheses). Why wouldn’t you want to try them all and choose the one that gives you the highest level of significance?
When is it acceptable to claim that the resulting “significant” outcome is actually a causal relationship (and not just an association)?

:::