Choosing a comparison
| Design | Continuous outcome | Binary outcome |
|---|---|---|
| Two independent groups | t.test(y ~ g) (Welch); Wilcoxon rank-sum | prop.test / fisher.test |
| Paired / pre-post | t.test(pre, post, paired = TRUE); Wilcoxon signed-rank | McNemar |
| More than two groups | aov(y ~ g); Kruskal-Wallis | chi-square |
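As a minimal sketch of the two-group row, on simulated data (the group sizes and effect are made up for illustration):

```r
set.seed(1)
# two independent groups with different means and variances
y <- c(rnorm(30, mean = 0, sd = 1), rnorm(30, mean = 0.8, sd = 2))
g <- factor(rep(c("A", "B"), each = 30))

t.test(y ~ g)      # Welch by default: var.equal = FALSE
wilcox.test(y ~ g) # rank-based alternative
```

Note that R's `t.test` is Welch unless you ask for `var.equal = TRUE`, so unequal variances are handled without any extra work.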
Effect sizes
| Effect size | Notes |
|---|---|
| Mean difference + 95% CI | primary for continuous outcomes |
| Cohen’s d | standardised mean difference |
| Hedges’ g | small-sample correction of d |
| Risk ratio (RR) / odds ratio (OR) | two proportions |
| Risk difference | absolute, clinically intuitive |

```r
effectsize::cohens_d(y ~ g)
```
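If `effectsize` is not installed, the pooled-SD formula behind Cohen's d is short enough to write in base R; the helper name below is ours, not part of any package:

```r
# Cohen's d by hand: mean difference divided by the pooled SD
cohens_d_manual <- function(x1, x2) {
  n1 <- length(x1); n2 <- length(x2)
  sp <- sqrt(((n1 - 1) * var(x1) + (n2 - 1) * var(x2)) / (n1 + n2 - 2))
  (mean(x1) - mean(x2)) / sp
}

set.seed(42)
a <- rnorm(50, mean = 1.0)
b <- rnorm(50, mean = 0.5)
cohens_d_manual(a, b) # should land near the true d of 0.5
```

This is the uncorrected d; Hedges' g multiplies it by a small-sample factor of roughly 1 - 3/(4(n1 + n2) - 9).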
Two proportions
```r
prop.test(c(tA, tB), c(nA, nB))        # asymptotic
fisher.test(matrix(c(tA, nA - tA,
                     tB, nB - tB), 2)) # exact, small cells
```
Report RR (or OR) with 95% CI, not just the p-value.
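A base-R sketch of that report, using the standard log-scale Wald interval for the RR (the counts here are made up):

```r
# Risk ratio with a 95% CI on the log scale
tA <- 30; nA <- 100 # events / total, arm A
tB <- 18; nB <- 100 # events / total, arm B

rr     <- (tA / nA) / (tB / nB)
se_log <- sqrt(1/tA - 1/nA + 1/tB - 1/nB)
ci     <- exp(log(rr) + c(-1, 1) * qnorm(0.975) * se_log)

round(c(RR = rr, lower = ci[1], upper = ci[2]), 2)
```

The CI is built on the log scale because log(RR) is approximately normal; exponentiating the endpoints gives an interval that is correctly asymmetric around the RR.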
Correlation
| Method | Captures | Assumptions / notes |
|---|---|---|
| Pearson | linear association, continuous | bivariate normal, no outliers |
| Spearman | monotonic association | rank-based, robust |
| Kendall | concordant/discordant pairs | robust, slow on large data |

```r
cor.test(x, y, method = "spearman")
```
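The "no outliers" caveat for Pearson is easy to demonstrate: one gross outlier can flip the Pearson coefficient while Spearman, working on ranks, barely moves (simulated data):

```r
set.seed(7)
x <- rnorm(40)
y <- x + rnorm(40, sd = 0.3) # strong linear relationship
x_out <- c(x, 10)            # add a single gross outlier
y_out <- c(y, -10)

cor(x_out, y_out, method = "pearson")  # dragged far from the true value
cor(x_out, y_out, method = "spearman") # nearly unchanged
```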
Non-parametric tests
| Test | Replaces |
|---|---|
| Wilcoxon rank-sum (Mann-Whitney) | two-sample t |
| Wilcoxon signed-rank | paired t |
| Kruskal-Wallis | one-way ANOVA |
| Sign test | paired t, when even the signed-rank's symmetry assumption fails |
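Base R has no dedicated sign-test function; the usual route is `binom.test` on the signs of the paired differences (the pre/post data below are simulated for illustration):

```r
set.seed(3)
pre  <- rnorm(20, mean = 10)
post <- pre + rnorm(20, mean = 0.5)

d <- post - pre
# sign test: under H0, positive and negative differences are equally likely
binom.test(sum(d > 0), sum(d != 0), p = 0.5)
```

Ties (`d == 0`) are dropped from the denominator, which is why `sum(d != 0)` rather than `length(d)` is used.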
Power and sample size
```r
pwr::pwr.t.test(d = 0.5, power = 0.80, sig.level = 0.05,
                type = "two.sample")
```

| Design | pwr function |
|---|---|
| two-sample t | pwr.t.test |
| two proportions | pwr.2p.test, pwr.2p2n.test |
| correlation | pwr.r.test |
| one-way ANOVA | pwr.anova.test |
Simulation-based power for anything the textbooks skip: simr::powerSim.
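The same idea can be rolled by hand in base R, which is worth seeing once: simulate the experiment many times and count how often the test rejects. The helper name and settings below are ours, not from `pwr` or `simr`:

```r
# Simulation-based power for a two-sample t-test
sim_power <- function(n, d, nsim = 2000, alpha = 0.05) {
  p <- replicate(nsim, t.test(rnorm(n), rnorm(n, mean = d))$p.value)
  mean(p < alpha) # proportion of simulated experiments that reject H0
}

set.seed(123)
sim_power(n = 64, d = 0.5) # should sit close to pwr.t.test's 0.80
```

With `nsim = 2000` the Monte Carlo error is about one percentage point; increase `nsim` for a smoother answer.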
Reporting with gtsummary
```r
library(gtsummary)

trial |>
  tbl_summary(by = arm,
              statistic = list(all_continuous() ~ "{mean} ({sd})")) |>
  add_p() |>
  add_overall()
```
Decision rule for Week 4
- Ask: one-group, two-group, paired, or many-group?
- Check: normal-ish, or should I use a rank-based test?
- Report: point estimate + 95% CI + effect size; p-value last, not first.
Common pitfalls
- Using the equal-variance t-test (the default in some languages; R's t.test defaults to Welch) when variances differ.
- Reporting OR when the audience expects RR (or vice versa).
- Forgetting that the p-value of a paired test depends on the pairing.
- Testing correlation after correlation until one comes out “significant” — a multiple-comparisons problem.