Behavioral Research: Statistical Methods
CG3.402The statistical methods that turn behavioural data into reliable inferences. Builds from probabilistic intuition (Bayes, base rate) through hypothesis testing, ANOVA, regression, Bayesian methods, and logistic regression. Heavy emphasis on which test to pick, reporting effect sizes, and avoiding p-hacking.
Syllabus
Unit 1 — Why Do Statistics? (Biases & Base Rates)
1 chaptersWhy human intuition is unreliable for probabilistic reasoning, the biases statistics protects against (belief bias, confirmation bias, Simpson's paradox, base-rate fallacy), and Bayes' rule as the formal corrective.
Unit 2 — Research Design & Measurement
1 chaptersOperational definitions, the four scales of measurement (NOIR), reliability (test-retest, inter-rater, parallel forms, internal consistency), and validity (internal, external, construct, face, ecological).
Unit 3 — Probability & Distributions
1 chaptersFrequentist vs Bayesian probability, PDFs and CDFs, the core distributions (Bernoulli, Binomial, Normal, t, χ², F), the Central Limit Theorem, sampling distributions, and the Law of Large Numbers.
Unit 4 — Data Visualization
1 chaptersWhy visualise before you analyse (Anscombe's quartet), matching plots to data types, and the catalogue of plots: histogram, boxplot, scatter, bar/pie, mosaic, violin/raindrop, heatmap — plus pitfalls (rainbow palettes, truncated axes, dual-y).
Unit 5 — Descriptive Statistics
1 chaptersCentral tendency (mean, median, mode — robustness to outliers), dispersion (range, IQR, variance, SD, MAD), standardisation (z-scores), and Bessel's correction.
Unit 6 — Correlation & Reliability Quantified
1 chaptersPearson r (continuous-continuous), Spearman ρ (rank-based, nonlinear monotone), Kendall τ, partial vs semi-partial correlations. Cohen's κ, Cronbach's α for reliability.
Unit 7 — Hypothesis Testing & NHST
1 chaptersSteps of NHST, Type I (α) and Type II (β) errors, statistical power (1 − β), Cohen's d, sample-size planning, one- vs two-tailed tests, t-tests (one-sample, independent, paired, Welch).
Unit 8 — Multiple Comparisons (FWER, FDR)
1 chaptersFamily-wise type-I inflation, Bonferroni and Holm corrections (FWER), Benjamini-Hochberg (FDR), permutation tests, and how multiple comparisons drive the replication crisis.
Unit 9 — Non-parametric & Categorical Tests
1 chaptersChi-square (goodness-of-fit and independence), Mann-Whitney U, Wilcoxon signed-rank, Kruskal-Wallis, Friedman, Spearman ρ, McNemar's test. When parametric assumptions fail.
Unit 10 — Multicollinearity, PCA & Factor Analysis
1 chaptersVIF and what multicollinearity does to regression coefficients; PCA for dimensionality reduction; EFA vs CFA for psychometric scale validation; scree plots and parallel analysis.
Unit 11 — ANOVA (one-way, RM, two-way)
1 chaptersWhy ANOVA over multiple t-tests; SS_total = SS_between + SS_within; F = MS_between/MS_within; one-way, repeated-measures (sphericity, Mauchly, Greenhouse-Geisser), two-way (main effects + interaction), post-hoc (Tukey HSD).
Unit 12 — Regression (Linear, Multiple)
1 chaptersSimple regression Y = β₀ + β₁X + ε; OLS minimises sum of squared residuals; R² vs adjusted R²; LINeM assumptions (linearity, independence, normality, equal variance, no multicollinearity); residual diagnostics; categorical predictors via dummy coding.
Unit 13 — Bayesian Statistics
1 chaptersBayes' theorem with prior, likelihood, posterior; Bayes Factor and its interpretation (3–10 moderate · 10–30 strong · >30 very strong); BayesFactor R package; Bayesian advantages (evidence for null, optional stopping, priors).
Unit 14 — GLMs & Logistic Regression
1 chaptersWhy OLS fails for binary outcomes; the logit link; GLM components (random, systematic, link); odds ratios; maximum likelihood estimation; interpreting coefficients on the log-odds scale.
Unit 15 — Rapid Revision & Exam Strategy
1 chaptersMaya's final walk-through: the 'Which test do I use?' decision tree, common confusions to memorise (PCA vs FA, FWER vs FDR, reliability vs validity), the report checklist (test statistic, df, p, effect size, CI), and exam-day rules of thumb.
Weightage
Exam pattern
Typical: 100-mark end-sem (2 hours). Section A — 20 × 1-mark MCQs. Section B — 10 × 2-mark MSQs (multiple correct, all required). Section C — 6 × 5-mark short descriptive. Section D — 3 × 10-mark long descriptive. Marks split roughly evenly across NHST, ANOVA / regression, Bayesian, and design / measurement.
Important dates
- Mid-sem2025-09 (TBC)
- End-sem2025-11 (TBC)
- Project submissionsRolling
Professor notes
- Heavy emphasis on 'which test to pick' — questions usually combine an IV/DV scale + design and ask for a justified choice.
- Bayes rule + base-rate fallacy appear in the opening lecture and in almost every paper.
- ANOVA partition (SS_total = SS_between + SS_within) and interpretation are recurring exam targets.
- Effect size reporting is mandatory — Cohen's d, η², R², OR. p alone is half a mark.
- Mauchly's test + Greenhouse-Geisser appear whenever RM-ANOVA does.
- PCA vs FA, FWER vs FDR, reliability vs validity are the 'spot the confusion' classics.