Courses/Behavioral Research: Statistical Methods

Behavioral Research: Statistical Methods

CG3.402

Vinoo Alluri•Monsoon 2025-26•4 credits

The statistical methods that turn behavioural data into reliable inferences. Builds from probabilistic intuition (Bayes, base rate) through hypothesis testing, ANOVA, regression, Bayesian methods, and logistic regression. Heavy emphasis on which test to pick, reporting effect sizes, and avoiding p-hacking.

Syllabus

Unit 1 — Why Do Statistics? (Biases & Base Rates)

1 chapters

Why human intuition is unreliable for probabilistic reasoning, the biases statistics protects against (belief bias, confirmation bias, Simpson's paradox, base-rate fallacy), and Bayes' rule as the formal corrective.

Unit 2 — Research Design & Measurement

1 chapters

Operational definitions, the four scales of measurement (NOIR), reliability (test-retest, inter-rater, parallel forms, internal consistency), and validity (internal, external, construct, face, ecological).

Unit 3 — Probability & Distributions

1 chapters

Frequentist vs Bayesian probability, PDFs and CDFs, the core distributions (Bernoulli, Binomial, Normal, t, χ², F), the Central Limit Theorem, sampling distributions, and the Law of Large Numbers.

Unit 4 — Data Visualization

1 chapters

Why visualise before you analyse (Anscombe's quartet), matching plots to data types, and the catalogue of plots: histogram, boxplot, scatter, bar/pie, mosaic, violin/raindrop, heatmap — plus pitfalls (rainbow palettes, truncated axes, dual-y).

Unit 5 — Descriptive Statistics

1 chapters

Central tendency (mean, median, mode — robustness to outliers), dispersion (range, IQR, variance, SD, MAD), standardisation (z-scores), and Bessel's correction.

Unit 6 — Correlation & Reliability Quantified

1 chapters

Pearson r (continuous-continuous), Spearman ρ (rank-based, nonlinear monotone), Kendall τ, partial vs semi-partial correlations. Cohen's κ, Cronbach's α for reliability.

Unit 7 — Hypothesis Testing & NHST

1 chapters

Steps of NHST, Type I (α) and Type II (β) errors, statistical power (1 − β), Cohen's d, sample-size planning, one- vs two-tailed tests, t-tests (one-sample, independent, paired, Welch).

Unit 8 — Multiple Comparisons (FWER, FDR)

1 chapters

Family-wise type-I inflation, Bonferroni and Holm corrections (FWER), Benjamini-Hochberg (FDR), permutation tests, and how multiple comparisons drive the replication crisis.

Unit 9 — Non-parametric & Categorical Tests

1 chapters

Chi-square (goodness-of-fit and independence), Mann-Whitney U, Wilcoxon signed-rank, Kruskal-Wallis, Friedman, Spearman ρ, McNemar's test. When parametric assumptions fail.

Unit 10 — Multicollinearity, PCA & Factor Analysis

1 chapters

VIF and what multicollinearity does to regression coefficients; PCA for dimensionality reduction; EFA vs CFA for psychometric scale validation; scree plots and parallel analysis.

Unit 11 — ANOVA (one-way, RM, two-way)

1 chapters

Why ANOVA over multiple t-tests; SS_total = SS_between + SS_within; F = MS_between/MS_within; one-way, repeated-measures (sphericity, Mauchly, Greenhouse-Geisser), two-way (main effects + interaction), post-hoc (Tukey HSD).

Unit 12 — Regression (Linear, Multiple)

1 chapters

Simple regression Y = β₀ + β₁X + ε; OLS minimises sum of squared residuals; R² vs adjusted R²; LINeM assumptions (linearity, independence, normality, equal variance, no multicollinearity); residual diagnostics; categorical predictors via dummy coding.

Unit 13 — Bayesian Statistics

1 chapters

Bayes' theorem with prior, likelihood, posterior; Bayes Factor and its interpretation (3–10 moderate · 10–30 strong · >30 very strong); BayesFactor R package; Bayesian advantages (evidence for null, optional stopping, priors).

Unit 14 — GLMs & Logistic Regression

1 chapters

Why OLS fails for binary outcomes; the logit link; GLM components (random, systematic, link); odds ratios; maximum likelihood estimation; interpreting coefficients on the log-odds scale.

Unit 15 — Rapid Revision & Exam Strategy

1 chapters

Maya's final walk-through: the 'Which test do I use?' decision tree, common confusions to memorise (PCA vs FA, FWER vs FDR, reliability vs validity), the report checklist (test statistic, df, p, effect size, CI), and exam-day rules of thumb.

Weightage

Why do statistics?4%

Research Design & Measurement6%

Probability & Distributions8%

Data Visualization4%

Descriptive Statistics4%

Correlation & Reliability6%

Hypothesis Testing12%

Multiple Comparisons6%

Non-parametric & Categorical Tests8%

Multicollinearity, PCA & FA8%

ANOVA12%

Regression12%

Bayesian Statistics6%

GLM / Logistic Regression4%

Rapid Revision0%

Exam pattern

Typical: 100-mark end-sem (2 hours). Section A — 20 × 1-mark MCQs. Section B — 10 × 2-mark MSQs (multiple correct, all required). Section C — 6 × 5-mark short descriptive. Section D — 3 × 10-mark long descriptive. Marks split roughly evenly across NHST, ANOVA / regression, Bayesian, and design / measurement.

Important dates

Mid-sem2025-09 (TBC)
End-sem2025-11 (TBC)
Project submissionsRolling

Professor notes

Heavy emphasis on 'which test to pick' — questions usually combine an IV/DV scale + design and ask for a justified choice.
Bayes rule + base-rate fallacy appear in the opening lecture and in almost every paper.
ANOVA partition (SS_total = SS_between + SS_within) and interpretation are recurring exam targets.
Effect size reporting is mandatory — Cohen's d, η², R², OR. p alone is half a mark.
Mauchly's test + Greenhouse-Geisser appear whenever RM-ANOVA does.
PCA vs FA, FWER vs FDR, reliability vs validity are the 'spot the confusion' classics.