Behavioral Research: Statistical Methods
CG3.402Vinoo Alluri•Monsoon 2025-26•4 credits
200-mark mock paper (Set 2) · Paper TWO
Duration: 180 min • Max marks: 200
Section A — 0.5 mark MCQs (20 × 0.5 = 10 marks)
10 marks- 1.Distance walked in kilometers is which scale? (a) Nominal (b) Ordinal (c) Interval (d) Ratio0.5 m
- 2.In a normal distribution, the **mean, median, and mode** are: (a) Different (b) All equal (c) Cannot be determined (d) Equal only when SD = 10.5 m
- 3.A study with **internal validity but low external validity** is: (a) Causally interpretable but limited in generalization (b) Both strong (c) Both weak (d) Easily replicable0.5 m
- 4.A test with α = 0.10 (rather than 0.05) is: (a) More conservative (b) More liberal — easier to reject H₀ (c) Always preferred (d) Equivalent0.5 m
- 5.Kurtosis of a Normal distribution is: (a) 0 (b) 1 (c) 3 (excess = 0) (d) Always positive0.5 m
- 6.A scatter plot with data falling perfectly on a downward line has Pearson r = (a) +1 (b) −1 (c) 0 (d) Cannot say0.5 m
- 7.Adira draws 30 sample means from a uniform population. The sampling distribution of those means is: (a) Uniform (b) Approximately normal by CLT (c) Bimodal (d) Skewed0.5 m
- 8.A research design where participants know they're being studied is: (a) Naturalistic observation (b) Experimental (c) Reactive (d) Always invalid0.5 m
- 9."Generalizability across measures" most relates to: (a) Test-retest reliability (b) Construct validity (c) Inter-rater reliability (d) Internal consistency0.5 m
- 10.Aagam's data show **kurtosis = 5**. Distribution is: (a) Platykurtic (b) Mesokurtic (c) Leptokurtic / heavy-tailed (d) Skewed0.5 m
- 11.Regression with **R² = 0.95**: (a) Strong fit (b) Overfitting (c) Always biased (d) Insufficient data0.5 m
- 12.**Fisher's exact test** appropriate when: (a) Chi-square assumptions fail (expected cells < 5) (b) Sample size huge (c) Always for proportions (d) Only for paired data0.5 m
- 13.A within-subjects design tests whether **scores change** as a function of: (a) Different participants (b) Conditions within the same participants (c) Random sampling (d) Sample size0.5 m
- 14.McNemar test is used for: (a) Two independent proportions (b) Two **related** proportions (paired binary) (c) ANOVA (d) Continuous DVs0.5 m
- 15.Cohen's d between two means: (a) (M₁ − M₂) / SD_pooled (b) (M₁ − M₂) / SE (c) M₁ − M₂ (d) (M₁ − M₂) × √n0.5 m
- 16.**Eta-squared (η²)** is: (a) Proportion of total variance explained by effect in ANOVA (b) Effect size for chi-square (c) Correlation coefficient (d) Always > 10.5 m
- 17.99% CI is **wider** than 95% CI on same data because: (a) Higher confidence needs wider interval (b) Different data (c) Larger sample (d) Random0.5 m
- 18.**Mode** for a continuous distribution: (a) Highest density point (b) Mean (c) Median (d) Always equal to mean0.5 m
- 19.A **leverage** point in regression has: (a) Extreme predictor values (b) High residual (c) Always influential (d) Outlier in DV only0.5 m
- 20.A **conjugate prior** in Bayesian inference yields: (a) Non-informative posterior (b) Posterior in same family as prior (c) Frequentist result (d) Always normal0.5 m
Section B — 1 mark MCQs (20 × 1 = 20 marks)
20 marks- 1.Mehek compares **stress** in 4 hospital shifts (Morning, Afternoon, Evening, Night) using **different nurses** in each (between-subjects). Continuous, normal. Best test: (a) RM ANOVA (b) One-way ANOVA (c) Mixed ANOVA (d) Friedman1 m
- 2.A 4 × 5 chi-square test of independence, n = 200. df and expected-cell concern: (a) df = 20; n > 200 needed (b) df = 12; expected counts must be ≥ 5 (c) df = 9 (d) df = 12; sample fine1 m
- 3.Manvi has a 3-level **ordinal DV** and a continuous IV. Best analysis: (a) Linear regression (b) Ordinal logistic regression (c) Chi-square (d) Pearson r1 m
- 4.Regression's F-test F(3, 96) = 5.2, p = .002 indicates: (a) Overall model significant; at least one predictor matters (b) Each predictor significant (c) Model fits perfectly (d) Multicollinearity1 m
- 5.Disha sees Cook's distance > 1 for one observation: (a) Normality (b) Multicollinearity (c) Highly influential observation (d) Insufficient data1 m
- 6.`Outcome = β₀ + β₁X + β₂X² + ε` tests: (a) Linear only (b) **Non-linear (quadratic) relationship** (c) Multicollinearity (d) Heteroscedasticity1 m
- 7.A **2 × 2 × 3 factorial ANOVA**. Possible main + interaction effects: (a) 5 (b) 6 (c) 7 (d) 81 m
- 8.Friedman's test statistic is approximately χ² with df = (a) n − 1 (b) k − 1 (k = conditions) (c) (r−1)(c−1) (d) n − k1 m
- 9.Anaya models `time_to_event` (days until customer churn). Best model: (a) Linear regression (b) Logistic (c) Survival analysis / Cox proportional hazards (d) ANOVA1 m
- 10.Logistic regression's pseudo-R² (e.g., McFadden's): (a) True variance explained (b) Interpreted similarly to R² but different scale; 0.2–0.4 = good fit (c) Always 1 (d) Cannot compute1 m
- 11.Yashvi: 5 items, Cronbach's α = 0.95: (a) Excellent reliability (b) Possibly redundant items (c) Both — excellent + may indicate redundancy (d) Poor reliability1 m
- 12.Two-way ANOVA: Factor A (2 levels) × Factor B (3 levels), n = 15 per cell. Total N and df_total: (a) N = 90, df = 89 (b) N = 15, df = 14 (c) N = 30, df = 29 (d) N = 6, df = 51 m
- 13.Mann-Whitney U test tests: (a) Means (b) Stochastic dominance — whether one distribution gives larger values (c) Variances (d) Independence1 m
- 14.Confounders are best controlled by: (a) Randomization in experiments (b) Statistical adjustment in observational studies (c) Both, with randomization being more effective (d) Increasing n1 m
- 15.AIC = 250 vs AIC = 245. Lower preferred if: (a) Δ AIC > 10 (strong) (b) Δ = 5 (substantial) (c) Δ = 5 (some preference) (d) Equally good1 m
- 16.Hriday reports BF₁₀ = 0.4: (a) Anecdotal evidence for H₀ (b) Moderate for H₁ (c) Anecdotal for H₁ (d) Strong for H₀1 m
- 17.Mediator vs moderator: (a) Mediator explains mechanism; moderator changes effect's strength/direction (b) Same thing (c) Always categorical (d) Only used in ANOVA1 m
- 18.Multilevel modeling (LMM) preferred when: (a) Observations independent (b) Data nested (students in classrooms, repeated measures) (c) DV categorical (d) Small sample1 m
- 19.Kavin's RM-ANOVA shows sphericity violation. Greenhouse-Geisser with ε = 0.65, original df = (3, 60): (a) (3, 60) (b) (1.95, 39) (c) (2, 60) (d) (3, 39)1 m
- 20.Eshan: p = 0.049, d = 1.5, n = 8. Best interpretation: (a) Robust and confirmed (b) Significant, but with n = 8, d = 1.5 likely inflated (winner's curse); replicate (c) Type I error (d) No effect1 m
Section C — 2 mark short answers (15 × 2 = 30 marks)
30 marks- 1.State **Bayes' theorem** with each term named.2 m
- 2.Differentiate **between-subjects** and **within-subjects** designs, with one advantage of each.2 m
- 3.Define **mediation**; contrast with **moderation** (with one-line examples).2 m
- 4.Define **convergent** and **discriminant** validity with example pair.2 m
- 5.Why is **stratified random sampling** often preferred?2 m
- 6.Reyaan studies whether an app affects daily steps. Identify IV, DV, scales, and design maximizing internal validity.2 m
- 7.Why does **double-blind** matter in behavioral research?2 m
- 8.Define **regression to the mean** with example.2 m
- 9.State **homoscedasticity** assumption and how to check.2 m
- 10.Define **interaction effect** in ANOVA with one-line example.2 m
- 11.Define **p-hacking** with two specific practices.2 m
- 12.Define **publication bias** and one consequence for meta-analysis.2 m
- 13.Two purposes of a **manipulation check** in an experiment.2 m
- 14.Difference between **FWER** and **FDR**.2 m
- 15.Two differences between **paired t-test** and **Wilcoxon signed-rank**.2 m
Section D — 5 mark questions (12 × 5 = 60 marks)
60 marksSection E — 10 mark long descriptive (8 × 10 = 80 marks)
80 marksTrack your attempt locally — score and time are recorded in your browser. (Coming soon: timed-attempt mode.)