Computer Vision

CSE471

Prof. Makarand Tapaswi + Prof. Charu Sharma • Spring 2025-26 • 4 credits

Unit 7 — Pose Estimation

Single-person and multi-person human pose. Heatmap regression, CPM, OpenPose with Part Affinity Fields, top-down vs bottom-up, and SMPL for 3D body recovery.

Pose Estimation — Heatmaps, CPM, OpenPose, SMPL

A bounding box around a person tells you almost nothing about what they're doing: dancing, fighting, lifting a mug, falling. The answer lives in the body itself, in the geometry of arms, legs, torso, and head. Pose estimation predicts joint (keypoint) locations and the limbs connecting them, so that downstream tasks (activity recognition, motion capture, gesture interfaces, avatar animation) become tractable. The core architectural shift is from direct keypoint regression, which maps an image straight to (x, y) coordinates and performs poorly for several reasons, to dense prediction via per-joint heatmaps, where the network outputs one spatial score map per joint and the peak of each map marks that joint's location.
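To make the heatmap idea concrete, here is a minimal sketch in plain Python of the two ends of the pipeline: building a Gaussian target map for one joint, and decoding a map back to a coordinate by taking its peak. The helper names (`gaussian_heatmap`, `decode_keypoint`), the 16x16 grid, the joint coordinates, and `sigma=1.5` are all illustrative assumptions, not values from the course. A real model would predict these maps with a CNN rather than read them off the ground truth.

```python
import math

def gaussian_heatmap(height, width, cx, cy, sigma=1.5):
    """Build a height x width target map with a Gaussian bump centred at (cx, cy)."""
    return [
        [math.exp(-((x - cx) ** 2 + (y - cy) ** 2) / (2.0 * sigma ** 2))
         for x in range(width)]
        for y in range(height)
    ]

def decode_keypoint(heatmap):
    """Return (x, y, score) of the highest-scoring cell in one joint's heatmap."""
    best_x, best_y, best_score = 0, 0, float("-inf")
    for y, row in enumerate(heatmap):
        for x, score in enumerate(row):
            if score > best_score:
                best_x, best_y, best_score = x, y, score
    return best_x, best_y, best_score

# Hypothetical ground-truth joints on a 16x16 grid (names and coordinates are made up).
joints = {"left_wrist": (4, 10), "right_wrist": (12, 3)}

# One heatmap per joint: this per-joint stack is the dense prediction target.
targets = {name: gaussian_heatmap(16, 16, x, y) for name, (x, y) in joints.items()}

# Decoding takes the argmax of each map independently.
decoded = {name: decode_keypoint(hm)[:2] for name, hm in targets.items()}
print(decoded)
```

Because each joint gets its own spatial map, the target keeps the same 2D layout as the image, which is what lets a fully convolutional network predict it. The score at the peak can also serve as a rough per-joint confidence.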