Computer Vision
CSE471Prof. Makarand Tapaswi + Prof. Charu Sharma•Spring 2025-26•4 credits
Unit 12 — Video Understanding
Beyond images × T: action recognition, temporal localisation, 3D CNNs (I3D), Two-Stream, SlowFast, ViViT, and TimeSformer's divided space-time attention.