Saral Shiksha Yojna

Computer Vision

CSE471
Prof. Makarand Tapaswi + Prof. Charu Sharma · Spring 2025-26 · 4 credits

Revision Notes

Chapter-wise distilled notes. The aim is enough depth to reconstruct each topic, not a textbook rewrite.

Unit 1 — Object Detection

Unit 2 — Dense Prediction: Segmentation + Depth

Unit 3 — Pose Estimation

Unit 4 — 3D Data (PointNet, DGCNN, MeshCNN)

Unit 5 — NeRF & 3D Gaussian Splatting

Unit 6 — Attention & Transformers

Unit 7 — Vision Transformers (ViT)

Unit 8 — SSL: Contrastive (SimCLR, MoCo, BYOL, CLIP)

Unit 9 — SSL: DINO, MAE, JEPA

Unit 10 — Transformer Advances (ViT-5 era)

Unit 11 — Multimodal LLMs (PaliGemma / Qwen2-VL / Gemma 4)

Unit 12 — Video Understanding