Saral Shiksha Yojna

Computer Vision

CSE471
Prof. Makarand Tapaswi + Prof. Charu Sharma, Spring 2025-26, 4 credits

Revision Notes

Chapter-wise distilled notes. The aim is enough depth to reconstruct each topic, not a textbook rewrite.

Unit 1 — Introduction & Foundations

Unit 2 — Digital Image Processing Recap

Unit 3 — Machine Learning Recap

Unit 4 — Convolutional Neural Networks (CNNs)

Unit 5 — Object Detection

Unit 6 — Dense Prediction: Segmentation + Depth

Unit 7 — Pose Estimation

Unit 8 — 3D Data (PointNet, DGCNN, MeshCNN)

Unit 9 — NeRF & 3D Gaussian Splatting

Unit 10 — Attention & Transformers

Unit 11 — Vision Transformers (ViT)

Unit 12 — SSL: Contrastive (SimCLR, MoCo, BYOL, CLIP)

Unit 13 — SSL: DINO, MAE, JEPA

Unit 14 — Transformer Advances (ViT-5 era)

Unit 15 — Multimodal LLMs (PaliGemma / Qwen2-VL / Gemma 4)

Unit 16 — Video Understanding