Computer Vision
CSE471 MCQs
One correct option. Pick, then check.
Faster R-CNN replaces Selective Search with which component?
YOLO v1 default output tensor for PASCAL VOC is:
λ_coord and λ_noobj in YOLO loss are typically:
Mask R-CNN's mask head produces:
Dilated/atrous convolutions enlarge the receptive field by:
The number of OpenPose output channels for K keypoints and L limb types is:
SMPL's pose parameter θ has dimensionality:
PointNet achieves permutation invariance via:
Total learnable parameters per 3DGS Gaussian (degree-3 SH):
Σ = R·S·Sᵀ·Rᵀ in 3DGS guarantees:
Why is QKᵀ divided by √dₖ?
ViT-B/16 on 224² images yields how many tokens entering the encoder (including [CLS])?
Which SSL method does NOT use negatives?
MAE's mask ratio is:
RoPE's key property is:
In PaliGemma, which component is randomly initialised?
I3D's 'inflation' trick:
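For the "check" step, a few of the numeric questions above (the YOLO v1 output tensor, the ViT-B/16 token count, and the per-Gaussian parameter count in 3DGS) reduce to simple arithmetic. The following is a minimal sketch, not a reference implementation. It assumes the commonly cited configurations: YOLO v1 on PASCAL VOC with a 7×7 grid, 2 boxes per cell and 20 classes; a ViT that splits a square image into non-overlapping patches and prepends one [CLS] token; and a 3DGS Gaussian parameterised by position, scale, a rotation quaternion, opacity, and RGB spherical-harmonic coefficients. The function names are illustrative and do not come from any library.

```python
def yolo_v1_output_shape(grid=7, boxes=2, classes=20):
    """Output shape S x S x (B*5 + C).

    Each of the B boxes carries 5 values (x, y, w, h, confidence);
    the C class scores are shared per grid cell.
    """
    return (grid, grid, boxes * 5 + classes)


def vit_token_count(image_size=224, patch_size=16, cls_token=True):
    """Tokens entering the encoder for a square image.

    Assumes non-overlapping patches and an image size divisible
    by the patch size.
    """
    patches_per_side = image_size // patch_size
    return patches_per_side ** 2 + (1 if cls_token else 0)


def gaussian_param_count(sh_degree=3):
    """Learnable values per Gaussian under the assumed parameterisation.

    position (3) + scale (3) + rotation quaternion (4) + opacity (1)
    + spherical harmonics: (degree + 1)^2 coefficients per RGB channel.
    """
    sh_values = (sh_degree + 1) ** 2 * 3
    return 3 + 3 + 4 + 1 + sh_values


if __name__ == "__main__":
    print(yolo_v1_output_shape())   # (7, 7, 30)
    print(vit_token_count())        # 197
    print(gaussian_param_count())   # 59
```

Changing the defaults shows how each answer depends on its configuration; for example, `vit_token_count(patch_size=32)` gives the count for a patch size of 32 on the same 224×224 input.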