Peek-a-Boo Reasoning: Contrastive Region Masking in MLLMs
arXiv:2512.08976v1 Announce Type: new Abstract: We introduce Contrastive Region Masking (CRM), a training-free diagnostic that reveals how multimodal large language models (MLLMs) depend on specific visual regions at each step of chain-of-thought (CoT) reasoning. Unlike prior approaches limited to…
