Mechanistic Insights into Functional Sparsity in Multimodal LLMs via CoRe Heads

Ruoxi Sun, Quantong Qiu, Juntao Li, Zecheng Tang, Yihang Lou, Min Zhang

Jun 5, 2026 at 04:00

2 Views

0 Comments

arXiv:2606.05843v1 Announce Type: cross Abstract: While Multimodal Large Language Models (MLLMs) demonstrate remarkable proficiency on complex vision-language tasks, the mechanisms by which they extract query-relevant visual features from complex, noisy contexts remain opaque. In this paper, we present an in-depth interpretability study that...

Read the full article at the source.

Read Original Article

Was this helpful?