TL;DR: We evaluate object-centric representations with VLMs for real visual reasoning and introduce a unified metric that jointly measures localization and representation usefulness.
@inproceedings{singh2026evaluating,
author = {Krishnakant Singh and Simone Schaub-Meyer and Stefan Roth},
title = {Evaluating Object-Centric Models beyond Object Discovery},
booktitle = {arXiv:2602.07532 [cs.CV]},
year = {2026},
}
Acknowledgments: This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 program (grant agreement No. 866008). The project was also supported in part by the State of Hesse through the “The Third Wave of Artificial Intelligence (3AI)” project.