FIKA-Bench: From Fine-grained Recognition to Fine-Grained Knowledge Acquisition
Researchers have introduced FIKA-Bench, a new benchmark designed to evaluate the ability of AI systems to acquire knowledge about unfamiliar objects, moving beyond simple visual recognition. The benchmark consists of 311 real-life instances that have been carefully curated to avoid leakage and ensure evidence grounding. Evaluations show that even state-of-the-art large multimodal models and agents struggle with this task, achieving only around 25% accuracy, highlighting the need for improved agent designs focused on fine-grained recognition and evidence verification. AI
IMPACT Introduces a benchmark to push AI beyond recognition towards active knowledge acquisition, potentially improving real-world object understanding.