Researchers have introduced MICo-150K, a large-scale dataset designed to improve multi-image composition (MICo) capabilities in AI models. The dataset addresses the challenge of synthesizing coherent images from multiple references by categorizing MICo into seven tasks and providing high-quality composite images. MICo-150K includes a unique subset for real-world image decomposition and recomposition, along with a benchmark suite and a new evaluation metric called Weighted-Ref-VIEScore. Fine-tuning models on this dataset has shown significant improvements in MICo tasks, with a baseline model, Qwen-MICo, demonstrating enhanced performance. AI
Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →
IMPACT Enhances AI's ability to generate complex images from multiple references, potentially improving creative tools and visual content generation.
RANK_REASON The cluster describes a new academic paper introducing a dataset and benchmark for multi-image composition.