Researchers have introduced the Hummus Dataset, a new collection of 1,000 image-caption pairs designed to evaluate multimodal large language models (MLLMs) on their understanding of humorous multimodal metaphors. The dataset, inspired by theories of humor and metaphor, was created using an expert-developed annotation scheme. Initial experiments using the Hummus Dataset revealed that current MLLMs struggle to effectively integrate visual and textual information to comprehend humorous multimodal metaphors. AI
IMPACT Highlights current limitations in AI's ability to understand nuanced humor and metaphor, indicating areas for future model development.
RANK_REASON The cluster contains an academic paper introducing a new dataset and associated research findings. [lever_c_demoted from research: ic=1 ai=1.0]
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →