NutriMLLM: Multimodal Large Language Models for Dietary Micronutrient Analysis
Researchers have developed NutriMLLM, a new family of multimodal large language models specifically designed for analyzing dietary micronutrients from food images. Existing models proved unreliable for this task, often abstaining or providing inaccurate data. To overcome this, the team created a large synthetic dataset of over a million image-description-nutrient triplets by repurposing dietary recall data. Fine-tuning models like Qwen3-VL on this dataset resulted in NutriMLLM variants that demonstrate near-complete coverage of 65 micronutrients and competitive accuracy against leading proprietary models. AI
IMPACT Enables more accurate and comprehensive dietary analysis from food images, potentially improving personalized nutrition and public health surveillance.