Gemini 2.5 Pro fine-tuned for early autism diagnosis from home videos

By PulseAugur Editorial · [1 sources] · 2026-06-29 04:00

Researchers have fine-tuned Google's Gemini 2.5 Pro model to analyze short home videos for early autism diagnosis. By training on 400 clinician-rated videos and focusing on 30 validated behavioral features, the model demonstrated a 40% improvement in inter-rater reliability with clinicians. The fine-tuned model also showed an emergent zero-shot capability, improving ASD diagnosis accuracy by 53% and achieving 77% overall accuracy with an AUC of 86%. This advancement suggests that multimodal large language models can be scaled to extract behavioral features for more accessible autism assessment. AI

IMPACT Enhances potential for early autism diagnosis through accessible video analysis, improving clinical outcomes.

RANK_REASON Research paper detailing a novel application of a multimodal LLM for a specific medical diagnosis. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CV →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Gemini 2.5 Pro fine-tuned for early autism diagnosis from home videos

COVERAGE [1]

arXiv cs.CV TIER_1 English(EN) · Mohammadmahdi Honarmand, Parnian Azizian, Aaron Kline, Kae Nurge, Zerin Nasrin Tumpa, Saimourya Surabhi, Kaitlyn Dunlap, Yang Qian, Ali Kargarandehkordi, Sameer Neupane, Peter Washington, Dennis P. Wall · 2026-06-29 04:00

Fine-tuning a multimodal large language model for clinician-grade autism behavioral scoring from short home videos

arXiv:2606.27484v1 Announce Type: new Abstract: Autism spectrum disorder (ASD) affects 1 in 31 US children, yet median age at diagnosis exceeds four years. Artificial intelligence pipelines that provide quantified diagnosis using easy to access observational data (e.g., home vide…

COVERAGE [1]

Fine-tuning a multimodal large language model for clinician-grade autism behavioral scoring from short home videos

RELATED ENTITIES

RELATED TOPICS