A new study compared large language models, specifically Gemini Pro, with mental health professionals in diagnosing personality disorders from autobiographical narratives. The LLMs achieved higher overall diagnostic scores, particularly for Borderline Personality Disorder, but significantly underdiagnosed Narcissistic Personality Disorder. The models gave detailed, pattern-focused justifications, whereas the human experts took a more concise, patient-centered approach. The results point to potential biases and reliability concerns in LLM clinical assessments.
Summary written by gemini-2.5-flash-lite from 1 source.
IMPACT LLMs show potential in clinical narrative analysis but require careful validation due to bias and reliability issues.
RANK_REASON Academic paper evaluating LLM performance against human experts on a specific clinical task.