A new arXiv paper introduces NCP-ExploreToM, a framework for evaluating the non-conversational Theory of Mind (ToM) capabilities of Large Language Models (LLMs). The research assesses how well models can induce specific belief states in others through actions rather than dialogue. Across 600 task instances, GPT-5 performed strongly, succeeding in approximately 80% of tasks and outperforming human participants in this agentic setting, though humans remained more robust overall. The study also noted that all evaluated models, like humans, were better at inducing true beliefs than false beliefs, suggesting potential for alignment efforts.
AI IMPACT Highlights emerging social-reasoning capabilities in LLMs and underscores the need for agentic evaluations for safety and alignment.
AI-generated summary · Google Gemini · from 2 sources.