Anthropic's Claude 4.8 Max model demonstrated an ability to play chess by controlling a user's computer. The AI successfully executed moves and achieved checkmate against a friendly bot. While its gameplay was slow, it notably avoided making any illegal moves during the match.
IMPACT Shows potential for LLMs to control external applications for complex tasks, though speed remains a limitation.
RANK_REASON Demonstration of a model's capability to interact with external systems and perform a complex task.
AI-generated summary · Google Gemini · from 1 source.