Still can't play chess
A user tested Anthropic's latest AI model's ability to play chess, a task they believe is a benchmark for AGI. While the model demonstrated impressive reasoning and understanding of moves, it ultimately failed to keep track of the game and lost. This user remains skeptical about LLMs achieving AGI until they can consistently follow complex game rules. AI
IMPACT User skepticism persists regarding LLM capabilities for complex reasoning tasks like chess, indicating a gap before AGI is considered achievable.