Two AI models, DeepSeek and Google's Gemini, achieved a score of 66 points on a Shanghai high school entrance exam essay question. The prompt asked students to consider how technology reshapes both the world and human imagination. A media outlet, Kechuangban Daily, organized this evaluation. AI
IMPACT Demonstrates AI's growing capabilities in creative writing and standardized testing.
RANK_REASON AI models evaluated on an academic benchmark (exam essay). [lever_c_demoted from research: ic=1 ai=1.0]
Read on Mastodon — mastodon.social →
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →