A recent benchmark test indicates that GPT-5.5 achieved a score of 85.3% on the ARC-AGI-2 benchmark. This result places the model's performance at a level comparable to human experts in this specific evaluation. The data was shared via a LinkedIn post. AI
Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →
IMPACT Sets a new performance baseline on the ARC-AGI-2 benchmark, potentially influencing future model evaluations.
RANK_REASON The cluster reports a specific benchmark result for a new model.