@emollick Seems GPT-5.2 reaches expert level in peer review: 45 scientists took 469 hours evaluating human & AI reviews on 82 papers. "Surprisingly, current AI
A recent evaluation suggests that GPT-5.2 performs at an expert level in scientific peer review. In the study, 45 scientists spent 469 hours evaluating human and AI reviews of 82 papers and found the AI reviews competitive with those of top human reviewers. The AI still has weaknesses, however, which suggests that peer review works best as a hybrid of AI and human reviewers.
IMPACT: AI models are becoming competitive with human experts in complex tasks like scientific peer review, suggesting potential for increased efficiency and quality in research.