LLMs now automate most real-effort tasks used in economic studies

作者 PulseAugur 编辑部 · [2 个来源] · 2026-05-25 04:00

A new study published on arXiv investigates the impact of Large Language Models (LLMs) on real-effort tasks, commonly used in economic experiments. Researchers found that most of these tasks, which require cognitive effort and depend on actual performance, can now be accurately solved by LLMs at minimal cost. The study highlights that performance on these tasks improves with newer AI model generations, and even mid-tier models are rapidly automating them. This raises concerns about the validity of real-effort tasks in unsupervised settings, as observed performance may no longer reflect genuine human effort due to the ease of outsourcing to AI. AI

影响 Challenges the validity of traditional economic experiments relying on human effort, necessitating new methodologies to account for AI automation.

排序理由 The cluster contains an academic paper detailing research findings. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.AI TIER_1 (CA) · Federico Belotti, Stefano Coniglio, Antonio Cosma, Francesco Fallucchi · 2026-05-26 04:00

Artificial Effort

arXiv:2605.23920v1 Announce Type: cross Abstract: Real-effort tasks, in which participants perform cognitively costly activities whose outcomes depend on actual performance, are widely used in experimental economics. Their validity, however, rests on the assumption that a human p…
arXiv cs.AI TIER_1 English(EN) · Daniel C. Ruiz, Anna Serbina, Ashwin Rao, Emilio Ferrara, Luca Luceri · 2026-05-25 04:00

它们会走多远？利用大型语言模型进行在线影响力红队测试

arXiv:2605.22880v1 Announce Type: cross Abstract: As large language model (LLM)-based agents increasingly participate in online discourse, red-teaming their capacity to support political influence campaigns is critical for information integrity. In pursuit of this goal, we focus …

报道来源 [2]

Artificial Effort

它们会走多远？利用大型语言模型进行在线影响力红队测试

相关实体

相关话题