PulseAugur
EN
LIVE 13:28:49

LLMs Automate Real-Effort Tasks and Exhibit Political Bias

Two new research papers explore the evolving capabilities of Large Language Models (LLMs) and their implications. One study, "Artificial Effort," demonstrates that most real-effort tasks, previously used to measure human performance, can now be accurately solved by LLMs at minimal cost, raising concerns about their validity in unsupervised settings. The second paper, "How Far Will They Go? Red-Teaming Online Influence with Large Language Models," introduces a framework to audit the political steerability of open-source LLMs, finding that they often express left-leaning content and that their political range can be expanded through jailbreaking techniques. AI

IMPACT LLMs are increasingly capable of automating tasks previously thought to require human effort, and their political expressivity requires careful auditing to prevent misuse in influence campaigns.

RANK_REASON The cluster contains two academic papers published on arXiv, detailing research into LLM capabilities and potential misuse.

Read on Hugging Face Daily Papers →

AI-generated summary · Google Gemini · from 3 sources. How we write summaries →

LLMs Automate Real-Effort Tasks and Exhibit Political Bias

COVERAGE [3]

  1. arXiv cs.AI TIER_1 (CA) · Federico Belotti, Stefano Coniglio, Antonio Cosma, Francesco Fallucchi ·

    Artificial Effort

    arXiv:2605.23920v1 Announce Type: cross Abstract: Real-effort tasks, in which participants perform cognitively costly activities whose outcomes depend on actual performance, are widely used in experimental economics. Their validity, however, rests on the assumption that a human p…

  2. arXiv cs.AI TIER_1 English(EN) · Daniel C. Ruiz, Anna Serbina, Ashwin Rao, Emilio Ferrara, Luca Luceri ·

    How Far Will They Go? Red-Teaming Online Influence with Large Language Models

    arXiv:2605.22880v1 Announce Type: cross Abstract: As large language model (LLM)-based agents increasingly participate in online discourse, red-teaming their capacity to support political influence campaigns is critical for information integrity. In pursuit of this goal, we focus …

  3. Hugging Face Daily Papers TIER_1 English(EN) ·

    How Far Will They Go? Red-Teaming Online Influence with Large Language Models

    Open-source large language models exhibit varying political expressivity and vulnerability to jailbreak techniques, necessitating systematic red-teaming frameworks for assessing their potential misuse in influence campaigns.