LLMs Automate Real-Effort Tasks and Exhibit Political Bias

By PulseAugur Editorial · [3 sources] · 2026-05-20 19:25

Two new research papers explore the evolving capabilities of Large Language Models (LLMs) and their implications. One study, "Artificial Effort," demonstrates that most real-effort tasks, previously used to measure human performance, can now be accurately solved by LLMs at minimal cost, raising concerns about their validity in unsupervised settings. The second paper, "How Far Will They Go? Red-Teaming Online Influence with Large Language Models," introduces a framework to audit the political steerability of open-source LLMs, finding that they often express left-leaning content and that their political range can be expanded through jailbreaking techniques. AI

IMPACT LLMs are increasingly capable of automating tasks previously thought to require human effort, and their political expressivity requires careful auditing to prevent misuse in influence campaigns.

RANK_REASON The cluster contains two academic papers published on arXiv, detailing research into LLM capabilities and potential misuse.

Read on Hugging Face Daily Papers →

paper
safety

AI-generated summary · Google Gemini · from 3 sources. How we write summaries →

LLMs Automate Real-Effort Tasks and Exhibit Political Bias

COVERAGE [3]

arXiv cs.AI TIER_1 (CA) · Federico Belotti, Stefano Coniglio, Antonio Cosma, Francesco Fallucchi · 2026-05-26 04:00

Artificial Effort

arXiv:2605.23920v1 Announce Type: cross Abstract: Real-effort tasks, in which participants perform cognitively costly activities whose outcomes depend on actual performance, are widely used in experimental economics. Their validity, however, rests on the assumption that a human p…
arXiv cs.AI TIER_1 English(EN) · Daniel C. Ruiz, Anna Serbina, Ashwin Rao, Emilio Ferrara, Luca Luceri · 2026-05-25 04:00

How Far Will They Go? Red-Teaming Online Influence with Large Language Models

arXiv:2605.22880v1 Announce Type: cross Abstract: As large language model (LLM)-based agents increasingly participate in online discourse, red-teaming their capacity to support political influence campaigns is critical for information integrity. In pursuit of this goal, we focus …
Hugging Face Daily Papers TIER_1 English(EN) · 2026-05-20 19:25

How Far Will They Go? Red-Teaming Online Influence with Large Language Models

Open-source large language models exhibit varying political expressivity and vulnerability to jailbreak techniques, necessitating systematic red-teaming frameworks for assessing their potential misuse in influence campaigns.

COVERAGE [3]

Artificial Effort

How Far Will They Go? Red-Teaming Online Influence with Large Language Models

How Far Will They Go? Red-Teaming Online Influence with Large Language Models

RELATED ENTITIES

RELATED TOPICS