New benchmark reveals LLMs exhibit significant framing bias in news summaries

By PulseAugur Editorial · [1 source] · 2026-05-22 04:00

Researchers have developed a new benchmark called Frame In, Frame Out (FIFO) to measure framing bias in news summaries generated by large language models. Results on the benchmark, which includes over 15,000 jury-annotated examples, show that LLM-generated summaries often exhibit higher framing rates than human-written ones. The effect was particularly pronounced in summaries related to science and public health, which marks framing as a critical but often overlooked aspect of summarization quality.

IMPACT Introduces a new evaluation benchmark for LLM-generated text that could influence how models are developed and deployed for news summarization.

RANK_REASON The cluster describes a new academic paper introducing a novel benchmark for evaluating LLM-generated content.

AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

arXiv cs.CL TIER_1 English(EN) · Valeria Pastorino, Nafise Sadat Moosavi · 2026-05-22 04:00

Frame In, Frame Out: Measuring Framing Bias in LLM-Generated News Summaries

arXiv:2505.05406v3. Abstract: News headlines and summaries shape how events are interpreted through selective emphasis and omission, a phenomenon commonly referred to as framing. Large language models are now routinely used to generate such content, yet exis…
