Frame In, Frame Out: Measuring Framing Bias in LLM-Generated News Summaries
Researchers have developed a new benchmark called Frame In, Frame Out (FIFO) to measure framing bias in news summaries generated by large language models. The benchmark, which includes over 15,000 jury-annotated examples, found that LLM-generated summaries often exhibit higher framing rates than human-written ones. This bias was particularly pronounced in summaries related to science and public health, highlighting framing as a critical but often overlooked aspect of summarization quality. AI
IMPACT Highlights a new evaluation metric for LLM-generated text, potentially influencing future model development and deployment in news summarization.