PulseAugur
EN
LIVE 21:16:30

LLM language alignment fails to resist Russian disinformation

A new research paper reveals that large language models fine-tuned for specific linguistic communities do not necessarily align with the expected political orientation. The study found that a Ukrainian-oriented model was less resistant to Russian disinformation when queried in Russian, while a Russian-oriented model showed stronger rejection of such narratives. The research suggests that factors like corpus composition and prompt format are more influential than nominal cultural alignment in determining an LLM's susceptibility to disinformation. AI

IMPACT Challenges the assumption that culturally aligned LLMs inherently resist disinformation, suggesting a need for more robust evaluation methods.

RANK_REASON Academic paper analyzing LLM behavior [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. arXiv cs.CL TIER_1 English(EN) · Tetiana Klynina ·

    Friend or Foe? Language as an ideological switch in open-weight LLMs under Russian disinformation stress

    As Russia's war against Ukraine extends into generative AI, large language models (LLMs) adapted for local post-Soviet languages are deployed in contested information environments. Policy and industry discourse assumes that culturally aligned adaptation encodes the political orie…