PulseAugur
实时 10:53:27

New benchmark reveals LLM bias towards Brazilian Portuguese

A new benchmark called P3B3 has been developed to assess how large language models (LLMs) handle variations in Portuguese, specifically European Portuguese (pt-PT) and Brazilian Portuguese (pt-BR). The benchmark aims to address the current imbalance where pt-BR data is more prevalent, leading to LLMs exhibiting a bias towards this variety. Experiments using P3B3 revealed that most tested LLMs show a strong preference for pt-BR, with varying degrees of controllability across different models, underscoring the need for more balanced representation of language varieties in LLMs. AI

影响 Highlights the need for improved representation of linguistic diversity in LLMs to ensure equitable and reliable performance across different language varieties.

排序理由 The cluster describes a new academic paper introducing a benchmark for LLM research.

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

报道来源 [2]

  1. arXiv cs.AI TIER_1 English(EN) · Rafael Ferreira, In\^es Vieira, In\^es Calvo, James Furtado, Iago Paulo, Diogo Tavares, Diogo Gl\'oria-Silva, David Semedo, Jo\~ao Magalh\~aes ·

    P3B3: A Multi-Turn Conversational Benchmark for Measuring European and Brazilian Portuguese Variety Bias in LLMs

    arXiv:2606.16753v1 Announce Type: cross Abstract: As Large Language Models (LLMs) become embedded in everyday communication, capturing regional linguistic variation is essential for reliable and equitable language use. In Portuguese, European (pt-PT) and Brazilian (pt-BR) varieti…

  2. arXiv cs.AI TIER_1 English(EN) · João Magalhães ·

    P3B3: A Multi-Turn Conversational Benchmark for Measuring European and Brazilian Portuguese Variety Bias in LLMs

    As Large Language Models (LLMs) become embedded in everyday communication, capturing regional linguistic variation is essential for reliable and equitable language use. In Portuguese, European (pt-PT) and Brazilian (pt-BR) varieties remain unevenly represented, with pt-BR dominat…