PulseAugur
EN
LIVE 14:43:50

New benchmark reveals LLMs struggle with Romanized Indic-English code-mixing

A new benchmark called Indi-RomCoM has been developed to evaluate Large Language Models (LLMs) on their ability to understand and process Romanized Code Mixing (RCM), a communication style blending Indic languages with English in the Roman script. Researchers found that current LLMs significantly underperform on RCM instructions, with performance decreasing as the density of code-mixing increases. However, reasoning tasks showed less degradation compared to detection tasks, as the generated explanations provided crucial context. This benchmark aims to foster the development of more inclusive multilingual AI systems. AI

IMPACT Highlights a gap in LLM capabilities for multilingual communication, potentially driving development of more inclusive AI systems.

RANK_REASON The cluster is about a new academic paper introducing a benchmark for evaluating LLMs. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.AI →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

New benchmark reveals LLMs struggle with Romanized Indic-English code-mixing

COVERAGE [1]

  1. arXiv cs.AI TIER_1 English(EN) · Avisha Das, Mihir Parmar, Mohana Ramnath, Pulkit Verma ·

    Indi-RomCoM: Code-Mixed Benchmark for Evaluating LLMs on Romanized Indic-English Instructions

    arXiv:2606.30790v1 Announce Type: cross Abstract: Romanized Code Mixing (RCM), where bilingual speakers fluidly blend local languages with English in Roman script, has emerged as the dominant form of communication across multilingual communities. While Large Language Models (LLMs…