How LLMs and Claude perform in a lesser-known language
The Institute of the Estonian Language (EKI) has developed a new benchmark to assess how large language models perform in Estonian. The benchmark evaluates not only language proficiency and reasoning but also factual accuracy and resistance to propaganda. Notably, Claude demonstrated strong resistance to propaganda. The results also highlight that models that excel in English may falter in smaller-language contexts.
IMPACT: Highlights the need for language-specific evaluations to uncover LLM weaknesses that English-centric benchmarks miss.