PulseAugur
实时 23:56:40

LLMs struggle with cultural nuances and cross-lingual transfer in sentiment analysis

Two new papers explore the capabilities of large language models (LLMs) in understanding nuanced language across different cultures and languages. One study evaluates cross-lingual transfer strategies for aspect-based sentiment analysis, finding that fine-tuned LLMs perform best, especially when trained on multiple non-target languages. The other paper investigates whether LLMs grasp embodied cognition and cultural variations, concluding that current models fail to inherently understand cultural differences and default to English-centric reasoning. AI

影响 Highlights limitations in current LLMs' cross-lingual and cultural understanding, suggesting areas for future model development.

排序理由 The cluster contains two academic papers published on arXiv, detailing research into LLM capabilities.

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 10 个来源。 我们如何撰写摘要 →

LLMs struggle with cultural nuances and cross-lingual transfer in sentiment analysis

报道来源 [10]

  1. Hugging Face Daily Papers TIER_1 English(EN) ·

    What Makes Words Hard? Sakura at BEA 2026 Shared Task on Vocabulary Difficulty Prediction

    We describe two types of models for vocabulary difficulty prediction: a high-accuracy black-box model, which achieved the top shared task result in the open track, and an explainable model, which outperforms a fine-tuned encoder baseline. As the black-box model, we fine-tuned an …

  2. arXiv cs.CL TIER_1 English(EN) · Lisa Beinborn ·

    What makes a word hard to learn? Modeling L1 influence on English vocabulary difficulty

    What makes a word difficult to learn, and how does the difficulty depend on the learner's native language? We computationally model vocabulary difficulty for English learners whose first language is Spanish, German, or Chinese with gradient-boosted models trained on features rela…

  3. arXiv cs.CL TIER_1 English(EN) · Debasmita Bhattacharya, Marten van Schijndel ·

    Code-switching in text and speech challenges information-theoretic speaker design

    arXiv:2408.04596v2 Announce Type: replace Abstract: In this work, we use language modeling to investigate the factors that influence insertional code-switching. Code-switching occurs when a speaker alternates between one language variety (the primary language) and another (the se…

  4. arXiv cs.LG TIER_1 English(EN) · Gregory N. Frank ·

    Detection Is Cheap, Routing Is Learned: Why Refusal-Based Alignment Evaluation Fails

    arXiv:2603.18280v3 Announce Type: replace Abstract: Current alignment evaluation mostly measures whether models encode dangerous concepts and whether they refuse harmful requests. Both miss the layer where alignment often operates: routing from concept detection to behavioral pol…

  5. arXiv cs.CL TIER_1 English(EN) · Jakob Fehle, Nils Constantin Hellwig, Udo Kruschwitz, Christian Wolff ·

    Zero-Shot to Full-Resource: Cross-lingual Transfer Strategies for Aspect-Based Sentiment Analysis

    arXiv:2604.26619v1 Announce Type: new Abstract: Aspect-based Sentiment Analysis (ABSA) extracts fine-grained opinions toward specific aspects within text but remains largely English-focused despite major advances in transformer-based and instruction-tuned models. This work presen…

  6. arXiv cs.CL TIER_1 English(EN) · Christian Wolff ·

    Zero-Shot to Full-Resource: Cross-lingual Transfer Strategies for Aspect-Based Sentiment Analysis

    Aspect-based Sentiment Analysis (ABSA) extracts fine-grained opinions toward specific aspects within text but remains largely English-focused despite major advances in transformer-based and instruction-tuned models. This work presents a multilingual evaluation of state-of-the-art…

  7. arXiv cs.CL TIER_1 English(EN) · Yu Wang, Emmanuele Chersoni, Chu-Ren Huang ·

    Do LLMs Capture Embodied Cognition and Cultural Variation? Cross-Linguistic Evidence from Demonstratives

    arXiv:2604.25423v1 Announce Type: new Abstract: Do large language models (LLMs) truly acquire embodied cognition and cultural conventions from text? We introduce demonstratives, fundamental spatial expressions like "this/that" in English and "zh\"e/n\"a" in Chinese, as a novel pr…

  8. Hugging Face Daily Papers TIER_1 English(EN) ·

    Do LLMs Capture Embodied Cognition and Cultural Variation? Cross-Linguistic Evidence from Demonstratives

    Do large language models (LLMs) truly acquire embodied cognition and cultural conventions from text? We introduce demonstratives, fundamental spatial expressions like "this/that" in English and "zhè/nà" in Chinese, as a novel probe for grounded knowledge. Using 6,400 responses fr…

  9. arXiv cs.CL TIER_1 English(EN) · Chu-Ren Huang ·

    Do LLMs Capture Embodied Cognition and Cultural Variation? Cross-Linguistic Evidence from Demonstratives

    Do large language models (LLMs) truly acquire embodied cognition and cultural conventions from text? We introduce demonstratives, fundamental spatial expressions like "this/that" in English and "zhè/nà" in Chinese, as a novel probe for grounded knowledge. Using 6,400 responses fr…

  10. arXiv cs.CL TIER_1 English(EN) · Lung-Hao Lee, Liang-Chih Yu, Natalia Loukashevich, Ilseyar Alimova, Alexander Panchenko, Tzu-Mi Lin, Zhe-Yu Xu, Jian-Yu Zhou, Guangmin Zheng, Jin Wang, Sharanya Awasthi, Jonas Becker, Jan Philip Wahle, Terry Ruas, Shamsuddeen Hassan Muhammad, Saif M. Moha ·

    DimABSA: Building Multilingual and Multidomain Datasets for Dimensional Aspect-Based Sentiment Analysis

    arXiv:2601.23022v3 Announce Type: replace Abstract: Aspect-Based Sentiment Analysis (ABSA) focuses on extracting sentiment at a fine-grained aspect level and has been widely applied across real-world domains. However, existing ABSA research relies on coarse-grained categorical la…