PulseAugur
实时 21:45:43

New corpus maps LLM debates on societal issues with shadowed human traits

Researchers have developed a new synthetic corpus called Cognitive Digital Shadows (CDS) containing 190,000 records to study how Large Language Models (LLMs) debate societal issues. The corpus is generated by 19 different LLMs, each prompted to adopt specific human personas or an AI-assistant role. CDS includes LLM responses on controversial topics like healthcare, disinformation, and gender gaps, with persona-conditioned records encoding 17 sociodemographic and psychological attributes to link prompts with language, stances, and reasoning. AI

影响 Provides a novel dataset for auditing LLM bias and social sensitivity in discourse.

排序理由 Academic paper release on arXiv detailing a new synthetic corpus for LLM research.

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 3 个来源。 我们如何撰写摘要 →

New corpus maps LLM debates on societal issues with shadowed human traits

报道来源 [3]

  1. arXiv cs.AI TIER_1 English(EN) · Ali Aghazadeh Ardebili, Massimo Stella ·

    Mapping how LLMs debate societal issues when shadowing human personality traits, sociodemographics and social media behavior

    arXiv:2604.27624v1 Announce Type: cross Abstract: Large Language Models (LLMs) can strongly shape social discourse, yet datasets investigating how LLM outputs vary across controlled social and contextual prompting remain sparse. Cognitive Digital Shadows (CDS) is a 190,000-record…

  2. arXiv cs.CL TIER_1 English(EN) · Massimo Stella ·

    Mapping how LLMs debate societal issues when shadowing human personality traits, sociodemographics and social media behavior

    Large Language Models (LLMs) can strongly shape social discourse, yet datasets investigating how LLM outputs vary across controlled social and contextual prompting remain sparse. Cognitive Digital Shadows (CDS) is a 190,000-record synthetic corpus supporting analyses of LLM-gener…

  3. Hugging Face Daily Papers TIER_1 English(EN) ·

    Mapping how LLMs debate societal issues when shadowing human personality traits, sociodemographics and social media behavior

    Large Language Models (LLMs) can strongly shape social discourse, yet datasets investigating how LLM outputs vary across controlled social and contextual prompting remain sparse. Cognitive Digital Shadows (CDS) is a 190,000-record synthetic corpus supporting analyses of LLM-gener…