PulseAugur
LIVE 08:24:51
research · [3 sources] ·
0
research

New corpus maps LLM debates on societal issues with shadowed human traits

Researchers have developed a new synthetic corpus called Cognitive Digital Shadows (CDS) containing 190,000 records to study how Large Language Models (LLMs) debate societal issues. The corpus is generated by 19 different LLMs, each prompted to adopt specific human personas or an AI-assistant role. CDS includes LLM responses on controversial topics like healthcare, disinformation, and gender gaps, with persona-conditioned records encoding 17 sociodemographic and psychological attributes to link prompts with language, stances, and reasoning. AI

Summary written by gemini-2.5-flash-lite from 3 sources. How we write summaries →

IMPACT Provides a novel dataset for auditing LLM bias and social sensitivity in discourse.

RANK_REASON Academic paper release on arXiv detailing a new synthetic corpus for LLM research.

Read on arXiv cs.CL →

COVERAGE [3]

  1. arXiv cs.AI TIER_1 · Ali Aghazadeh Ardebili, Massimo Stella ·

    Mapping how LLMs debate societal issues when shadowing human personality traits, sociodemographics and social media behavior

    arXiv:2604.27624v1 Announce Type: cross Abstract: Large Language Models (LLMs) can strongly shape social discourse, yet datasets investigating how LLM outputs vary across controlled social and contextual prompting remain sparse. Cognitive Digital Shadows (CDS) is a 190,000-record…

  2. arXiv cs.CL TIER_1 · Massimo Stella ·

    Mapping how LLMs debate societal issues when shadowing human personality traits, sociodemographics and social media behavior

    Large Language Models (LLMs) can strongly shape social discourse, yet datasets investigating how LLM outputs vary across controlled social and contextual prompting remain sparse. Cognitive Digital Shadows (CDS) is a 190,000-record synthetic corpus supporting analyses of LLM-gener…

  3. Hugging Face Daily Papers TIER_1 ·

    Mapping how LLMs debate societal issues when shadowing human personality traits, sociodemographics and social media behavior

    Large Language Models (LLMs) can strongly shape social discourse, yet datasets investigating how LLM outputs vary across controlled social and contextual prompting remain sparse. Cognitive Digital Shadows (CDS) is a 190,000-record synthetic corpus supporting analyses of LLM-gener…