New corpus maps LLM debates on societal issues with shadowed human traits

By PulseAugur Editorial · [3 sources] · 2026-04-30 09:13

Researchers have developed a new synthetic corpus called Cognitive Digital Shadows (CDS) containing 190,000 records to study how Large Language Models (LLMs) debate societal issues. The corpus is generated by 19 different LLMs, each prompted to adopt specific human personas or an AI-assistant role. CDS includes LLM responses on controversial topics like healthcare, disinformation, and gender gaps, with persona-conditioned records encoding 17 sociodemographic and psychological attributes to link prompts with language, stances, and reasoning. AI

IMPACT Provides a novel dataset for auditing LLM bias and social sensitivity in discourse.

RANK_REASON Academic paper release on arXiv detailing a new synthetic corpus for LLM research.

Read on arXiv cs.CL →

paper
safety

AI-generated summary · Google Gemini · from 3 sources. How we write summaries →

New corpus maps LLM debates on societal issues with shadowed human traits

COVERAGE [3]

arXiv cs.AI TIER_1 English(EN) · Ali Aghazadeh Ardebili, Massimo Stella · 2026-05-01 04:00

Mapping how LLMs debate societal issues when shadowing human personality traits, sociodemographics and social media behavior

arXiv:2604.27624v1 Announce Type: cross Abstract: Large Language Models (LLMs) can strongly shape social discourse, yet datasets investigating how LLM outputs vary across controlled social and contextual prompting remain sparse. Cognitive Digital Shadows (CDS) is a 190,000-record…
arXiv cs.CL TIER_1 English(EN) · Massimo Stella · 2026-04-30 09:13

Mapping how LLMs debate societal issues when shadowing human personality traits, sociodemographics and social media behavior

Large Language Models (LLMs) can strongly shape social discourse, yet datasets investigating how LLM outputs vary across controlled social and contextual prompting remain sparse. Cognitive Digital Shadows (CDS) is a 190,000-record synthetic corpus supporting analyses of LLM-gener…
Hugging Face Daily Papers TIER_1 English(EN) · 2026-04-30 09:13

Mapping how LLMs debate societal issues when shadowing human personality traits, sociodemographics and social media behavior

Large Language Models (LLMs) can strongly shape social discourse, yet datasets investigating how LLM outputs vary across controlled social and contextual prompting remain sparse. Cognitive Digital Shadows (CDS) is a 190,000-record synthetic corpus supporting analyses of LLM-gener…

COVERAGE [3]

Mapping how LLMs debate societal issues when shadowing human personality traits, sociodemographics and social media behavior

Mapping how LLMs debate societal issues when shadowing human personality traits, sociodemographics and social media behavior

Mapping how LLMs debate societal issues when shadowing human personality traits, sociodemographics and social media behavior

RELATED ENTITIES

RELATED TOPICS