Computation and Language
PulseAugur coverage of Computation and Language — every cluster mentioning Computation and Language across labs, papers, and developer communities, ranked by signal.
5 day(s) with sentiment data
-
New framework models LLM generation with controlled hallucinations
Researchers have introduced a new framework for language generation in the limit, which aims to better reflect the capabilities and constraints of modern large language models. This approach addresses the trade-off betw…
-
New framework induces hierarchies from diverse text sources
Researchers have developed a new term-centric framework for creating interpretable hierarchical taxonomies from diverse text sources. This method uses automatic term extraction to map documents into a shared representat…
-
New Red Teaming Framework Exposes LLM Faithfulness Vulnerabilities
Researchers have developed a novel red teaming framework to systematically uncover vulnerabilities in large language models (LLMs). This framework utilizes a multi-role architecture with target, attacker, and jury model…
-
New method decomposes annotation tasks to improve efficiency
Researchers have proposed a new method for efficient annotation of structured data by decomposing complex tasks into smaller sub-tasks. This approach aims to reduce the inferential load on annotators, whether human or m…
-
Research: Intrinsic Self-Correction in LLMs is Task-Dependent
A new research paper explores the effectiveness of intrinsic self-correction (SC) in large language models, moving beyond general assessments to a task-sensitive analysis. The study investigates how SC functions through…
-
JudgmentBench dataset shows preference judgments outperform rubrics for AI evaluation
Researchers have introduced JudgmentBench, a new benchmark dataset designed to compare rubric-based scoring against pairwise preference judgments for evaluating AI model outputs. The dataset comprises 1,539 rubric score…
-
Readers regress to errors in complex sentences, study finds
A new paper explores how readers process complex sentences that contain plausible errors. Researchers observed that readers make specific eye movements, regressing to earlier parts of the text when later information sug…
-
New corpus maps LLM debates on societal issues with shadowed human traits
Researchers have developed a new synthetic corpus called Cognitive Digital Shadows (CDS) containing 190,000 records to study how Large Language Models (LLMs) debate societal issues. The corpus is generated by 19 differe…
-
New framework synthesizes rich data for dialogue-based referring expression comprehension
Researchers have developed a novel three-tier data synthesis framework to address the scarcity of annotated dialogue grounding data for generalized referring expression comprehension. This method aims to improve model p…