ENTITY Computation and Language

Computation and Language

PulseAugur coverage of Computation and Language — every cluster mentioning Computation and Language across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

9 over 90d

Releases · 30d

0 over 90d

Papers · 30d

9 over 90d

TIER MIX · 90D

TOPICS

SENTIMENT · 30D

5 day(s) with sentiment data

RECENT · PAGE 1/1 · 9 TOTAL

TOOL · CL_117771 · Jun 30 · 04:00

New framework models LLM generation with controlled hallucinations

Researchers have introduced a new framework for language generation in the limit, which aims to better reflect the capabilities and constraints of modern large language models. This approach addresses the trade-off betw…
RESEARCH · CL_111601 · Jun 25 · 12:37

New framework induces hierarchies from diverse text sources

Researchers have developed a new term-centric framework for creating interpretable hierarchical taxonomies from diverse text sources. This method uses automatic term extraction to map documents into a shared representat…
RESEARCH · CL_109547 · Jun 24 · 07:00

New Red Teaming Framework Exposes LLM Faithfulness Vulnerabilities

Researchers have developed a novel red teaming framework to systematically uncover vulnerabilities in large language models (LLMs). This framework utilizes a multi-role architecture with target, attacker, and jury model…
RESEARCH · CL_107741 · Jun 23 · 15:58

New method decomposes annotation tasks to improve efficiency

Researchers have proposed a new method for efficient annotation of structured data by decomposing complex tasks into smaller sub-tasks. This approach aims to reduce the inferential load on annotators, whether human or m…
TOOL · CL_105142 · Jun 22 · 11:44

Research: Intrinsic Self-Correction in LLMs is Task-Dependent

A new research paper explores the effectiveness of intrinsic self-correction (SC) in large language models, moving beyond general assessments to a task-sensitive analysis. The study investigates how SC functions through…
TOOL · CL_51032 · May 26 · 04:00

JudgmentBench dataset shows preference judgments outperform rubrics for AI evaluation

Researchers have introduced JudgmentBench, a new benchmark dataset designed to compare rubric-based scoring against pairwise preference judgments for evaluating AI model outputs. The dataset comprises 1,539 rubric score…
TOOL · CL_38286 · May 18 · 15:42

Readers regress to errors in complex sentences, study finds

A new paper explores how readers process complex sentences that contain plausible errors. Researchers observed that readers make specific eye movements, regressing to earlier parts of the text when later information sug…
RESEARCH · CL_11444 · Apr 30 · 09:13

New corpus maps LLM debates on societal issues with shadowed human traits

Researchers have developed a new synthetic corpus called Cognitive Digital Shadows (CDS) containing 190,000 records to study how Large Language Models (LLMs) debate societal issues. The corpus is generated by 19 differe…
RESEARCH · CL_06696 · Apr 28 · 04:00

New framework synthesizes rich data for dialogue-based referring expression comprehension

Researchers have developed a novel three-tier data synthesis framework to address the scarcity of annotated dialogue grounding data for generalized referring expression comprehension. This method aims to improve model p…

New framework models LLM generation with controlled hallucinations

New framework induces hierarchies from diverse text sources

New Red Teaming Framework Exposes LLM Faithfulness Vulnerabilities

New method decomposes annotation tasks to improve efficiency

Research: Intrinsic Self-Correction in LLMs is Task-Dependent

JudgmentBench dataset shows preference judgments outperform rubrics for AI evaluation

Readers regress to errors in complex sentences, study finds

New corpus maps LLM debates on societal issues with shadowed human traits

New framework synthesizes rich data for dialogue-based referring expression comprehension