PulseAugur
EN
LIVE 04:31:06
ENTITY Computation and Language

Computation and Language

PulseAugur coverage of Computation and Language — every cluster mentioning Computation and Language across labs, papers, and developer communities, ranked by signal.

Show in brief
Total · 30d
9
9 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
9
9 over 90d
TIER MIX · 90D
TOPICS
SENTIMENT · 30D

5 day(s) with sentiment data

RECENT · PAGE 1/1 · 9 TOTAL
  1. TOOL · CL_117771 ·

    New framework models LLM generation with controlled hallucinations

    Researchers have introduced a new framework for language generation in the limit, which aims to better reflect the capabilities and constraints of modern large language models. This approach addresses the trade-off betw…

  2. RESEARCH · CL_111601 ·

    New framework induces hierarchies from diverse text sources

    Researchers have developed a new term-centric framework for creating interpretable hierarchical taxonomies from diverse text sources. This method uses automatic term extraction to map documents into a shared representat…

  3. RESEARCH · CL_109547 ·

    New Red Teaming Framework Exposes LLM Faithfulness Vulnerabilities

    Researchers have developed a novel red teaming framework to systematically uncover vulnerabilities in large language models (LLMs). This framework utilizes a multi-role architecture with target, attacker, and jury model…

  4. RESEARCH · CL_107741 ·

    New method decomposes annotation tasks to improve efficiency

    Researchers have proposed a new method for efficient annotation of structured data by decomposing complex tasks into smaller sub-tasks. This approach aims to reduce the inferential load on annotators, whether human or m…

  5. TOOL · CL_105142 ·

    Research: Intrinsic Self-Correction in LLMs is Task-Dependent

    A new research paper explores the effectiveness of intrinsic self-correction (SC) in large language models, moving beyond general assessments to a task-sensitive analysis. The study investigates how SC functions through…

  6. TOOL · CL_51032 ·

    JudgmentBench dataset shows preference judgments outperform rubrics for AI evaluation

    Researchers have introduced JudgmentBench, a new benchmark dataset designed to compare rubric-based scoring against pairwise preference judgments for evaluating AI model outputs. The dataset comprises 1,539 rubric score…

  7. TOOL · CL_38286 ·

    Readers regress to errors in complex sentences, study finds

    A new paper explores how readers process complex sentences that contain plausible errors. Researchers observed that readers make specific eye movements, regressing to earlier parts of the text when later information sug…

  8. RESEARCH · CL_11444 ·

    New corpus maps LLM debates on societal issues with shadowed human traits

    Researchers have developed a new synthetic corpus called Cognitive Digital Shadows (CDS) containing 190,000 records to study how Large Language Models (LLMs) debate societal issues. The corpus is generated by 19 differe…

  9. RESEARCH · CL_06696 ·

    New framework synthesizes rich data for dialogue-based referring expression comprehension

    Researchers have developed a novel three-tier data synthesis framework to address the scarcity of annotated dialogue grounding data for generalized referring expression comprehension. This method aims to improve model p…