Rouge
PulseAugur coverage of Rouge — every cluster mentioning Rouge across labs, papers, and developer communities, ranked by signal.
3 day(s) with sentiment data
-
New defense framework targets data poisoning in text summarization models
Researchers have developed a new framework to defend text summarization models against data poisoning attacks that occur during the fine-tuning stage. This method, called Detect, Unlearn, Restore, can identify poisoned …
-
Fine-tuned PEGASUS model achieves state-of-the-art abstractive summarization
Researchers have fine-tuned the PEGASUS model on the XL-Sum English corpus to improve abstractive summarization performance. This fine-tuned model achieved state-of-the-art results on the XL-Sum English Corpus, demonstr…
-
Direct Preference Optimization Simplifies LLM Fine-Tuning
Researchers have published a study on Direct Preference Optimization (DPO), a reinforcement learning technique for fine-tuning large language models. The paper details how DPO simplifies training, enhances computational…
-
LLM-as-a-Judge replaces traditional metrics for AI evaluation
Traditional NLP metrics like BLEU and ROUGE are insufficient for evaluating generative AI responses in production, especially in complex domains like financial regulatory documentation. These metrics, designed for tasks…
-
New RCD method optimizes LLM processing of long clinical texts within budget
Researchers have developed a new method called RCD for selecting relevant subsets of long clinical texts to reduce token costs for large language models. This approach frames the problem as a knapsack-constrained subset…
-
New research proposes reasoning-aware training for better dialogue summarization
Researchers have developed a new framework for multi-role dialogue summarization that moves beyond traditional overlap metrics like ROUGE. Their approach incorporates explicit cognitive-style reasoning and reward-based …
-
Eugene Yan explores challenges in evaluating abstractive summaries and detecting hallucinations
Evaluating abstractive summarization, which involves rephrasing source material rather than copying sentences, presents challenges, particularly in assessing relevance and factual consistency. While fluency and coherenc…