PulseAugur
LIVE 08:17:56
ENTITY BERTScore: Evaluating text generation with BERT

BERTScore: Evaluating text generation with BERT

PulseAugur coverage of BERTScore: Evaluating text generation with BERT — every cluster mentioning BERTScore: Evaluating text generation with BERT across labs, papers, and developer communities, ranked by signal.

Total · 30d
0
0 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
0
0 over 90d
TIER MIX · 90D

No coverage in the last 90 days.

SENTIMENT · 30D

1 day(s) with sentiment data

RECENT · PAGE 1/1 · 9 TOTAL
  1. TOOL · CL_29008 ·

    GraphRAG cuts token use by 60% on quantum papers

    A project developed for the TigerGraph GraphRAG Inference Hackathon demonstrated that GraphRAG significantly reduces token consumption and improves accuracy for complex queries. By constructing a knowledge graph of enti…

  2. TOOL · CL_20626 ·

    Mistral, QWen models show divergent strategies in biomedical text simplification

    A new research paper compares the text simplification strategies of Mistral-Small and QWen2.5 when applied to biomedical information. The study found that Mistral-Small effectively balances readability and accuracy, per…

  3. TOOL · CL_20382 ·

    Researchers improve medical VQA with trajectory-aware process supervision

    Researchers have developed a novel method to improve medical visual question answering (VQA) systems by incorporating trajectory-aware process supervision. This approach utilizes a two-stage training framework, starting…

  4. RESEARCH · CL_18258 ·

    New DESG model improves AI therapist evaluation beyond LLM judges

    Researchers have developed a new model-agnostic evaluator called Dynamic Emotional Signature Graphs (DESG) to assess the quality of AI-generated responses in mental health dialogues. This method moves beyond simple text…

  5. RESEARCH · CL_13212 ·

    LLMs favor their own resumes in hiring, study finds

    A new study reveals that Large Language Models (LLMs) exhibit a significant self-preference bias in hiring processes, favoring resumes generated by themselves over human-written ones. This bias, ranging from 67% to 82% …

  6. RESEARCH · CL_14134 ·

    New RCD method optimizes LLM processing of long clinical texts within budget

    Researchers have developed a new method called RCD for selecting relevant subsets of long clinical texts to reduce token costs for large language models. This approach frames the problem as a knapsack-constrained subset…

  7. RESEARCH · CL_11448 ·

    New HATS dataset integrates human perception for ASR evaluation

    Researchers have introduced HATS, a new French dataset designed to evaluate Automatic Speech Recognition (ASR) systems by incorporating human perception. The dataset was created by having 143 individuals compare and sel…

  8. RESEARCH · CL_08628 ·

    New research proposes reasoning-aware training for better dialogue summarization

    Researchers have developed a new framework for multi-role dialogue summarization that moves beyond traditional overlap metrics like ROUGE. Their approach incorporates explicit cognitive-style reasoning and reward-based …

  9. RESEARCH · CL_06982 ·

    ArgRE system uses formal argumentation to improve AI agent requirements negotiation

    Researchers have developed ArgRE, a novel system for resolving conflicts in multi-agent requirements negotiation for complex software systems. ArgRE embeds Dung-style abstract argumentation, modeling proposals and criti…