PulseAugur
Bleu

PulseAugur coverage of Bleu — every cluster mentioning Bleu across labs, papers, and developer communities, ranked by signal.

Total · 30d: 8 (8 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 7 (7 over 90d)
RECENT · PAGE 1/1 · 5 TOTAL
  1. RESEARCH · CL_20329

    New DiffCap-Bench benchmark evaluates multimodal LLMs on image difference captioning

    Researchers have introduced DiffCap-Bench, a new benchmark designed to evaluate image difference captioning capabilities in multimodal large language models. This benchmark addresses limitations in existing datasets by …

  2. RESEARCH · CL_18262

    RAG+prompt system boosts Japanese-Chinese translation accuracy with linguistic analysis

    Researchers have developed a retrieval-augmented generation (RAG) system combined with prompting techniques to improve Japanese-Chinese machine translation, particularly for sentences with noun-modifying clause construc…

  3. RESEARCH · CL_06515

    VLMs over-correct math OCR, hiding student errors; new metric PINK improves evaluation

    Researchers have identified a significant issue in evaluating handwritten math OCR systems, particularly with Vision-Language Models (VLMs). These models often over-correct student errors instead of accurately transcrib…

  4. RESEARCH · CL_06260

    New study compares pose estimators for sign language translation systems

    A new paper evaluates various pose estimation systems for their effectiveness in sign language translation (SLT). Researchers compared common tools like MediaPipe Holistic and OpenPose against newer models such as SDPos…

  5. RESEARCH · CL_06298

    LLM code translation evaluation moves beyond BLEU to semantic correctness

    A new paper analyzes cross-lingual text simplification (CLTS) strategies for English and French using large language models. The study compared five prompting systems, including direct, composition, and decomposition ap…