PulseAugur / Brief

Brief

last 24h
[1/1] 222 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

  1. EntSQL: A Benchmark for Grounding Text-to-SQL in Long-Context Enterprise Knowledge

    Researchers have introduced EntSQL, a benchmark for evaluating Text-to-SQL systems in enterprise settings. Unlike previous benchmarks, EntSQL grounds SQL generation in long-context, proprietary business documents. It includes 1,066 aligned Chinese-English examples across five business domains, many of which require knowledge beyond the immediate question and schema. Current systems struggle with the task: the best-performing model achieves only 15.9% accuracy on English inputs when provided with long-form documents.

    IMPACT: Highlights the difficulty of applying LLMs to enterprise-specific data, which may drive development of more context-aware Text-to-SQL systems.