English(EN) PExA: Parallel Exploration Agent for Complex Text-to-SQL

新框架通过灵活交互和细粒度反馈增强文本到SQL模型

作者 PulseAugur 编辑部 · [22 个来源] · 2026-04-28 04:00

研究人员开发了几个新框架来改进文本到SQL生成，特别是针对小型语言模型和复杂的数据库交互。FineStep和FINER-SQL引入了新颖的强化学习方法，具有步级信用分配和细粒度执行反馈，以提高准确性和效率。Rose-SQL利用小推理模型的上下文学习进行多轮查询，而FlexSQL专注于灵活的数据库交互和探索以更好地解释查询。此外，EGRefine通过优化命名约定来解决模式歧义，以提高各种模型在下游文本到SQL方面的性能。 AI

影响这些进展提供了更高效、更准确、更注重隐私的文本到SQL解决方案，有可能促进自然语言数据库查询的更广泛采用。

排序理由多篇研究论文介绍了用于改进文本到SQL生成的新颖框架和技术。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 22 个来源。我们如何撰写摘要 →

报道来源 [22]

arXiv cs.CL TIER_1 English(EN) · Andrea Giovannini · 2026-05-08 14:32

PolySQL：通过自动化后端同构性跨 SQL 方言扩展 Text-to-SQL 评估

SQL dialects vary in syntax, types, and functions across database engines. Text-to-SQL benchmarks, however, predominantly support only SQLite. This creates a critical evaluation gap: cross-dialect evaluation reveals weak per-query agreement (Cohen's ), showing that SQLite perform…
arXiv cs.CL TIER_1 English(EN) · Vicki Stover Hertzberg, Eduardo Valverde, Joyce C. Ho · 2026-05-08 04:00

查询的解剖：文本到SQL评估的W5H维度和FAR模式

arXiv:2605.05525v1 Announce Type: cross Abstract: Natural language interfaces to databases have gained popularity, yet the theoretical foundations for evaluating and designing these systems remain underdeveloped. We present QUEST (Query Understanding Evaluation through Semantic T…
arXiv cs.CL TIER_1 English(EN) · Yaxun Dai, Baolin Sun, Junying Wang, Pengfei Wang, Yingqi Gao, Xuemei Dong, Mengdie Chu, Xiang Qi, Pingfu Chao · 2026-05-07 04:00

步步为营：面向工具集成文本到SQL的步级信用分配

arXiv:2605.04719v1 Announce Type: new Abstract: Tool-integrated Text-to-SQL parsing has emerged as a promising paradigm, framing SQL generation as a sequential decision-making process interleaved with tool execution. However, existing reinforcement learning approaches mainly rely…
arXiv cs.CL TIER_1 English(EN) · Pingfu Chao · 2026-05-06 10:10

步步为营：面向工具集成文本到SQL的步级信用分配

Tool-integrated Text-to-SQL parsing has emerged as a promising paradigm, framing SQL generation as a sequential decision-making process interleaved with tool execution. However, existing reinforcement learning approaches mainly rely on coarse-grained outcome supervision, resultin…
arXiv cs.CL TIER_1 English(EN) · Le Zhou, Feng Yao, Fengcai Qiao, Bo Xu, Fangyuan Wang, Boyan Xu · 2026-05-06 04:00

Rose-SQL：基于角色-状态演化的结构化推理用于多轮文本到SQL

arXiv:2605.03720v1 Announce Type: new Abstract: Recent advances in Large Reasoning Models (LRMs) trained with Long Chain-of-Thought have demonstrated remarkable capabilities in code generation and mathematical reasoning. However, their potential in multi-turn Text-to-SQL tasks re…
arXiv cs.CL TIER_1 English(EN) · Thanh Dat Hoang, Thanh Trung Huynh, Matthias Weidlich, Thanh Tam Nguyen, Tong Chen, Hongzhi Yin, Quoc Viet Hung Nguyen · 2026-05-06 04:00

FINER-SQL：提升小型语言模型在文本到SQL方面的能力

arXiv:2605.03465v1 Announce Type: cross Abstract: Large language models have driven major advances in Text-to-SQL generation. However, they suffer from high computational cost, long latency, and data privacy concerns, which make them impractical for many real-world applications. …
arXiv cs.CL TIER_1 English(EN) · Boyan Xu · 2026-05-05 13:06

Rose-SQL：基于角色-状态演化的结构化推理用于多轮文本到SQL

Recent advances in Large Reasoning Models (LRMs) trained with Long Chain-of-Thought have demonstrated remarkable capabilities in code generation and mathematical reasoning. However, their potential in multi-turn Text-to-SQL tasks remains largely underexplored. Existing approaches…
arXiv cs.CL TIER_1 English(EN) · Quoc Viet Hung Nguyen · 2026-05-05 07:51

FINER-SQL：提升小型语言模型在文本到SQL方面的能力

Large language models have driven major advances in Text-to-SQL generation. However, they suffer from high computational cost, long latency, and data privacy concerns, which make them impractical for many real-world applications. A natural alternative is to use small language mod…
arXiv cs.CL TIER_1 English(EN) · Quang Hieu Pham, Yang He, Ping Nie, Canwen Xu, Davood Rafiei, Yuepeng Wang, Xi Ye, Jocelyn Qiaochu Chen · 2026-05-05 04:00

FlexSQL：灵活的探索和执行造就更佳的文本到SQL代理

arXiv:2605.02815v1 Announce Type: new Abstract: Text-to-SQL over large analytical databases requires navigating complex schemas, resolving ambiguous queries, and grounding decisions in actual data. Most current systems follow a fixed pipeline where schema elements are retrieved o…
arXiv cs.CL TIER_1 English(EN) · Jocelyn Qiaochu Chen · 2026-05-04 16:51

FlexSQL：灵活的探索和执行造就更优的 Text-to-SQL 代理

Text-to-SQL over large analytical databases requires navigating complex schemas, resolving ambiguous queries, and grounding decisions in actual data. Most current systems follow a fixed pipeline where schema elements are retrieved once upfront and the database is only revisited f…
arXiv cs.CL TIER_1 English(EN) · Jiaqian Wang, Yutao Qi, Wenjin Hou, Yu Pang, Rui Yang · 2026-05-04 04:00

EGREFINE：文本到SQL模式精炼的执行基础优化框架

arXiv:2605.00628v1 Announce Type: cross Abstract: Text-to-SQL enables non-expert users to query databases in natural language, yet real-world schemas often suffer from ambiguous, abbreviated, or inconsistent naming conventions that degrade model accuracy. Existing approaches trea…
arXiv cs.CL TIER_1 English(EN) · Rui Yang · 2026-05-01 13:01

EGREFINE：文本到SQL模式精炼的执行基础优化框架

Text-to-SQL enables non-expert users to query databases in natural language, yet real-world schemas often suffer from ambiguous, abbreviated, or inconsistent naming conventions that degrade model accuracy. Existing approaches treat schemas as fixed and address errors downstream. …
arXiv cs.AI TIER_1 English(EN) · Smit Jivani, Sarvam Maheshwari, Sunita Sarawagi · 2026-05-01 04:00

可靠回答重复性问题：通过模板约束解码提高 Text-to-SQL 准确性

arXiv:2604.28028v1 Announce Type: cross Abstract: Large language models (LLMs) have revolutionized Text-to-SQL generation, allowing users to query structured data using natural language with growing ease. Yet, real-world deployment remains challenging, especially in complex or un…
arXiv cs.AI TIER_1 English(EN) · Taslim Jamal Arif, Kuldeep Singh · 2026-05-01 04:00

生产环境中文本到SQL系统的代理无关SQL准确性评估

arXiv:2604.28049v1 Announce Type: new Abstract: Text-to-SQL (T2SQL) evaluation in production environments poses fundamental challenges that existing benchmarks do not address. Current evaluation methodologies whether rule-based SQL matching or schema-dependent semantic parsers as…
arXiv cs.AI TIER_1 English(EN) · Kuldeep Singh · 2026-04-30 15:59

生产环境中文本到SQL系统的代理无关SQL准确性评估

Text-to-SQL (T2SQL) evaluation in production environments poses fundamental challenges that existing benchmarks do not address. Current evaluation methodologies whether rule-based SQL matching or schema-dependent semantic parsers assume access to ground-truth queries and structur…
arXiv cs.CL TIER_1 English(EN) · Sunita Sarawagi · 2026-04-30 15:44

可靠答案应对重复问题：通过模板约束解码提升 Text-to-SQL 准确性

Large language models (LLMs) have revolutionized Text-to-SQL generation, allowing users to query structured data using natural language with growing ease. Yet, real-world deployment remains challenging, especially in complex or unseen schemas, due to inconsistent accuracy and the…
arXiv cs.CL TIER_1 English(EN) · Hojae Han, Yeonseok Jeong, Seung-won Hwang, Zhewei Yao, Yuxiong He · 2026-04-29 04:00

R$^3$-SQL：文本到SQL的排名、奖励和重采样

arXiv:2604.25325v1 Announce Type: cross Abstract: Modern Text-to-SQL systems generate multiple candidate SQL queries and rank them to judge a final prediction. However, existing methods face two limitations. First, they often score functionally equivalent SQL queries inconsistent…
arXiv cs.CL TIER_1 English(EN) · Yusuf Denizay D\"onder, Derek Hommel, Andrea W Wen-Yi, David Mimno, Unso Eun Seo Jo · 2026-04-29 04:00

更便宜、更好、更快、更强：无需思维链或微调的强大文本到SQL

arXiv:2505.14174v2 Announce Type: replace Abstract: LLMs are effective at code generation tasks like text-to-SQL, but is it worth the cost? Many state-of-the-art approaches use non-task-specific LLM techniques including Chain-of-Thought (CoT), self-consistency, and fine-tuning. T…
arXiv cs.CL TIER_1 English(EN) · Yuxiong He · 2026-04-28 07:40

R$^3$-SQL：用于文本到SQL的排名、奖励和重采样

Modern Text-to-SQL systems generate multiple candidate SQL queries and rank them to judge a final prediction. However, existing methods face two limitations. First, they often score functionally equivalent SQL queries inconsistently despite identical execution results. Second, ra…
Hugging Face Daily Papers TIER_1 English(EN) · 2026-04-28 07:40

R$^3$-SQL：文本到SQL的排序、奖励和重采样

Modern Text-to-SQL systems generate multiple candidate SQL queries and rank them to judge a final prediction. However, existing methods face two limitations. First, they often score functionally equivalent SQL queries inconsistently despite identical execution results. Second, ra…
arXiv cs.AI TIER_1 English(EN) · Sepideh Abedini, M. Tamer \"Ozsu · 2026-04-28 04:00

SQLyzr：文本到SQL的综合基准和评估平台

arXiv:2604.21214v2 Announce Type: replace-cross Abstract: Text-to-SQL models have significantly improved with the adoption of Large Language Models (LLMs), leading to their increasing use in real-world applications. Although many benchmarks exist for evaluating the performance of…
arXiv cs.CL TIER_1 English(EN) · Tanmay Parekh, Ella Hofmann-Coyle, Shuyi Wang, Sachith Sri Ram Kothur, Srivas Prasad, Yunmo Chen · 2026-04-28 04:00

PExA：复杂文本到SQL的并行探索代理

arXiv:2604.22934v1 Announce Type: cross Abstract: LLM-based agents for text-to-SQL often struggle with latency-performance trade-off, where performance improvements come at the cost of latency or vice versa. We reformulate text-to-SQL generation within the lens of software test c…

报道来源 [22]

相关实体

相关话题