Reinforcement learning trains small models for text-to-SPARQL generation

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-19 16:20

Researchers have explored using reinforcement learning to train smaller language models for zero-shot Text-to-SPARQL generation, a task crucial for knowledge graph question answering. They applied Group-Relative Policy Optimization (GRPO) to the Qwen3-1.7B model, utilizing execution feedback and answer-level rewards instead of requiring gold query annotations. The GRPO-trained models showed significant improvement over a zero-shot baseline, demonstrating the viability of outcome-based reinforcement learning for this task when full supervision is unavailable. AI

影响 Demonstrates a viable method for training smaller models on complex tasks without extensive labeled data, potentially lowering barriers to knowledge graph querying.

排序理由 Academic paper detailing a novel approach to text-to-SPARQL generation using reinforcement learning. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CL TIER_1 English(EN) · Ricardo Usbeck · 2026-05-19 16:20

Text-to-SPARQL Generation with Reinforcement Learning: A GRPO-based Approach on DBLP

Knowledge graph question answering seeks to translate natural language questions into executable queries over knowledge graphs, but existing approaches often rely on large models or full supervision in the form of gold query annotations. This study examines whether reinforcement …

报道来源 [1]

Text-to-SPARQL Generation with Reinforcement Learning: A GRPO-based Approach on DBLP

相关实体

相关话题