Researchers have developed a text-to-SQL system leveraging large language models for querying astronomical databases, specifically the ALeRCE system for the Zwicky Transient Facility and Vera C. Rubin Observatory. The system, which uses in-context learning, allows users to input natural language queries to generate executable SQL. A dataset of 110 NL/SQL pairs was created to evaluate the framework, which includes modules for schema linking, query classification, prompt decomposition, and self-correction. Performance tests on thirteen LLMs showed that Claude Opus 4.6, Gemini 2.5 Pro, Gemini 3 Flash, and GPT-5.2-Codex were the top performers, with Claude Opus 4.6 achieving high accuracy on simpler queries but decreasing with complexity. AI
IMPACT Enables natural language querying of complex scientific databases, potentially accelerating research in fields like astronomy.
RANK_REASON The cluster describes a research paper detailing a new system and evaluation of LLMs for a specific scientific application. [lever_c_demoted from research: ic=1 ai=1.0]
- Claude Opus 4.6
- Gemini 2.5 Pro
- Gemini 3 Flash
- GPT-5.2 Codex
- Pablo A. Estévez
- Vera C. Rubin Observatory
- Zwicky Transient Facility
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →