A new benchmark finds that large language models struggle with common data formats such as JSON and TOON, failing to maintain accuracy and validity at scale. According to the study, JSON breaks down with as few as 500 records: models like GPT-5.5 return empty strings, and Opus miscounts significantly. TOON fares no better on generation, with every tested frontier model making consistent encoding errors and failing to produce valid output. The new GCF format, by contrast, achieved 100% comprehension and valid generation across all tested models, outperforming JSON and TOON in both accuracy and cost.
IMPACT The new GCF data format outperforms JSON and TOON when used with LLMs, potentially improving efficiency and accuracy in data processing.
RANK_REASON The cluster describes a novel benchmark and a new data format designed to improve LLM performance, fitting the definition of research.
- Claude Opus
- Claude Sonnet
- Gemini 2.5 Pro
- Gemini 3.1 Flash Lite
- Gemini 3.1 Pro
- Gemini 3.5 Flash
- GPT-5.4
- GPT-5.4-mini
- GPT-5.5
- Haiku
- JSON
- Opus
- TOON
AI-generated summary · Google Gemini · from 1 source.