Claude Sonnet 4.6 vs GPT-4.1 vs Gemini 2.5 Flash: which wins JSON extraction?
A recent benchmark evaluated six large language models on their ability to extract structured data, specifically JSON, from customer support emails. The analysis found that Anthropic's Claude Haiku 4.5 offered the best value, achieving high accuracy at a significantly lower cost compared to more powerful models. While Gemini 2.5 Flash was fast and inexpensive, it struggled with accuracy, particularly in hallucinating data. The study suggests using Haiku for most extraction tasks, Sonnet for more complex reasoning, and avoiding more expensive frontier models for simple data extraction. AI
IMPACT Identifies the most cost-effective LLM for structured data extraction, guiding developers on model selection for production features.