Claude Haiku 4.5 leads in cost-effective JSON extraction benchmark

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-20 10:23

A recent benchmark evaluated six large language models on their ability to extract structured data, specifically JSON, from customer support emails. The analysis found that Anthropic's Claude Haiku 4.5 offered the best value, achieving high accuracy at a significantly lower cost compared to more powerful models. While Gemini 2.5 Flash was fast and inexpensive, it struggled with accuracy, particularly in hallucinating data. The study suggests using Haiku for most extraction tasks, Sonnet for more complex reasoning, and avoiding more expensive frontier models for simple data extraction. AI

影响 Identifies the most cost-effective LLM for structured data extraction, guiding developers on model selection for production features.

排序理由 The cluster describes a benchmark comparing LLM performance on a specific task, rather than a new model release or major industry event. [lever_c_demoted from research: ic=1 ai=1.0]

在 dev.to — LLM tag 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

Claude Haiku 4.5 leads in cost-effective JSON extraction benchmark

报道来源 [1]

dev.to — LLM tag TIER_1 English(EN) · shaun vd · 2026-05-20 10:23

Claude Sonnet 4.6 对比 GPT-4.1 对比 Gemini 2.5 Flash：哪个在 JSON 提取中胜出？

We had a question: for structured-output tasks where you just need clean JSON back, which frontier model wins on a cost/quality basis? The answer matters because most production LLM features aren't writing poetry — they're extracting fields from emails, summ…

报道来源 [1]

Claude Sonnet 4.6 对比 GPT-4.1 对比 Gemini 2.5 Flash：哪个在 JSON 提取中胜出？

相关实体

相关话题