ENTITY Artificial Analysis

Artificial Analysis

PulseAugur coverage of Artificial Analysis — every cluster mentioning Artificial Analysis across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

48 over 90d

Releases · 30d

0 over 90d

Papers · 30d

6 over 90d

TIER MIX · 90D

frontier release 3
significant 3
research 10
tool 26
commentary 6

TOPICS

model release 31
product 29
infra 13
other 11
paper 6
safety 2
opinion 1
policy 1

RELATIONSHIPS

instance of GLM-5.2 95%
used by Fireworks AI 90%
instance of MiniMax M2.7 90%
instance of MiniMax M3 90%
competes with Opus 4.8 70%
used by MiniMax M3 70%
instance of Fireworks AI 70%
used by MiniMax M2.7 50%

TIMELINE

2026-06-16 research_milestone Artificial Analysis released an updated version of its Intelligence Index, version 4.1, which includes a greater emphasis on agentic workloads and improved benchmarks. source

SENTIMENT · 30D

17 day(s) with sentiment data

RECENT · PAGE 1/3 · 48 TOTAL

TOOL · CL_109838 · Jun 25 · 06:02

Together AI claims world's fastest speech-to-text stack

Together AI has developed a speech-to-text system that achieves industry-leading speed. Their 'parakeet' model, running on Together's infrastructure, processes audio at approximately 302 seconds of audio per second of p…
RESEARCH · CL_108898 · Jun 23 · 15:31

Krea 2: New 12B open-weights image model prioritizes creative exploration

Krea 2, a new 12B parameter open-weights image generation model, has been released with a focus on creative exploration rather than just polished defaults. The model utilizes a diffusion transformer architecture and a m…
SIGNIFICANT · CL_106010 · Jun 23 · 15:01

Z.ai's GLM-5.2 achieves near-top-tier performance in open-weight models · 1 source tracked

GLM-5.2, an open-weight model from Z.ai, has demonstrated significant advancements, closing the gap with leading proprietary models. In just ten weeks, GLM-5.2 improved its Artificial Analysis Intelligence Index score b…
TOOL · CL_104149 · Jun 22 · 17:54

AI video editing models compared; SpaceX inks $6.3B compute deal

Artificial Analysis has launched a video editing model comparison arena, allowing users to vote on the performance of models like Seedance 2.0, Runway Aleph 2.0, Wan 2.7, HappyHorse 1.0, Kling 3.0 Omni, and SkyReels V4 …
COMMENTARY · CL_103498 · Jun 22 · 03:16

Users seek better leaderboards for quantized AI models

A user on r/LocalLLaMA is seeking a better method for comparing the performance of quantized large language models. They find the existing "Artificial Analysis" leaderboard useful for assessing model intelligence but no…
COMMENTARY · CL_103107 · Jun 21 · 20:56

Open AI Models: Minimal Downside to Switching from Proprietary Leaders

Andrew Marble argues that the professional risks associated with using open-source AI models are diminishing, drawing parallels to the past transition from Windows to Linux. While proprietary models like Claude and GPT …
TOOL · CL_106338 · Jun 21 · 02:19

New Intelligence Index Ranks Frontier AI Models

Artificial Analysis has developed an "Intelligence Index" to quantify the capabilities of frontier AI models. This index is a weighted average of nine evaluations, with a strong emphasis on agentic tasks. While closed-s…
SIGNIFICANT · CL_100054 · Jun 19 · 05:53

GLM-5.2 emerges as top open-weight AI model, rivaling GPT-5.5

The open-weight language model GLM-5.2 has garnered significant attention, with multiple sources indicating it performs comparably to frontier models like GPT-5.5 and Anthropic's Opus 4.8. This model features architectu…
TOOL · CL_99829 · Jun 19 · 02:32

GLM-5.2 matches Anthropic Opus 4.8 on coding, driving cost competition

Artificial Analysis has ranked GLM-5.2 as the leading open-weight model, noting its performance on coding tasks is comparable to Anthropic's Opus 4.8. This development suggests significant cost competition for major AI …
TOOL · CL_99467 · Jun 19 · 00:22

Artificial Analysis unveils new AABriefcase benchmark for AI systems

Artificial Analysis has introduced a new benchmark named AABriefcase, designed to evaluate AI systems. The announcement was made via a post on X, formerly Twitter, and shared on Reddit's r/singularity.
COMMENTARY · CL_97924 · Jun 18 · 04:01

LLM Gateway Latency Overheads Are Negligible, Developer Finds

A developer spent a month meticulously benchmarking LLM gateway latency, only to discover that the gateway's contribution to overall request time was negligible, often less than 1%. The actual performance bottlenecks li…
TOOL · CL_97108 · Jun 17 · 15:58

MiniMax M3 model tops leaderboards, offers free access, and integrates with Unreal Engine

MiniMax AI's M3 model is gaining recognition, topping leaderboards and being offered for free access on B.AI. The model is also being integrated into hackathons and tested with advanced software like Unreal Engine 5.8 M…
RESEARCH · CL_96526 · Jun 17 · 09:12

GLM-5.2 leads open weights models on Artificial Analysis Intelligence Index · 4 sources tracked

Z.ai's GLM-5.2 has been recognized as the top-performing open-weights model on the Artificial Analysis Intelligence Index, achieving a score of 51. Despite maintaining the same scale as its predecessor, GLM-5.1, GLM-5.2…
RESEARCH · CL_94468 · Jun 16 · 10:51

Artificial Analysis updates Intelligence Index for AI model evaluation · 2 sources tracked

Artificial Analysis has released version 4.1 of its Intelligence Index, a comprehensive metric for evaluating model intelligence. This update places a greater emphasis on agentic workloads and incorporates improved benc…
RESEARCH · CL_90305 · Jun 14 · 15:25

DeepSeek V4 Pro tops speed and latency benchmarks on Together AI

DeepSeek V4 Pro, when deployed on the Together AI platform, has achieved the top ranking on Artificial Analysis for both output speed and latency. This performance is attributed to advancements in inference systems, inc…
COMMENTARY · CL_89700 · Jun 14 · 02:48

AI Model Race Visualized: Trends from 2022-2026

Jianqi Pan created a visualization of the AI model race from 2022 to 2026, based on data from Artificial Analysis. The visualization highlights the competitive landscape and trends in AI model benchmarks rather than foc…
RESEARCH · CL_88579 · Jun 13 · 04:30

Anthropic suspends Fable/Mythos models citing US gov directive

Anthropic has suspended access to its Fable 5 and Mythos 5 models for all customers worldwide following a directive from the U.S. government, citing national cybersecurity risks. This abrupt revocation has disrupted dow…
RESEARCH · CL_88322 · Jun 12 · 23:44

Together AI benchmarks Blackwell hardware for agent infrastructure

Together AI has released benchmarks demonstrating the performance of their inference stack on NVIDIA's Blackwell hardware, showing a 31% increase in transactions per second compared to other open-source engines. This pe…
RESEARCH · CL_88265 · Jun 12 · 21:00

NVIDIA Blackwell Systems Lead New Agentic AI Benchmarks

NVIDIA has set new performance records on the first agentic AI benchmarks, AgentPerf and Agentic AI Benchmark. The company's GB300 NVL72 system, powered by Blackwell architecture, demonstrated up to a 20x performance le…
SIGNIFICANT · CL_88106 · Jun 12 · 19:04

MiniMax M3 launches with 512K context on Fireworks AI

MiniMax AI has launched its M3 model, available on the Fireworks AI platform. This new model boasts a 512K context window, native image and video input capabilities, and utilizes MSA sparse attention for significantly f…

Together AI claims world's fastest speech-to-text stack

Krea 2: New 12B open-weights image model prioritizes creative exploration

Z.ai's GLM-5.2 achieves near-top-tier performance in open-weight models · 1 source tracked

AI video editing models compared; SpaceX inks $6.3B compute deal

Users seek better leaderboards for quantized AI models

Open AI Models: Minimal Downside to Switching from Proprietary Leaders

New Intelligence Index Ranks Frontier AI Models

GLM-5.2 emerges as top open-weight AI model, rivaling GPT-5.5

GLM-5.2 matches Anthropic Opus 4.8 on coding, driving cost competition

Artificial Analysis unveils new AABriefcase benchmark for AI systems

LLM Gateway Latency Overheads Are Negligible, Developer Finds

MiniMax M3 model tops leaderboards, offers free access, and integrates with Unreal Engine

GLM-5.2 leads open weights models on Artificial Analysis Intelligence Index · 4 sources tracked

Artificial Analysis updates Intelligence Index for AI model evaluation · 2 sources tracked

DeepSeek V4 Pro tops speed and latency benchmarks on Together AI

AI Model Race Visualized: Trends from 2022-2026

Anthropic suspends Fable/Mythos models citing US gov directive

Together AI benchmarks Blackwell hardware for agent infrastructure

NVIDIA Blackwell Systems Lead New Agentic AI Benchmarks

MiniMax M3 launches with 512K context on Fireworks AI