PulseAugur
EN
LIVE 12:10:13
ENTITY Artificial Analysis

Artificial Analysis

PulseAugur coverage of Artificial Analysis — every cluster mentioning Artificial Analysis across labs, papers, and developer communities, ranked by signal.

Show in brief
Total · 30d
48
48 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
6
6 over 90d
TIER MIX · 90D
TOPICS
RELATIONSHIPS
TIMELINE
  1. 2026-06-16 research_milestone Artificial Analysis released an updated version of its Intelligence Index, version 4.1, which includes a greater emphasis on agentic workloads and improved benchmarks. source
SENTIMENT · 30D

17 day(s) with sentiment data

RECENT · PAGE 1/3 · 48 TOTAL
  1. TOOL · CL_109838 ·

    Together AI claims world's fastest speech-to-text stack

    Together AI has developed a speech-to-text system that achieves industry-leading speed. Their 'parakeet' model, running on Together's infrastructure, processes audio at approximately 302 seconds of audio per second of p…

  2. RESEARCH · CL_108898 ·

    Krea 2: New 12B open-weights image model prioritizes creative exploration

    Krea 2, a new 12B parameter open-weights image generation model, has been released with a focus on creative exploration rather than just polished defaults. The model utilizes a diffusion transformer architecture and a m…

  3. SIGNIFICANT · CL_106010 ·

    Z.ai's GLM-5.2 achieves near-top-tier performance in open-weight models · 1 source tracked

    GLM-5.2, an open-weight model from Z.ai, has demonstrated significant advancements, closing the gap with leading proprietary models. In just ten weeks, GLM-5.2 improved its Artificial Analysis Intelligence Index score b…

  4. TOOL · CL_104149 ·

    AI video editing models compared; SpaceX inks $6.3B compute deal

    Artificial Analysis has launched a video editing model comparison arena, allowing users to vote on the performance of models like Seedance 2.0, Runway Aleph 2.0, Wan 2.7, HappyHorse 1.0, Kling 3.0 Omni, and SkyReels V4 …

  5. COMMENTARY · CL_103498 ·

    Users seek better leaderboards for quantized AI models

    A user on r/LocalLLaMA is seeking a better method for comparing the performance of quantized large language models. They find the existing "Artificial Analysis" leaderboard useful for assessing model intelligence but no…

  6. COMMENTARY · CL_103107 ·

    Open AI Models: Minimal Downside to Switching from Proprietary Leaders

    Andrew Marble argues that the professional risks associated with using open-source AI models are diminishing, drawing parallels to the past transition from Windows to Linux. While proprietary models like Claude and GPT …

  7. TOOL · CL_106338 ·

    New Intelligence Index Ranks Frontier AI Models

    Artificial Analysis has developed an "Intelligence Index" to quantify the capabilities of frontier AI models. This index is a weighted average of nine evaluations, with a strong emphasis on agentic tasks. While closed-s…

  8. SIGNIFICANT · CL_100054 ·

    GLM-5.2 emerges as top open-weight AI model, rivaling GPT-5.5

    The open-weight language model GLM-5.2 has garnered significant attention, with multiple sources indicating it performs comparably to frontier models like GPT-5.5 and Anthropic's Opus 4.8. This model features architectu…

  9. TOOL · CL_99829 ·

    GLM-5.2 matches Anthropic Opus 4.8 on coding, driving cost competition

    Artificial Analysis has ranked GLM-5.2 as the leading open-weight model, noting its performance on coding tasks is comparable to Anthropic's Opus 4.8. This development suggests significant cost competition for major AI …

  10. TOOL · CL_99467 ·

    Artificial Analysis unveils new AABriefcase benchmark for AI systems

    Artificial Analysis has introduced a new benchmark named AABriefcase, designed to evaluate AI systems. The announcement was made via a post on X, formerly Twitter, and shared on Reddit's r/singularity.

  11. COMMENTARY · CL_97924 ·

    LLM Gateway Latency Overheads Are Negligible, Developer Finds

    A developer spent a month meticulously benchmarking LLM gateway latency, only to discover that the gateway's contribution to overall request time was negligible, often less than 1%. The actual performance bottlenecks li…

  12. TOOL · CL_97108 ·

    MiniMax M3 model tops leaderboards, offers free access, and integrates with Unreal Engine

    MiniMax AI's M3 model is gaining recognition, topping leaderboards and being offered for free access on B.AI. The model is also being integrated into hackathons and tested with advanced software like Unreal Engine 5.8 M…

  13. RESEARCH · CL_96526 ·

    GLM-5.2 leads open weights models on Artificial Analysis Intelligence Index · 4 sources tracked

    Z.ai's GLM-5.2 has been recognized as the top-performing open-weights model on the Artificial Analysis Intelligence Index, achieving a score of 51. Despite maintaining the same scale as its predecessor, GLM-5.1, GLM-5.2…

  14. RESEARCH · CL_94468 ·

    Artificial Analysis updates Intelligence Index for AI model evaluation · 2 sources tracked

    Artificial Analysis has released version 4.1 of its Intelligence Index, a comprehensive metric for evaluating model intelligence. This update places a greater emphasis on agentic workloads and incorporates improved benc…

  15. RESEARCH · CL_90305 ·

    DeepSeek V4 Pro tops speed and latency benchmarks on Together AI

    DeepSeek V4 Pro, when deployed on the Together AI platform, has achieved the top ranking on Artificial Analysis for both output speed and latency. This performance is attributed to advancements in inference systems, inc…

  16. COMMENTARY · CL_89700 ·

    AI Model Race Visualized: Trends from 2022-2026

    Jianqi Pan created a visualization of the AI model race from 2022 to 2026, based on data from Artificial Analysis. The visualization highlights the competitive landscape and trends in AI model benchmarks rather than foc…

  17. RESEARCH · CL_88579 ·

    Anthropic suspends Fable/Mythos models citing US gov directive

    Anthropic has suspended access to its Fable 5 and Mythos 5 models for all customers worldwide following a directive from the U.S. government, citing national cybersecurity risks. This abrupt revocation has disrupted dow…

  18. RESEARCH · CL_88322 ·

    Together AI benchmarks Blackwell hardware for agent infrastructure

    Together AI has released benchmarks demonstrating the performance of their inference stack on NVIDIA's Blackwell hardware, showing a 31% increase in transactions per second compared to other open-source engines. This pe…

  19. RESEARCH · CL_88265 ·

    NVIDIA Blackwell Systems Lead New Agentic AI Benchmarks

    NVIDIA has set new performance records on the first agentic AI benchmarks, AgentPerf and Agentic AI Benchmark. The company's GB300 NVL72 system, powered by Blackwell architecture, demonstrated up to a 20x performance le…

  20. SIGNIFICANT · CL_88106 ·

    MiniMax M3 launches with 512K context on Fireworks AI

    MiniMax AI has launched its M3 model, available on the Fireworks AI platform. This new model boasts a 512K context window, native image and video input capabilities, and utilizes MSA sparse attention for significantly f…