PulseAugur / Brief
EN
LIVE 00:01:05

Brief

last 24h
[3/3] 221 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

  1. Which LLM is the best stock picker? I built a benchmark to find out.

    A new benchmark, dubbed 1rok, has been launched to evaluate the stock-picking capabilities of frontier large language models. The benchmark assigns each participating LLM a virtual portfolio of $100,000 and tasks them with selecting stocks weekly, with performance tracked against market outcomes. This initiative aims to provide a more practical, downstream evaluation of LLMs beyond traditional coding and reasoning benchmarks, focusing on decision-making under uncertainty. AI

    Which LLM is the best stock picker? I built a benchmark to find out.

    IMPACT Provides a novel benchmark for evaluating LLM decision-making under uncertainty, moving beyond traditional coding and reasoning tasks.

  2. xAI retired 8 Grok models on May 15 — the slugs still resolve, so your bill and output quality changed silently

    xAI silently retired eight Grok model slugs on May 15, 2026, without requiring code changes from users. This change redirects requests to different, more expensive models and alters reasoning capabilities without explicit error signals. The silent nature of this deprecation means that cost attribution dashboards may become inaccurate, and applications relying on specific model behaviors could experience degraded performance or unexpected cost increases. AI

    xAI retired 8 Grok models on May 15 — the slugs still resolve, so your bill and output quality changed silently

    IMPACT Developers face silent cost increases and potential performance degradation due to opaque model deprecations, necessitating robust monitoring and configuration management.

  3. Connect Grok to Hermes Agent

    xAI has integrated its Grok AI model into Nous Research's open-source Hermes Agent. This allows users to leverage Grok 4.3, its text-to-speech capabilities, and image generation features directly within the self-improving Hermes Agent. The integration aims to enhance information gathering and agent functionality by combining Grok's advanced reasoning with Hermes' persistent memory and learning capabilities. AI

    Connect Grok to Hermes Agent

    IMPACT Enhances agent capabilities by integrating advanced reasoning and generative features, potentially improving information gathering and task automation.