Brief

last 24h

[3/3] 221 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

TOOL · dev.to — LLM tag English(EN) · 5d

Which LLM is the best stock picker? I built a benchmark to find out.

A new benchmark, 1rok, evaluates the stock-picking ability of frontier large language models. Each participating LLM receives a virtual portfolio of $100,000 and selects stocks weekly, with performance tracked against market outcomes. The aim is a more practical, downstream evaluation than traditional coding and reasoning benchmarks, focused on decision-making under uncertainty.

IMPACT Provides a novel benchmark for evaluating LLM decision-making under uncertainty, moving beyond traditional coding and reasoning tasks.
- OpenAI
- Google
- xAI
- GPT-5.5
- Gemini 3.1 Pro Preview
- Kimi K2.6
- GLM-5.1
- DeepSeek V4 Pro
- Moonshot
- Grok 4.3
- MiniMax M2.7
- 1rok
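
The portfolio mechanics in the 1rok summary above (a $100,000 virtual portfolio, weekly picks, value tracked against market prices) can be illustrated with a minimal sketch. The summary does not say how 1rok sizes positions, so the equal-weight rebalance, the fractional shares, the absence of fees, and every name here (`Portfolio`, `rebalance`, `value`) are assumptions for illustration, not the benchmark's actual method.

```python
from dataclasses import dataclass, field


@dataclass
class Portfolio:
    """Hypothetical virtual portfolio for one model, starting at $100,000."""
    cash: float = 100_000.0
    holdings: dict[str, float] = field(default_factory=dict)  # ticker -> shares

    def value(self, prices: dict[str, float]) -> float:
        """Mark the portfolio to market at the given prices."""
        return self.cash + sum(
            shares * prices[ticker] for ticker, shares in self.holdings.items()
        )

    def rebalance(self, picks: list[str], prices: dict[str, float]) -> None:
        """Sell everything, then split the proceeds equally across the picks.

        Assumption: equal weighting, fractional shares, no trading costs.
        """
        total = self.value(prices)
        picks = list(dict.fromkeys(picks))  # drop duplicate tickers, keep order
        if not picks:
            # No picks this week: hold everything in cash.
            self.cash, self.holdings = total, {}
            return
        per_pick = total / len(picks)
        self.holdings = {ticker: per_pick / prices[ticker] for ticker in picks}
        self.cash = 0.0
```

Calling `rebalance` once per week with a model's picks and then `value` with the following week's prices gives the series that would be compared against a market index.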
TOOL · dev.to — LLM tag English(EN) · 1w · [2 sources]

xAI retired 8 Grok models on May 15 — the slugs still resolve, so your bill and output quality changed silently

xAI retired eight Grok model slugs on May 15, 2026. The retired slugs still resolve, so existing code keeps working without changes and without any explicit error signal, but requests are redirected to different, more expensive models with altered reasoning capabilities. Because the deprecation is silent, cost attribution dashboards may become inaccurate, and applications that rely on specific model behavior could see degraded performance or unexpected cost increases.

IMPACT Developers face silent cost increases and potential performance degradation due to opaque model deprecations, necessitating robust monitoring and configuration management.
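
One way to approach the monitoring mentioned above is to compare the slug an application requested with the model name the API reports back. The sketch below works on plain log records, not on any real client library: the record keys `requested` and `served` and the function name `find_slug_drift` are hypothetical, and it assumes the provider's response includes the name of the model that actually served the request.

```python
from collections import Counter


def find_slug_drift(calls: list[dict]) -> Counter:
    """Count calls where the served model differs from the requested slug.

    Each record is a hypothetical log entry with two keys:
    "requested" (the slug the application sent) and "served" (the model
    name reported in the response, if one was reported).
    """
    drift = Counter()
    for call in calls:
        requested = call["requested"]
        served = call.get("served")
        # Skip records with no reported model; they carry no evidence.
        if served is not None and served != requested:
            drift[(requested, served)] += 1
    return drift
```

A mismatch is a prompt to investigate rather than proof of a retirement, since an alias can legitimately resolve to a versioned model name. Reviewing the resulting `(requested, served)` pairs against current pricing is what would surface a silent cost change.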
TOOL · xAI news English(EN) · 1w · [7 sources]

Connect Grok to Hermes Agent

xAI has integrated Grok into Nous Research's open-source Hermes Agent. Users can now use Grok 4.3, along with its text-to-speech and image generation features, directly within the self-improving Hermes Agent. The integration aims to improve information gathering and agent functionality by combining Grok's reasoning with Hermes Agent's persistent memory and learning capabilities.

IMPACT Enhances agent capabilities by integrating advanced reasoning and generative features, potentially improving information gathering and task automation.
- xAI
- Grok
- Nous Research
- Hermes Agent
- Grok 4.3
