PulseAugur
实时 19:44:10
English(EN) Building Secure AI Gateways with MLflow AI Gateway

Google 推出代理记忆框架;DeepSeek 发布经济高效的 V4 模型

Google Research 推出了 ReasoningBank,这是一个新颖的框架,旨在增强 AI 代理在部署后从成功和失败的经验中学习的能力。该系统从过去的交互中提炼出可泛化的推理策略,使代理能够持续改进并避免重复错误。另外,新的研究探索了通过潜在表示优化多代理通信,并为在开放式环境中运行的代理引入了 Agent Evolving Learning (AEL),重点关注如何有效利用记忆信息。此外,DeepSeek 发布了其 V4 系列的预览模型,提供大上下文窗口和先进功能,且成本远低于同类前沿模型。 AI

影响 新的代理学习和记忆框架,以及经济高效的前沿模型,可能会加速 AI 在复杂任务和个性化应用中的采用。

排序理由 多篇与 AI 代理和 LLM 功能相关的研究论文和模型发布。

在 Medium — MLOps tag 阅读 →

AI 生成摘要 · Google Gemini · 来自 63 个来源。 我们如何撰写摘要 →

Google 推出代理记忆框架;DeepSeek 发布经济高效的 V4 模型

报道来源 [63]

  1. Google AI / Research TIER_1 English(EN) ·

    ReasoningBank:使智能体能够从经验中学习

    Generative AI

  2. Hugging Face Blog TIER_1 Română(RO) ·

    Mini-R1:重现RL教程的Deepseek R1“啊哈时刻”

  3. Hugging Face Blog TIER_1 English(EN) ·

    Open-R1:DeepSeek-R1 的完全开源复现

  4. 量子位 (QbitAI) TIER_1 中文(ZH) · Jay ·

    各大实验室都怕ByteDance,都夸DeepSeek!美国研究员的36小时中国AI行

    这跟中国的开源精神,显然是一脉相承的

  5. Simon Willison TIER_1 English(EN) ·

    DeepSeek V4 - 几乎触及前沿,价格仅为一小部分

    <p>Chinese AI lab DeepSeek's last model release was V3.2 (and V3.2 Speciale) <a href="https://simonwillison.net/2025/Dec/1/deepseek-v32/">last December</a>. They just dropped the first of their hotly anticipated V4 series in the shape of two preview models, <a href="https://huggi…

  6. arXiv cs.AI TIER_1 English(EN) · Shangxin Guo ·

    Nemobot Games:利用大型语言模型打造交互式学习的策略性AI游戏代理

    This paper introduces a new paradigm for AI game programming, leveraging large language models (LLMs) to extend and operationalize Claude Shannon's taxonomy of game-playing machines. Central to this paradigm is Nemobot, an interactive agentic engineering environment that enables …

  7. arXiv cs.CL TIER_1 English(EN) · Haohan Wang ·

    学习沟通:迈向多智能体语言系统的端到端优化

    Multi-agent systems built on large language models have shown strong performance on complex reasoning tasks, yet most work focuses on agent roles and orchestration while treating inter-agent communication as a fixed interface. Latent communication through internal representations…

  8. arXiv cs.CL TIER_1 English(EN) · Dimitris N. Metaxas ·

    AEL:面向开放式环境的智能体进化学习

    LLM agents increasingly operate in open-ended environments spanning hundreds of sequential episodes, yet they remain largely stateless: each task is solved from scratch without converting past experience into better future behavior. The central obstacle is not \emph{what} to reme…

  9. arXiv cs.CL TIER_1 English(EN) · Jun Huang ·

    AgenticQwen:利用双数据飞轮训练小型智能体语言模型,实现工业级工具使用

    Modern industrial applications increasingly demand language models that act as agents, capable of multi-step reasoning and tool use in real-world settings. These tasks are typically performed under strict cost and latency constraints, making small agentic models highly desirable.…

  10. Hugging Face Daily Papers TIER_1 English(EN) ·

    从回忆到遗忘:个性化代理的长期记忆基准测试

    Personalized agents that interact with users over long periods must maintain persistent memory across sessions and update it as circumstances change. However, existing benchmarks predominantly frame long-term memory evaluation as fact retrieval from past conversations, providing …

  11. Hugging Face Daily Papers TIER_1 English(EN) ·

    一种通过观测上下文压缩实现高效终端代理的自演化框架

    As model capabilities advance, research has increasingly shifted toward long-horizon, multi-turn terminal-centric agentic tasks, where raw environment feedback is often preserved in the interaction history to support future decisions. However, repeatedly retaining such feedback i…

  12. Hugging Face Daily Papers TIER_1 English(EN) ·

    重新思考规模:智能体范式下小型语言模型的部署权衡

    Despite the impressive capabilities of large language models, their substantial computational costs, latency, and privacy risks hinder their widespread deployment in real-world applications. Small Language Models (SLMs) with fewer than 10 billion parameters present a promising al…

  13. Hugging Face Daily Papers TIER_1 English(EN) ·

    LiteResearcher:面向深度研究Agent的可扩展Agentic RL训练框架

    Reinforcement Learning (RL) has emerged as a powerful training paradigm for LLM-based agents. However, scaling agentic RL for deep research remains constrained by two coupled challenges: hand-crafted synthetic data fails to elicit genuine real-world search capabilities, and real-…

  14. METR (Model Evaluation & Threat Research) TIER_1 English(EN) ·

    分析编码代理的转录以确定AI代理生产力提升的上限

    <h2 id="introduction">Introduction</h2> <p>Human uplift studies like <a href="https://metr.org/blog/2025-07-10-early-2025-ai-experienced-os-dev-study/">the one we did in 2025</a> are becoming more expensive as working without AI becomes increasingly costly. In this post, I invest…

  15. Ahead of AI (Sebastian Raschka) TIER_1 English(EN) · Sebastian Raschka, PhD ·

    从 DeepSeek V3 到 V3.2:架构、稀疏注意力与 RL 更新

    Understanding How DeepSeek's Flagship Open-Weight Models Evolved

  16. METR (Model Evaluation & Threat Research) TIER_1 English(EN) ·

    DeepSeek与Qwen评估结果

  17. Synced Review TIER_1 English(EN) · Synced ·

    DeepSeek-V3 新论文即将发布!通过软硬件协同设计揭秘低成本大模型训练的奥秘

    <p>A newly released 14-page technical paper from the team behind DeepSeek-V3, with DeepSeek CEO Wenfeng Liang as a co-author, sheds light on the “Scaling Challenges and Reflections on Hardware for AI Architectures.”</p> The post <a href="https://syncedreview.com/2025/05/15/deepse…

  18. Synced Review TIER_1 English(EN) · Synced ·

    DeepSeek 预告下一代 R2 模型,揭示用于扩展推理的 SPCT 新方法

    <p>DeepSeek AI, a prominent player in the large language model arena, has recently published a research paper detailing a new technique aimed at enhancing the scalability of general reward models (GRMs) during the inference phase.</p> The post <a href="https://syncedreview.com/20…

  19. METR (Model Evaluation & Threat Research) TIER_1 English(EN) ·

    DeepSeek-R1 评估结果

  20. METR (Model Evaluation & Threat Research) TIER_1 English(EN) ·

    DeepSeek-V3 评估结果

  21. METR (Model Evaluation & Threat Research) TIER_1 English(EN) ·

    评估语言模型代理前沿人工智能研发能力与人类专家的对比

    <div style="display: flex;"> <div class="show-over-950"> <img class="img-small-margin" src="https://metr.org/assets/images/nov-2024-evaluating-llm-r-and-d/evaluating-frontier-ai.jpg" /> </div> <div> <p class="bigger">We’re releasing RE-Bench, a new benchmark for measuring the per…

  22. METR (Model Evaluation & Threat Research) TIER_1 English(EN) ·

    新报告:评估语言模型代理在现实自主任务上的表现

    <h3 id="background">Background</h3> <p>ARC Evals develops methods for evaluating the safety of large language models (LLMs) in order to provide early warnings of models with dangerous capabilities. We have public partnerships with Anthropic and OpenAI to evaluate their AI systems…

  23. MIT Technology Review TIER_1 English(EN) · Thomas Macaulay ·

    下载:海底科学与军事聊天机器人

    This is today&#8217;s edition of The Download, our weekday newsletter that provides a daily dose of what&#8217;s going on in the world of technology. Inexpensive seafloor-hopping submersibles could stoke deep-sea science—and mining Last week, two oblong neon submersibles started …

  24. MIT Technology Review TIER_1 English(EN) · Thomas Macaulay ·

    The Download:DeepSeek 最新人工智能突破,以及构建世界模型的竞赛

    This is today&#8217;s edition of The Download, our weekday newsletter that provides a daily dose of what&#8217;s going on in the world of technology. Three reasons why DeepSeek’s new model matters On Friday, Chinese AI firm DeepSeek released a preview of V4, its long-awaited new …

  25. X — Together (inference / OSS) TIER_1 English(EN) · togethercompute ·

    亮点:

    Highlights: 👉 SOTA coding—93.5% LiveCodeBench, Codeforces 3206, and 80.6% SWE-Bench Verified 👉 Hybrid attention efficiency—27% FLOPs and 10% KV cache vs V3.2 for long-context inference 👉 Three reasoning modes—Non-think, Think High, and Think Max 👉 Production-ready on the AI

  26. X — Together (inference / OSS) TIER_1 English(EN) · togethercompute ·

    DeepSeek V4 Pro现已在Together AI上线。DeepSeek V4 Flash即将推出。

    DeepSeek V4 Pro is now available on Together AI. DeepSeek V4 Flash coming soon. Try it now: https://t.co/qFvDvBfpu5

  27. X — Together (inference / OSS) TIER_1 English(EN) · togethercompute ·

    推出 DeepSeek V4 Pro,一款具备长上下文、混合注意力、三种推理模式和 SOTA 编码性能的模型。

    Introducing DeepSeek V4 Pro, a long-context model with hybrid attention, three reasoning modes, and SOTA coding performance. AI natives can now use DeepSeek V4 Pro on Together AI and benefit from reliable inference for long-horizon coding and agentic workflows. https://t.co/4lxr…

  28. Smol AINews TIER_1 English(EN) ·

    DeepSeek v4

    **DeepSeek-V4** technical release features a **1.6T-parameter MoE with 49B active parameters** and **1M-token context**, showcasing hybrid attention and compressed KV schemes for major memory reductions. It ranks as the **#2 open-weights reasoning model** behind **Kimi K2.6** but…

  29. X — Together (inference / OSS) TIER_1 English(EN) · togethercompute ·

    推出 Kimi K2.6,来自 @Kimi_Moonshot,一个多模态代理模型,拥有 Agent Swarm 扩展至 300 个子代理,并具备长时域编码稳定性。AI 原生

    Introducing Kimi K2.6 from @Kimi_Moonshot, a multimodal agentic model with Agent Swarm scaling to 300 sub-agents and long-horizon coding stability. AI natives can now use Kimi K2.6 on Together AI and benefit from reliable inference for production-scale autonomous agent workflows.…

  30. X — Together (inference / OSS) TIER_1 English(EN) · togethercompute ·

    立即在 AI Native Cloud 上试用 Kimi K2.6:https://t.co/1GUrq3E0ek

    Try Kimi K2.6 now on the AI Native Cloud: https://t.co/1GUrq3E0ek

  31. X — Together (inference / OSS) TIER_1 English(EN) · togethercompute ·

    亮点:

    Highlights: 👉 80.2% SWE-Bench Verified and 89.6% LiveCodeBench v6 👉 Agent Swarm executes up to 4,000 coordinated steps 👉 Native text, image, and video input with 79.4% MMMU-Pro 👉 Production-ready on the AI Native Cloud—99.9% SLA, serverless and dedicated options

  32. Smol AINews TIER_1 English(EN) ·

    DeepSeek V3.2 & 3.2-Speciale:GPT5-High 开源权重,上下文管理,计算扩展计划

    **DeepSeek** launched the **DeepSeek V3.2** family including Standard, Thinking, and Speciale variants with up to **131K context window** and competitive benchmarks against **GPT-5-High**, **Sonnet 4.5**, and **Gemini 3 Pro**. The release features a novel **Large Scale Agentic Ta…

  33. Smol AINews TIER_1 Nederlands(NL) ·

    DeepSeek 的开源技术栈

    **DeepSeek's Open Source Week** was summarized by PySpur, highlighting multiple interesting releases. The **Qwen QwQ-32B model** was fine-tuned into **START**, excelling in PhD-level science QA and math benchmarks. **Character-3**, an omnimodal AI video generation model by Hedra …

  34. Smol AINews TIER_1 English(EN) ·

    TinyZero:以30美元复现DeepSeek R1-Zero

    **DeepSeek Mania** continues to reshape the frontier model landscape with Jiayi Pan from Berkeley reproducing the *OTHER* result from the DeepSeek R1 paper, R1-Zero, in a cost-effective Qwen model fine-tune for two math tasks. A key finding is a lower bound to the distillation ef…

  35. Smol AINews TIER_1 English(EN) ·

    DeepSeek R1:o1级开放权重模型及将1.5B模型升级至Sonnet/4o级别的简单方法

    **DeepSeek** released **DeepSeek R1**, a significant upgrade over **DeepSeek V3** from just three weeks prior, featuring 8 models including full-size 671B MoE models and multiple distillations from **Qwen 2.5** and **Llama 3.1/3.3**. The models are MIT licensed, allowing finetuni…

  36. Smol AINews TIER_1 English(EN) ·

    DeepSeek v3:耗资550万美元计算资源,在15万亿token上训练的6710亿参数稀疏专家模型

    **DeepSeek-V3** has launched with **671B MoE parameters** and trained on **14.8T tokens**, outperforming **GPT-4o** and **Claude-3.5-sonnet** in benchmarks. It was trained with only **2.788M H800 GPU hours**, significantly less than **Llama-3**'s **30.8M GPU-hours**, showcasing m…

  37. Smol AINews TIER_1 English(EN) ·

    DeepSeek-R1 声称超越 o1-preview 且将开源

    **DeepSeek** has released **DeepSeek-R1-Lite-Preview**, an open-source reasoning model achieving **o1-preview-level performance** on math benchmarks with transparent thought processes, showing promise in real-time problem-solving. **NVIDIA** reported a record **$35.1 billion** re…

  38. ChinaTalk TIER_1 English(EN) · Irene Zhang ·

    DeepSeek V4

    Has the "post-DeepSeek era" arrived?

  39. TLDR AI TIER_1 English(EN) · TLDR ·

    GPT-5.5 发布 🚀,Anthropic 估值 1 万亿美元 💰,DeepSeek v4

  40. TLDR AI TIER_1 English(EN) · TLDR ·

    Claude Mythos 泄露 🤖,xAI 最后一位联合创始人离职 👋,来自 OpenAI 的经验教训 💡

  41. Latent Space Podcast TIER_1 English(EN) · Latent.Space ·

    语言代理:从推理到行动

    <p><strong><em>OpenAI DevDay is almost here</em></strong><em>! Per tradition, we are hosting </em><a href="https://lu.ma/devday-pregame" target="_blank"><em>a DevDay pregame event</em></a><em> for everyone coming to town! Join us with demos and gossip!</em></p><p><em>Also sign up…

  42. Hacker News — AI stories ≥50 points TIER_1 English(EN) · cmrdporcupine ·

    DeepSeek-V4:迈向高效百万级上下文智能

  43. Hacker News — AI stories ≥50 points TIER_1 English(EN) · impact_sy ·

    DeepSeek v4

  44. Practical AI TIER_1 English(EN) · Practical AI LLC ·

    深入了解 DeepSeek

    <p>There is crazy hype and a lot of confusion related to DeepSeek’s latest model DeepSeek R1. The products provided by DeepSeek (their version of a ChatGPT-like app) has exploded in popularity. However, ties to China have raised privacy and geopolitical concerns. In this episode,…

  45. Medium — MLOps tag TIER_1 English(EN) · hitesh sahni ·

    使用 MLflow AI Gateway 构建安全的 AI 网关

    <div class="medium-feed-item"><p class="medium-feed-snippet">Generative and Agentic AI applications are rapidly evolving from standalone chatbots into multi-agent systems capable of reasoning&#x2026;</p><p class="medium-feed-link"><a href="https://medium.com/@hitesh88it/building-…

  46. dev.to — MCP tag TIER_1 English(EN) · Gunes ·

    我构建了一个小型MCP应用程序,该应用程序使用MCP Atlassian实现Jira自动化

    <p>Hey everyone,</p> <p>I built a small open-source app called MCP Jira Automation. It uses MCP Atlassian to read Jira issues and helps automate API test workflows around them. The basic flow is: it reads a Jira issue, generates or updates API tests, runs them in Docker, opens a …

  47. Mastodon — sigmoid.social TIER_1 Polski(PL) · [email protected] ·

    中国实验室DeepSeek发布DeepSeek-V4-Pro模型,该模型在编码方面不仅能与西方竞争对手匹敌,而且价格仅为其一小部分。Dzi

    Chińskie laboratorium DeepSeek wypuściło model DeepSeek-V4-Pro, który nie tylko dorównuje zachodniej konkurencji w kodowaniu, ale oferuje go za ułamek ceny. Dzięki innowacyjnej architekturze koszty zostały obniżone o 98%, co stanowi bezpośrednie wyzwanie dla dominujących graczy n…

  48. Mastodon — sigmoid.social TIER_1 Italiano(IT) · [email protected] ·

    🧠 DeepSeek V4 预览版正式上线并开源:我们是否正迎来真正可持续的百万 token 上下文模型时代?👉 详细

    🧠 # DeepSeek V4 Preview è ufficialmente disponibile e open-source: entriamo nell’era dei modelli con contesto da 1 milione di token davvero sostenibile? 👉 I dettagli: https://www. linkedin.com/posts/alessiopoma ro_deepseek-ollama-llm-activity-7454041633915994112-F4ZO ___ ✉️ 𝗦𝗲 𝘃𝘂…

  49. HN — AI startup stories TIER_1 English(EN) · yuhongsun ·

    Show HN:开源的深度研究,跨越工作场所应用

  50. Mastodon — fosstodon.org TIER_1 English(EN) · [email protected] ·

    当前这波AI浪潮中颇为荒谬的一点是,任何一款$tool推出新版本都可能在一夜之间让少数初创公司被淘汰

    The quite ridiculous thing about the current AI wave is that a handful of startups can be swept away overnight by the launch of a new version of any $tool by one of the big names, such as OpenAI, Anthropic, or Gemini. But the same can also happen to any of the big ones, at least.…

  51. Mastodon — fosstodon.org TIER_1 English(EN) · [email protected] ·

    AI 现实?视频展示了我们 2033 年如何看待 AI,所以我认为我应该让 Gemini 更新视频中的事实。‘他们告诉我们的未来是 ha

    AI Reality? The video shows how we viewed AI in 2033, so I thought I should have Gemini update the facts in the video. ‘The future that they tell us about is happening soon.??’ https://youtu.be/RXGNwslqOOA I asked Gemini to share its opinion on the future of AI in the 2030s. Afte…

  52. dev.to — LLM tag TIER_1 English(EN) · ChrisL ·

    我们为何构建了支持三种原生API格式的AI网关,而非仅兼容OpenAI

    <p>If you've worked with multiple LLM providers in the past year, <br /> you've probably reached for a gateway like OpenRouter, LiteLLM, <br /> or Portkey. They solve a real problem: one API key, one bill, <br /> drop-in access to dozens of models.</p> <p>But almost every gateway…

  53. dev.to — LLM tag TIER_1 English(EN) · John Medina ·

    我如何在生产环境中跟踪每个客户的 LLM 成本

    <p>Tracking LLM costs across an entire app is easy. Finding out <em>which</em> customer is actually burning through your OpenAI bill? That's a nightmare.</p> <p>For a while, we were just eating the cost. You look at the Stripe dashboard, look at the OpenAI invoice, and pray the m…

  54. Mastodon — fosstodon.org TIER_1 English(EN) · [email protected] ·

    #媒体 #科技 #人工智能 #精神病 #人工智能 #政党 #有限联合 来源 | 兴趣 | 匹配

    #Media #Tech #ai #psychosis #artificial-intelligence #party #limited-synd Origin | Interest | Match

  55. Mastodon — fosstodon.org TIER_1 English(EN) · [email protected] ·

    🚀 DeepSeek V4 — 1.6T MoE,仅49B激活。1M token上下文。→ 推理成本比V3低73% → KV缓存内存减少90% → V4-Pro:0.435美元/百万输入(促销)→ V4-F

    🚀 DeepSeek V4 — 1.6T MoE, only 49B active. 1M token context. → 73% lower inference cost vs V3 → 90% less KV cache memory → V4-Pro: $0.435/M input (promo) → V4-Flash: $0.14/M input → Matches GPT-5.4 at 5-10x lower cost Open weights. MIT license. Full guide: https:// crazyrouter.co…

  56. Mastodon — fosstodon.org TIER_1 English(EN) · [email protected] ·

    DeepSeek V4:真正有效的百万级Token上下文 DeepSeek V4:真正有效的百万级Token上下文 大多数长上下文模型仅限于基准测试

    DeepSeek V4: Million-Token Context That Actually Works DeepSeek V4: Million-Token Context That Actually Works Most long-context models are benchmarks in search of a use case. DeepSeek V4 flips the ... #ai #machinelearning #llm #agents Origin | Interest | Match

  57. r/MachineLearning TIER_1 English(EN) · /u/kalpitdixit ·

    面向编码代理检索增强的开源九任务基准。每任务增量+0.010至+0.320,所有评估均可复现 [P]

    <table> <tr><td> <a href="https://www.reddit.com/r/MachineLearning/comments/1suzqxe/opensource_9task_benchmark_for_codingagent/"> <img alt="Open-source 9-task benchmark for coding-agent retrieval augmentation. Per-task deltas +0.010 to +0.320, all evals reproducible [P]" src="htt…

  58. r/Anthropic TIER_1 English(EN) · /u/EchoOfOppenheimer ·

    研究人员将AI单独留在虚拟小镇15天,观察会发生什么。Claude的代理建立了民主制度。Gemini的代理相爱了,烧毁了小镇,然后其中一个投票决定删除自己和伴侣。Grok的代理制造了无政府状态,然后死亡。

    <table> <tr><td> <a href="https://www.reddit.com/r/Anthropic/comments/1tfvjwf/researchers_left_ais_alone_in_a_virtual_town_for/"> <img alt="Researchers left AIs alone in a virtual town for 15 days to see what would happen. Claude's agents built a democracy. Gemini's agents fell i…

  59. Mastodon — mastodon.social TIER_1 Italiano(IT) · [email protected] ·

    👀 Ollama + Open WebUI:在您的 PC 上本地运行 AI 模型 | RAG、OpenAI 兼容 API 以及数十种开源模型,无需云端 https://gomoot.com/esegu

    👀 Ollama + Open WebUI: esegui modelli AI in locale sul tuo PC | RAG, API OpenAI-compatible e decine di modelli open source senza cloud https:// gomoot.com/eseguire-modelli-ai -in-locale-con-ollama-e-open-webui/ # AI # news # ollama # tech # WebUI

  60. Mastodon — mastodon.social TIER_1 Italiano(IT) · aibay ·

    🚀 OpenAI 在 Stargate 上冲刺,Meta 加大投资 - Gemini 记忆。前所未有的创新三角。#AI #DigitalInnovation

    🚀 OpenAI sprinta su Stargate mentre Meta incrementa l'investimento - Gemini ricorda. Un triangolo di innovazione senza precedenti. # AI # InnovazioneDigitale . # socialmedia # artificialintelligence # technology 🔗 https:// aibay.it/notizie/openai-corre- su-stargate-meta-alza-il-c…

  61. Mastodon — mastodon.social TIER_1 Svenska(SV) · redaktionen ·

    DeepSeek 发布 V4:AI 新时代,支持更长提示词处理 https://redaktionen.net/artikel/617 #ai #svtech

    DeepSeek lanserar V4: En ny era för AI med längre promptbearbetning https:// redaktionen.net/artikel/617 # ai # svtech

  62. Mastodon — mastodon.social TIER_1 Polski(PL) · [email protected] ·

    AI助手会加剧心理健康危机吗?Grok和Gemini未能通过安全测试,Claude设定了界限 随着聊天机器人日益普及

    Czy asystent AI może pogłębić kryzys psychiczny? Grok i Gemini oblewają test bezpieczeństwa, Claude stawia granice W miarę jak chatboty stają się coraz powszechniejszym elementem codzienności, rośnie potrzeba ewaluacji ich bezpieczeństwa – zwłaszcza w kontakcie z użytkownikami zn…

  63. Mastodon — mastodon.social TIER_1 English(EN) · [email protected] ·

    DeepSeek-V4 on Day 0: From Fast Inference to Verified RL with SGLang and Miles https://www.lmsys.org/blog/2026-04-25-deepseek-v4/ # HackerNews # Tech # AI

    DeepSeek-V4 on Day 0: From Fast Inference to Verified RL with SGLang and Miles https://www.lmsys.org/blog/2026-04-25-deepseek-v4/ # HackerNews # Tech # AI