Pulse

last 48h

[8/8] 89 sources

What AI is actually talking about — clusters surfacing on Bluesky, Reddit, HN, Mastodon and Lobsters, re-ranked to elevate originality and crush noise.

FRONTIER RELEASE · The Decoder · 2d · [15 sources] · MASTOBLOG

Thinking Machines Lab ships its first model and argues interactivity is what OpenAI gets wrong about voice

Thinking Machines Lab, founded by former OpenAI CTO Mira Murati, has unveiled its first AI model, focusing on "interaction models" designed for real-time collaboration across voice, video, and text. Unlike current AI that processes input sequentially, TML's model operates in 200-millisecond chunks, allowing it to listen and respond simultaneously, mimicking natural human conversation. This "full duplex" approach aims to surpass competitors like OpenAI's GPT Realtime 2 and Google's Gemini Live in conversational quality, though it is currently a research preview with a limited release planned. AI

IMPACT Sets a new standard for real-time conversational AI, potentially shifting focus from agentic capabilities to natural human-AI interaction.
FRONTIER RELEASE · 量子位 (QbitAI) 中文(ZH) · 5d · [3 sources] · MASTO

Baidu Releases Wenxin 5.1: Search Capabilities Top Domestic, Pre-training Costs Only 6% of Industry Average

Baidu has released its new large language model, Wenxin 5.1, which significantly enhances search, knowledge, and AI agent capabilities. The model achieves leading domestic search performance and surpasses DeepSeek-V4-Pro in AI agent functionality, while its creative writing and reasoning abilities are comparable to top-tier models. Notably, Wenxin 5.1 was trained using a novel multi-dimensional elastic pre-training technique, reducing training costs to approximately 6% of industry standards. AI

IMPACT Sets new SOTA for domestic search and agent capabilities, while drastically reducing training costs.
FRONTIER RELEASE · OpenAI News · 1w · [45 sources] · HNMASTOBLOG

How OpenAI delivers low-latency voice AI at scale

OpenAI has released three new real-time voice models: GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper. These models offer enhanced reasoning capabilities, live speech translation for over 70 languages, and low-latency transcription. GPT-Realtime-2, in particular, is described as having "GPT-5-class reasoning" and features a significantly expanded context window of 128K tokens, alongside improved handling of interruptions and tool usage. AI

IMPACT Enhances real-time voice agent capabilities with improved reasoning, translation, and transcription, potentially accelerating adoption of voice-first interfaces.
FRONTIER RELEASE · Smol AINews · 2w · [15 sources] · MASTOBLOG

not much happened today

OpenAI has released GPT-5.5, which offers improvements in factuality, intelligence, and image understanding, and is now the default model for ChatGPT and its API. This release also enhances personalization, allowing ChatGPT to utilize user memories, past chats, and connected files. Additionally, OpenAI has introduced an Agents SDK for TypeScript and updated its Codex model to function as a general-purpose computer work agent, expanding its capabilities beyond coding. AI

IMPACT GPT-5.5's release and enhanced personalization features are likely to accelerate user adoption of AI agents for a wider range of tasks beyond coding.
FRONTIER RELEASE · Simon Willison · 2w · [11 sources] · MASTOBLOG

A pelican for GPT-5.5 via the semi-official Codex backdoor API

OpenAI has released GPT-5.5, available in Codex and rolling out to paid ChatGPT subscribers, though its API access is pending further safety reviews. The new model is described as fast and capable, with early users noting its ability to accurately build requested items. Meanwhile, Simon Willison's LLM library has been updated to version 0.32a0, introducing a more flexible message-based input system and streaming parts for responses to better handle diverse model capabilities. Additionally, issues affecting Claude Code's performance have been identified as harness problems rather than model flaws, with a specific bug causing forgetfulness and repetition. AI

IMPACT GPT-5.5's release and API delay signals continued frontier model development and cautious rollout strategies.
FRONTIER RELEASE · 36氪 (36Kr) 中文(ZH) · 2w · [50 sources] · MASTOX

WildFire Energy shareholders seek to sell the US shale oil operator for over $4 billion

OpenAI has launched its new GPT-5.5 model, reporting rapid API revenue growth and increased demand for its coding tools. The release is accompanied by a peculiar system prompt directive for Codex to "never talk about goblins." Meanwhile, OpenAI President Greg Brockman testified in the ongoing trial against Elon Musk, disclosing his significant equity stake in the company and defending his financial interests and contributions. AI

IMPACT Sets a new benchmark for model efficiency and coding capabilities, potentially intensifying competition with Anthropic.
FRONTIER RELEASE · The Guardian — AI · 1mo · [25 sources] · MASTOBLOG

Anthropic investigates report of rogue access to hack-enabling Mythos AI

Anthropic has announced Claude Mythos Preview, an AI model capable of autonomously finding and weaponizing software vulnerabilities, raising significant cybersecurity concerns. Due to its potential for misuse, the model is not publicly released but is instead being provided to a select group of companies and partners through initiatives like Project Glasswing to help identify and patch flaws. This development has prompted discussions among international financial officials and government ministers about the escalating risks posed by advanced AI in cyber warfare and the need for proactive security measures. AI

IMPACT This model's ability to autonomously find and exploit vulnerabilities could significantly accelerate cyber-attacks, necessitating rapid adaptation of defense strategies.
FRONTIER RELEASE · Practical AI · 68mo · [12 sources] · MASTOBLOG

Cracking the code of failed AI pilots

Anthropic has withheld its new Claude Mythos model from public release due to its advanced capabilities in finding and exploiting software vulnerabilities. The company is instead providing access to select cybersecurity firms through Project Glasswing to help patch critical software before the model's capabilities become more widely available. This decision highlights a shift from previous AI releases, where caution stemmed from unknown risks, to a current scenario where known, potent risks necessitate controlled access. AI

IMPACT This controlled release strategy for a highly capable model could set a precedent for managing advanced AI risks, potentially influencing future AI development and deployment.