Brief

last 24h

[50/3574] 224 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

TOOL · Together AI blog English(EN) · 2mo

Together AI expands fine-tuning service with tool calling, reasoning, and vision support

Together AI has enhanced its fine-tuning service to better support advanced AI workflows. The update includes native support for tool call, reasoning, and vision-language model fine-tuning, addressing common issues like unreliable tool execution and degraded reasoning in complex interactions. These improvements aim to increase iteration speed and accuracy for AI teams building agentic applications, with enhanced throughput and larger dataset handling for models up to 1T parameters. AI

IMPACT Enables more reliable and efficient fine-tuning of AI agents, potentially accelerating the development of complex AI applications.
- Together AI
- Qwen
- Moonshot AI
- XY.AI Labs
- Z.AI
- OpenAI
TOOL · Latent Space Podcast English(EN) · 2mo

Why Anthropic Thinks AI Should Have Its Own Computer — Felix Rieseberg of Claude Cowork & Claude Code Desktop

Anthropic is developing Claude Cowork, a desktop application designed to bring agentic workflows to users who are not comfortable with terminals. This tool emerged from observations that users were leveraging Claude Code for general knowledge work beyond just coding. Claude Cowork utilizes a virtual machine to provide a safe environment for the AI to execute tasks, install tools, and run scripts, offering a balance between safety and autonomy. The product emphasizes local-first agent workflows, skills as reusable instructions, and integration with existing user stacks. AI
TOOL · X — xAI English(EN) · 2mo · [2 sources]

RT LiveKit: Grok's Text to Speech API is now available in LiveKit Inference. Natural, expressive voices with low-latency streaming. Multilingual in 20...

xAI has launched its Grok Text to Speech API, enabling developers to integrate natural and expressive voice capabilities into their applications. The API supports low-latency streaming and is multilingual, offering over 20 languages. It is designed for easy integration, requiring only a single API key with no additional setup needed for telephony or production use. AI

IMPACT Enables developers to easily add natural-sounding, multilingual voice output to applications.
- LiveKit Inference
- Grok
- xAI
- LiveKit
TOOL · HN — claude cli stories English(EN) · 3mo · [2 sources]

Use the Claude Agent SDK with Your Claude Plan

Anthropic is enhancing its Claude Opus model by offering a 1 million token context window by default for its Max, Team, and Enterprise plans. Additionally, starting June 15, 2026, eligible users on Pro, Max, Team, and Enterprise plans will receive a monthly credit for using the Claude Agent SDK. This credit covers usage for the SDK in custom projects, the `claude -p` command, and third-party applications, but does not apply to interactive use or web-based conversations. AI

IMPACT Anthropic's move expands context window capabilities and incentivizes developer adoption of its Agent SDK.
- Agent SDK
- Claude
- Team
- Max
- Pro
- Anthropic
- Claude Developer Platform
- Enterprise
- Claude Opus
- Claude Agent SDK
TOOL · HN — claude cli stories English(EN) · 3mo

Launch HN: Spine Swarm (YC S23) – AI agents that collaborate on a visual canvas

Spine Swarm, a Y Combinator-backed startup, has launched a platform that utilizes over 300 AI agents to conduct research and generate client-ready documents. The system claims to achieve the top ranking on Google DeepMind's DeepSearchQA benchmark, outperforming models like Claude and ChatGPT. Spine's approach involves parallel agent swarms that handle distinct workstreams, passing structured outputs to create deliverables such as reports, presentations, and spreadsheets. AI

IMPACT This product showcases advanced AI agent orchestration, potentially setting new benchmarks for automated research and document generation.
TOOL · OpenAI News English(EN) · 3mo

Rakuten fixes issues twice as fast with Codex

Rakuten has integrated OpenAI's Codex coding agent into its engineering workflows, resulting in significant improvements in software development and incident response. The company has achieved approximately a 50% reduction in mean time to recovery (MTTR) for issues and estimates potential build times for complex projects could be reduced from quarters to weeks. Codex is being used for automated code reviews, vulnerability checks within CI/CD pipelines, and to drive larger, more ambiguous projects towards completion with increased autonomy. AI
TOOL · OpenAI News English(EN) · 3mo

Wayfair boosts catalog accuracy and support speed with OpenAI

Wayfair has integrated OpenAI models into its core operational systems to enhance product catalog accuracy and automate supplier support. This integration has led to the correction of millions of product tags and the automation of thousands of monthly support tickets. By embedding AI into workflows rather than using it as a standalone tool, Wayfair has significantly improved its data quality and accelerated the rate at which new product attributes can be processed. AI
TOOL · OpenAI News English(EN) · 3mo

Codex Security: now in research preview

OpenAI has launched Codex Security, an application security agent now in research preview for its ChatGPT Pro, Enterprise, Business, and Edu customers. This tool leverages OpenAI's frontier models and the Codex agent to identify complex software vulnerabilities by building deep context about a project and its threat model. Codex Security aims to reduce noise and accelerate remediation by providing high-confidence findings and actionable fixes, significantly improving the signal-to-noise ratio compared to traditional security tools. AI
TOOL · Latent Space Podcast English(EN) · 3mo

Cursor's Third Era: Cloud Agents

Cursor has launched cloud agents, a significant advancement in their AI coding assistant. These agents can now interact with a full computer environment, moving beyond simple code reading to execute and test code end-to-end. This new capability aims to dramatically increase developer productivity by enabling parallel and swarmed agent workflows, allowing for more complex tasks to be accomplished. AI
TOOL · OpenAI News English(EN) · 3mo

How Descript engineers multilingual video dubbing at scale

Descript has enhanced its video dubbing capabilities by integrating OpenAI's reasoning models into its localization pipeline. This advancement allows for the automatic translation of large video content libraries while maintaining both the original meaning and the natural pacing of speech. The new system optimizes for semantic fidelity and duration adherence simultaneously, leading to a 15% increase in dubbed video exports and significant improvements in timing accuracy across various languages. AI
TOOL · The Pragmatic Engineer English(EN) · 3mo

Building Claude Code with Boris Cherny

Boris Cherny, Head of Claude Code at Anthropic, discussed the development and use of their internal AI coding tool. He shared that he personally ships 20-30 pull requests daily by utilizing five parallel Claude instances, which efficiently generate code based on detailed plans. Cherny also highlighted that simple tools like glob and grep, when guided by the AI, proved more effective for code retrieval than complex methods like RAG. The conversation also touched upon how AI is shifting the role of engineers rather than diminishing it, emphasizing the growing importance of skills like planning and iterative refinement. AI
TOOL · Hugging Face Blog English(EN) · 3mo

Train AI models with Unsloth and Hugging Face Jobs for FREE

Unsloth has partnered with Hugging Face to offer free training for AI models using their platform. This collaboration aims to make AI model training more accessible and efficient for developers. The integration allows users to leverage Unsloth's optimization techniques directly within the Hugging Face ecosystem. AI
TOOL · Hugging Face Blog English(EN) · 4mo

Custom Kernels for All from Codex and Claude

Hugging Face has introduced a new feature allowing developers to integrate custom CUDA kernels into their AI models, enhancing performance and enabling specialized agent skills. This development, powered by integrations with models like OpenAI's Claude and Google's Gemini, aims to democratize access to high-performance computing for AI development. The platform now supports the creation and deployment of these custom kernels, making advanced optimization techniques more accessible to a wider range of users. AI
TOOL · Hugging Face Blog English(EN) · 4mo

Introducing SyGra Studio

Hugging Face has partnered with ServiceNow to launch SyGra Studio, a new platform designed to help enterprises build and deploy generative AI applications. The studio offers tools for data preparation, model fine-tuning, and application deployment, aiming to streamline the process of integrating AI into business workflows. This collaboration seeks to make advanced AI capabilities more accessible to businesses looking to leverage generative AI for various operational needs. AI
TOOL · Together AI blog English(EN) · 4mo

Rime Arcana V3 Turbo and Rime Arcana V3 now available on Together AI

Together AI has launched two new Rime models, V3 Turbo and V3, designed for natural code-switching in voice agents. V3 Turbo offers English-Spanish switching with a time-to-first-audio of approximately 120ms on dedicated endpoints, maintaining conversational flow and prosody. The V3 model supports switching across 11 languages, providing a unified solution for multilingual customer interactions without the need for separate language-specific models. AI

IMPACT Enables more natural and efficient multilingual voice agent interactions, potentially reducing costs for high-volume deployments.
TOOL · HN — machine learning stories English(EN) · 4mo

Show HN: LemonSlice – Upgrade your voice agents to real-time video

LemonSlice has released Lemon Slice 2, a 20 billion parameter diffusion transformer model capable of generating infinite-length video at 20 frames per second on a single GPU. The company's API allows users to create and interact with photorealistic video avatars in real-time, aiming to become the dominant form factor for conversational AI. While acknowledging the challenge of the uncanny valley, LemonSlice claims their avatars are best-in-class and can also generate stylized cartoons and animals. The real-time generation was achieved through a causal model, sliding window attention, GAN-based distillation, and various inference optimizations. AI

IMPACT Enables real-time, interactive video avatars, potentially shifting conversational AI interfaces and raising concerns about misuse.
TOOL · OpenAI News Italiano(IT) · 4mo

TRUSTBANK uses AI agents to personalize Furusato Nozei gifts

TRUSTBANK has partnered with Recursive to develop Choice AI, a new service that leverages OpenAI's models. This AI-powered platform aims to personalize gift recommendations for users participating in the Furusato Nozei program. Choice AI utilizes conversational agents to simplify the process of discovering suitable gifts. AI
TOOL · OpenAI News English(EN) · 4mo

Introducing Prism

OpenAI has launched Prism, a new AI-powered workspace designed to streamline scientific writing and collaboration. This platform integrates GPT-5.2 directly into the research workflow, enabling scientists to draft, revise, and manage papers within a single, cloud-based environment. Prism aims to accelerate scientific progress by overcoming the fragmentation of current research tools, offering features like contextual AI assistance for equations, literature search, and real-time collaboration. AI
TOOL · OpenAI News English(EN) · 4mo

Unrolling the Codex agent loop

OpenAI has detailed its Codex agent loop, a system designed to orchestrate AI models, tools, and prompts. This technical explanation focuses on how the Codex CLI utilizes the Responses API to manage these components and optimize performance. The system aims to create a more cohesive and efficient AI agent experience. AI
TOOL · Last Week in AI English(EN) · 4mo

Last Week in AI #333 - ChatGPT Ads, Zhipu+Huawei, Drama at Thinking Machines

OpenAI is planning to introduce banner ads within its ChatGPT interface for both free and paid users in the U.S. and other regions. This move comes as the company reportedly spends billions of dollars. The company is also exploring other revenue streams, including a potential investment from Sequoia Capital in competitor Anthropic. AI
TOOL · HN — AI startup stories English(EN) · 4mo

Show HN: Text-to-video model from scratch (2 brothers, 2 years, 2B params)

Linum-AI, a startup founded by two brothers, has released Linum v2, a text-to-video model with 2 billion parameters. The model is available in 360p and 720p resolutions, capable of generating videos between 2 to 5 seconds long. This release is under the Apache 2.0 license, making it accessible for various applications. AI

IMPACT Provides a new open-source option for text-to-video generation, potentially enabling new creative tools and applications.
TOOL · OpenAI News English(EN) · 4mo

Inside Praktika's conversational approach to language learning

Praktika has launched a new AI-powered language learning application that leverages OpenAI's advanced GPT models, including GPT-4.1 and GPT-5.2. This innovative tool functions as an adaptive tutor, personalizing lessons to individual user needs. The application aims to enhance fluency by tracking learner progress and providing tailored feedback. AI
TOOL · OpenAI News English(EN) · 4mo

How Higgsfield turns simple ideas into cinematic social videos

Higgsfield has launched a generative media platform that uses OpenAI's GPT-4.1 and GPT-5 models to create short-form social videos from minimal input like product links or images. The system employs a "cinematic logic layer" to translate user intent into structured video plans, which are then rendered by Sora 2. This approach aims to automate the creation of viral-quality content, with generated videos showing a 150% increase in share velocity compared to previous baselines. AI
TOOL · Hugging Face Blog English(EN) · 4mo

Open Responses: What you need to know

Hugging Face has introduced "Open Responses," a new feature designed to enhance the safety and transparency of large language models. This initiative allows developers to provide open-ended, detailed responses to safety-related queries, moving beyond simple yes/no answers. The goal is to foster greater trust and understanding in AI systems by offering more nuanced explanations of their behavior and limitations. AI
TOOL · AI Explained English(EN) · 5mo

Anthropic: Our AI just created a tool that can ‘automate all white collar work’, Me:

A new AI-generated tool called Claude Cowork has gained significant attention for its potential to automate white-collar tasks. While some claim it could automate all such work, the actual productivity gains and impact on jobs are still under debate. The tool's capabilities are being compared to other models, and its development raises questions about the truth behind AI hype and the potential for job market disruption. AI
TOOL · VentureBeat AI English(EN) · 5mo · [2 sources]

Anthropic launches Cowork, a Claude Desktop agent that works in your files — no coding required

Anthropic has released Cowork, a new AI agent accessible via a macOS desktop application for Claude Max subscribers. This tool allows users to perform non-technical tasks by interacting with files in designated folders on their computer, without needing to write code. The development was inspired by observations of users repurposing Anthropic's coding tool for a wide range of everyday tasks, leading to the creation of a more accessible interface. AI
TOOL · OpenAI News Dansk(DA) · 5mo

Datadog uses Codex for system-level code review

Datadog has integrated OpenAI's Codex coding agent into its system-level code review process to enhance reliability and prevent incidents. Codex analyzes pull requests by considering the entire codebase and dependencies, identifying risks that human reviewers or traditional tools might miss. In testing, Codex flagged potential issues that would have prevented approximately 22% of historical incidents examined, demonstrating its value in complementing human expertise. AI
TOOL · One Useful Thing (Ethan Mollick) English(EN) · 5mo

Claude Code and What Comes Next

Ethan Mollick details his experience using Anthropic's Claude Code, an AI tool powered by Opus 4.5, which autonomously generated and deployed a functional website for a startup idea. The AI independently created hundreds of code files and a deployable website within an hour, demonstrating a significant leap in AI's autonomous capabilities. Mollick highlights that these advanced coding tools, while powerful, are primarily designed for programmers and require a technical understanding to utilize effectively. AI
TOOL · VentureBeat AI English(EN) · 5mo

The creator of Claude Code just revealed his workflow, and developers are losing their minds

Boris Cherny, the creator of Anthropic's Claude Code, has shared his workflow, which involves managing five AI agents simultaneously in his terminal. This approach allows a single developer to achieve the output of a small engineering team by running parallel tasks like testing, refactoring, and documentation. Cherny exclusively uses Anthropic's Opus 4.5 model, finding its superior reasoning and tool-use capabilities make it faster overall despite its slower processing speed. AI
TOOL · OpenAI News English(EN) · 6mo

How We Used Codex to Ship Sora for Android in 28 Days

OpenAI has detailed how its Codex agent was instrumental in developing the Sora Android application in just 28 days. A small team of four engineers utilized an early version of the GPT-5.1-Codex model, consuming approximately 5 billion tokens during the development process. This approach allowed the team to bypass traditional software development bottlenecks, resulting in a high-quality app with a 99.9% crash-free rate that achieved the #1 spot on the Play Store upon launch. AI
TOOL · OpenAI News English(EN) · 6mo

BNY builds “AI for everyone, everywhere” with OpenAI

BNY Mellon has developed an internal AI platform called Eliza, which empowers its employees to build AI agents for various business functions. This initiative, leveraging OpenAI's frontier capabilities, has seen 20,000 employees actively creating agents, leading to a 75% reduction in legal review times for certain tasks. The platform integrates robust governance measures to ensure responsible AI deployment across the organization, aiming to transform how the bank operates. AI
TOOL · Hugging Face Blog Nederlands(NL) · 6mo

New in llama.cpp: Model Management

The llama.cpp project has introduced a new model management feature, allowing users to easily download, manage, and switch between different large language models. This update aims to streamline the process of experimenting with various models on local hardware. The new functionality is integrated directly into the llama.cpp ecosystem, simplifying the workflow for developers and enthusiasts. AI
TOOL · OpenAI News English(EN) · 6mo

How Podium is arming 10,000+ SMBs with AI agents

Podium has launched an enhanced AI agent, named "Jerry," powered by OpenAI's GPT-5.1 model, to assist over 10,000 small and medium-sized businesses (SMBs). This AI agent automates lead capture, appointment scheduling, and customer service, aiming to significantly boost revenue and conversion rates for local businesses. Early results show a 300% year-over-year AI revenue increase for Podium and substantial growth for its clients, with agents influencing billions in revenue by providing rapid, 24/7 customer interaction. AI
- OpenAI
- SMBs
- Jerry
- GPT-5.1
- Podium
- medspas
- auto dealers
TOOL · OpenAI News English(EN) · 6mo

How Scout24 is building the next generation of real-estate search with AI

Scout24, a German real-estate platform, has launched an AI-powered conversational assistant named HeyImmo, built using OpenAI's GPT-5. This assistant aims to guide users through the property search process by asking clarifying questions, summarizing options, and adapting its response format. The company developed a custom evaluation system based on OpenAI's Evals framework and conducted extensive internal testing to ensure quality and reliability before launch. AI
TOOL · OpenAI News English(EN) · 6mo

Commonwealth Bank of Australia builds AI fluency at scale

Commonwealth Bank of Australia is deploying ChatGPT Enterprise to its nearly 50,000 employees, aiming to integrate AI into daily workflows and enhance customer service. This initiative focuses on building AI fluency through comprehensive training and leadership engagement. The bank plans to leverage AI for improved customer experiences, particularly in areas like customer service and fraud detection. AI
TOOL · OpenAI News English(EN) · 6mo

Instacart and OpenAI partner on AI shopping experiences

Instacart has launched a new integration within ChatGPT, allowing users to shop for groceries, build a cart, and complete purchases without leaving the chat interface. This collaboration deepens a prior partnership, leveraging OpenAI's models to connect Instacart's real-time grocery network with ChatGPT's conversational capabilities. The feature aims to streamline the process from meal inspiration to doorstep delivery, marking Instacart as the first app to offer a direct checkout experience within ChatGPT. AI
TOOL · HN — AI startup stories English(EN) · 6mo

Launch HN: Onyx (YC W24) – Open-source chat UI

Onyx, an open-source chat UI, has been launched by the team behind the Danswer enterprise search project. The new platform aims to provide a superior user experience for interacting with various large language models, including proprietary and open-weight options. Onyx integrates features like RAG, web search, and memory, while also offering enterprise-grade security and customization for self-hosting. AI

IMPACT Provides a user-friendly interface for interacting with various LLMs, potentially simplifying adoption for enterprises.
- YC W24
- Danswer
- GPT-4o
- Claude Sonnet 4
- Onyx
- Qwen
- ChatGPT
- Claude
TOOL · Hugging Face Blog English(EN) · 6mo

20x Faster TRL Fine-tuning with RapidFire AI

RapidFire AI has introduced a new fine-tuning method that significantly accelerates the training process for large language models. This technique, integrated with Hugging Face's TRL library, reportedly achieves up to a 20x speed increase. The optimization targets common bottlenecks in fine-tuning, making model adaptation more efficient. AI
TOOL · Replit blog Dansk(DA) · 6mo

Design Mode

Replit has launched a new Design Mode, powered by Google's Gemini 3 model, to enable users to create websites and interactive mockups rapidly. This feature allows for the generation of polished designs in under two minutes using natural language commands, streamlining the process from idea conception to a shareable or deployable product. Design Mode aims to reduce the friction between visual design and development, serving as an alternative to traditional website builders and prototyping tools. AI

IMPACT Accelerates website creation and prototyping by integrating advanced AI models into design workflows.
TOOL · X — DeepSeek English(EN) · 6mo

⚠️ Heads-up to anyone using the DeepSeek-V3.2-Exp inference demo: earlier versions had a RoPE implementation mismatch in the indexer module that cou...

DeepSeek has identified a performance-degrading bug in earlier versions of its DeepSeek-V3.2-Exp inference demo. The issue stems from a mismatch in the RoPE implementation within the indexer module, where earlier versions expected non-interleaved input while MLA RoPE expected interleaved. A fix has been implemented and is available via their GitHub repository. AI

IMPACT Addresses a specific bug in an inference demo, improving stability for users of DeepSeek-V3.2-Exp.
- DeepSeek-V3.2-Exp
- DeepSeek
TOOL · Smol AINews English(EN) · 7mo

minor updates to GPT 5.1 and SIMA 2

Smol AINews reported minor updates to GPT 5.1 and SIMA 2. The newsletter provided a brief overview of these changes without extensive detail. The updates appear to be incremental rather than significant advancements. AI
TOOL · Latent Space Podcast English(EN) · 7mo

⚡ [AIE CODE Preview] Inside Google Labs: Building The Gemini Coding Agent — Jed Borovik, Jules

Google Labs is developing an autonomous coding agent named Jules, designed to assist developers with software engineering tasks. The agent leverages advanced techniques like attention-based search over embeddings-based RAG and manages context windows up to 2 million tokens. Jules is positioned as a significant AI application and a potential pathway to AGI, with developers reportedly using it for extended periods. AI
TOOL · OpenAI News English(EN) · 7mo

How CRED is tapping AI to deliver premium customer experiences

CRED, an India-based financial services company, is leveraging OpenAI's GPT-4.0 and GPT-5 models to enhance customer experiences and internal operations. They have developed an AI conversational companion named Cleo for customer support, which has achieved a 98% resolution accuracy rate and improved CSAT scores by 14 percentage points. Additionally, internal tools like Thea and Stark assist support agents and operations teams, respectively, by summarizing conversations and streamlining SOP creation, leading to significant reductions in handling times and session drop-offs. AI
TOOL · Smol AINews English(EN) · 7mo

Cursor 2.0 & Composer-1: Fast Models and New Agents UI

Cursor has released version 2.0 of its AI-powered code editor, introducing a new user interface for agents. This update aims to enhance the developer experience by providing faster model responses and improved agent interaction capabilities. The release focuses on making AI-assisted coding more efficient and accessible within the development workflow. AI
TOOL · OpenAI News English(EN) · 7mo

Doppel’s AI defense system stops attacks before they spread

Doppel has launched an AI-powered defense system that significantly enhances its ability to combat online impersonations and deepfake threats. By integrating OpenAI's GPT-5 and o4-mini models with reinforcement fine-tuning, the system can now detect and neutralize threats in minutes, a drastic improvement from previous hours-long response times. This advancement has led to an 80% reduction in analyst workload and a threefold increase in threat-handling capacity, allowing organizations to defend against rapidly scaling AI-generated attacks. AI
TOOL · r/Anthropic English(EN) · 7mo

Advancing Claude for Financial Services

Anthropic has announced new capabilities for its Claude AI model, specifically tailored for the financial services industry. These advancements aim to enhance Claude's performance in areas such as risk management, fraud detection, and customer service within financial institutions. The updates are designed to provide more accurate and efficient AI-driven solutions for complex financial tasks. AI

IMPACT Enhances AI capabilities for financial services, potentially improving efficiency and accuracy in risk management and customer service.
TOOL · Google DeepMind English(EN) · 7mo

Exploring the context of online images with Backstory

Google DeepMind has introduced Backstory, an experimental AI tool designed to help users understand the context and origin of online images. The tool investigates whether an image is AI-generated, its previous online usage, and if it has been digitally altered. Backstory utilizes Gemini and various detection technologies to provide users with easy-to-read reports, aiming to enhance information trustworthiness. AI
TOOL · Smol AINews Dansk(DA) · 8mo

Claude Agent Skills - glorified AGENTS.md? or MCP killer?

Anthropic has introduced "Agent Skills" for its Claude AI, a feature that allows the AI to perform actions beyond simple text generation. This new capability enables Claude to interact with external tools and services, effectively acting as an agent that can execute tasks. The development has sparked discussion about whether this represents a significant advancement or a more incremental improvement over existing agent frameworks. AI
TOOL · Hugging Face Blog English(EN) · 8mo

Get your VLM running in 3 simple steps on Intel CPUs

Hugging Face has partnered with Intel to enable the deployment of vision-language models (VLMs) on Intel CPUs. This collaboration provides a streamlined process for developers to run these models efficiently, leveraging Intel's OpenVINO toolkit. The integration aims to make advanced AI capabilities more accessible on standard hardware. AI
TOOL · Hugging Face Blog English(EN) · 8mo

Arm will be @ PyTorch Conference, Join Us!

Arm announced a collaboration with Hugging Face to optimize AI models for Arm-based hardware. This partnership aims to improve the performance and accessibility of AI development on devices powered by Arm processors. The collaboration will focus on integrating Hugging Face's tools and models with Arm's architecture, making it easier for developers to deploy AI applications efficiently. AI