Brief

last 24h

[7/7] 221 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

TOOL · Anyscale blog English(EN) · 3d

Introducing the Anyscale Agent Skill for LLM Post

Anyscale has introduced a new Anyscale Agent Skill designed to simplify and automate the process of generating LLM post-training runs. This skill assists users in selecting the most appropriate post-training method, such as SFT, CPT, DPO, or RLVR, based on their model, dataset, and objectives. It then generates configuration files for popular frameworks like LLaMA-Factory and Ray Train, preparing them for deployment on Anyscale Jobs. AI

IMPACT Simplifies the complex process of LLM post-training, potentially accelerating adoption of advanced alignment and optimization techniques.
- ChatGPT
- LLM
- RLHF
- InstructGPT
- RLVR
- DeepSeek-R1
- SFT
- DAPO
- Anyscale
- GRPO
- Ray Train
- LLaMA-Factory
- Anyscale Jobs
- Anyscale Agent Skills
RESEARCH · Anyscale blog English(EN) · 3d

Ray is Joining The PyTorch Foundation

Anyscale announced that its open-source distributed computing framework, Ray, is joining the PyTorch Foundation, which is part of the Linux Foundation. Ray has experienced significant growth, with downloads increasing nearly tenfold in the past year and powering AI workloads for numerous companies including xAI, Netflix, and JPMorgan. This move aims to foster a stronger open-source community around Ray to meet the evolving demands of AI infrastructure. AI

IMPACT Accelerates the development of open-source AI infrastructure by consolidating community efforts under a major foundation.
- xAI
- JPMorgan
- Netflix
- Linux Foundation
- Apache Spark
- vLLM
- Kubernetes
- UC Berkeley
- Anyscale
- Ray
- PyTorch Foundation
TOOL · Anyscale blog English(EN) · 3d

Monitor and debug Ray workloads with fully persisted Cluster and Actor dashboards on Anyscale

Anyscale has launched new Cluster and Actor Dashboards for its Ray platform, providing fully persisted monitoring and debugging tools. These dashboards address limitations of the previous ephemeral data, enabling historical analysis of Ray workloads even after clusters have shut down. The enhanced observability is designed to handle large-scale AI and data processing jobs, offering improved user experience and seamless navigation across various workload and system-level insights. AI

IMPACT Enhances tooling for AI/ML developers using distributed computing frameworks.
TOOL · Anyscale blog English(EN) · 3d

Announcing Anyscale on Azure: Build, Run and Scale AI

Anyscale has launched a private preview of its managed service on Microsoft Azure, designed to help enterprises build and scale AI workloads. This integration allows users to provision and manage Anyscale, which is powered by the Ray framework, directly within the Azure Portal. The service offers enhanced security features, unified billing through Azure, and optimized performance via the Anyscale Runtime for cost-efficient processing of AI-native applications. AI

IMPACT Streamlines AI development and deployment on Azure, potentially accelerating enterprise adoption of Ray-based applications.
COMMENTARY · Anyscale blog English(EN) · 3d

Architecting Data Pipelines for Multimodal Datasets at Scale

Anyscale's blog post details challenges in scaling multimodal AI data pipelines, where preprocessing often starves GPUs, leading to underutilization. The article explains that traditional staged batch execution, which involves writing intermediate data to storage between preprocessing and training, is inefficient due to significant I/O costs and delays. It proposes a disaggregated streaming architecture using Ray Data to directly stream preprocessed data from a dedicated preprocessing fleet to GPU workers, bypassing storage bottlenecks and improving GPU utilization. AI

IMPACT Provides architectural guidance for optimizing AI training and inference infrastructure, particularly for multimodal datasets.
- Spark
- Anyscale
- MapReduce
- multimodal AI
- Ray Data
COMMENTARY · Anyscale blog English(EN) · 1mo

How Notion cuts embedding costs by 80% and other stories on scaling AI with Ray from Salesforce, Uber, and more…

Anyscale hosted Ray Day Seattle, showcasing how companies like Notion and Salesforce are using the Ray framework to scale AI workloads. Notion significantly reduced embedding costs by 80% and improved query latency by migrating their AI pipeline to Ray, consolidating multiple steps into a single engine. Salesforce leveraged Ray to build a distributed system for summarizing lengthy documents, achieving low latency with a 20B parameter model. Uber also presented improvements in GPU utilization and training time using Ray for their ML platform. AI

IMPACT Demonstrates practical scaling solutions for AI workloads, reducing costs and improving performance for major tech companies.
- Uber
- Notion
- Salesforce
- Spark
- Michelangelo
- vLLM
- Anyscale
- Ray
- Robert Nishihara
- Peng Zhang
- Chi Wang
- Jiwei Cao
- Mickey Liu
TOOL · Anyscale blog English(EN) · 1mo

Announcing DP Group Fault Tolerance for vLLM WideEP Deployments with Ray Serve LLM

Anyscale has introduced a new fault tolerance feature for its vLLM serving engine, integrated with Ray Serve. This enhancement specifically addresses the challenges of deploying large Mixture-of-Experts (MoE) models, which are sharded across multiple GPUs. The new system can now identify and restart entire groups of GPUs that form a data-parallel (DP) group when a single GPU within that group fails, preventing the entire deployment from becoming unavailable. AI

IMPACT Enhances the reliability and operational efficiency of serving large, complex Mixture-of-Experts models, which are becoming increasingly common.

Brief

Introducing the Anyscale Agent Skill for LLM Post

Ray is Joining The PyTorch Foundation

Monitor and debug Ray workloads with fully persisted Cluster and Actor dashboards on Anyscale

Announcing Anyscale on Azure: Build, Run and Scale AI

Architecting Data Pipelines for Multimodal Datasets at Scale

How Notion cuts embedding costs by 80% and other stories on scaling AI with Ray from Salesforce, Uber, and more…

Announcing DP Group Fault Tolerance for vLLM WideEP Deployments with Ray Serve LLM