ENTITY DeepSpeed

DeepSpeed

PulseAugur coverage of DeepSpeed — every cluster mentioning DeepSpeed across labs, papers, and developer communities, ranked by signal.

Total · 30d

4

4 over 90d

Releases · 30d

0

0 over 90d

Papers · 30d

2

2 over 90d

TIER MIX · 90D

TOPICS

RELATIONSHIPS

developed by Microsoft Research 100%

SENTIMENT · 30D

2 day(s) with sentiment data

RECENT · PAGE 1/1 · 4 TOTAL

TOOL · CL_88289 · Jun 12 · 00:00

Anyscale details FSDP for PyTorch and Ray, training Qwen3-TTS

This blog post provides a detailed explanation of Fully Sharded Data Parallelism (FSDP) in PyTorch, a technique for efficiently training large AI models across multiple GPUs. It covers the internal workings of FSDP, dem…
TOOL · CL_76535 · Jun 7 · 20:17

Open-source framework accelerates LLM training with MoE/MoD

A developer has created an open-source PyTorch framework designed for training large language models with Mixture of Experts (MoE) and Mixture of Depths (MoD) architectures. The framework incorporates custom CUDA kernel…
TOOL · CL_33818 · May 15 · 21:31

PyTorch tutorial simplifies distributed AI model inference

This article explains distributed inference techniques for large AI models using PyTorch. It details how to implement Data Parallelism (DP), Tensor Parallelism (TP), and Pipeline Parallelism (PP) with minimal code. The …
RESEARCH · CL_11807 · Apr 30 · 18:55

New methods tackle LLM quantization for improved efficiency and accuracy

Researchers have developed several new methods to improve the efficiency of large language models (LLMs) through quantization. OSAQ focuses on suppressing weight outliers using a low-rank Hessian property for accurate l…