ENTITY Megatron-LM

Megatron-LM

PulseAugur coverage of Megatron-LM — every cluster mentioning Megatron-LM across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

5 over 90d

Releases · 30d

0 over 90d

Papers · 30d

4 over 90d

TIER MIX · 90D

TOPICS

RECENT · PAGE 1/1 · 5 TOTAL

RESEARCH · CL_50673 · May 25 · 13:16

New benchmarks and methods advance multimodal LLM capabilities

Researchers are developing new methods for multimodal large language models (MLLMs) to improve their understanding of sequential audio-video data and large-scale visual recognition. One approach, DLLM-VSR, uses diffusio…
TOOL · CL_33818 · May 15 · 21:31

PyTorch tutorial simplifies distributed AI model inference

This article explains distributed inference techniques for large AI models using PyTorch. It details how to implement Data Parallelism (DP), Tensor Parallelism (TP), and Pipeline Parallelism (PP) with minimal code. The …
TOOL · CL_51841 · May 15 · 13:10

New 1.58-bit LLM family achieves 6x inference memory reduction

A new family of large language models, BitCPM-CANN, has been developed using a novel 1.58-bit ternary quantization technique. These models, ranging from 0.5B to 8B parameters, achieve significant memory reduction for in…
RESEARCH · CL_11807 · Apr 30 · 18:55

New methods tackle LLM quantization for improved efficiency and accuracy

Researchers have developed several new methods to improve the efficiency of large language models (LLMs) through quantization. OSAQ focuses on suppressing weight outliers using a low-rank Hessian property for accurate l…
RESEARCH · CL_01012 · Feb 4 · 18:00

Why Nvidia builds open models with Bryan Catanzaro

Nvidia is significantly expanding its open model program, releasing higher quality models and datasets. This strategy benefits Nvidia by capturing value from open language models, creating a sustainable advantage. The c…

New benchmarks and methods advance multimodal LLM capabilities

PyTorch tutorial simplifies distributed AI model inference

New 1.58-bit LLM family achieves 6x inference memory reduction

New methods tackle LLM quantization for improved efficiency and accuracy

Why Nvidia builds open models with Bryan Catanzaro