PulseAugur
EN
LIVE 10:04:52
ENTITY Sequence Parallelism

Sequence Parallelism

PulseAugur coverage of Sequence Parallelism — every cluster mentioning Sequence Parallelism across labs, papers, and developer communities, ranked by signal.

Show in brief
Total · 30d
3
3 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
3
3 over 90d
TIER MIX · 90D
TOPICS
SENTIMENT · 30D

1 day(s) with sentiment data

RECENT · PAGE 1/1 · 3 TOTAL
  1. RESEARCH · CL_97838 ·

    Spotlight system cuts DiT RL post-training costs using spot GPUs

    Researchers have developed Spotlight, a novel system designed to significantly reduce the cost of post-training Diffusion Transformers (DiTs) for reinforcement learning. By leveraging insights into exploration tolerance…

  2. RESEARCH · CL_15158 ·

    Zyphra's TSP strategy boosts LLM training throughput by 2.6x

    Zyphra has developed a new technique called Tensor and Sequence Parallelism (TSP) designed to optimize the training and inference of large transformer models. This hardware-aware strategy combines aspects of Tensor Para…

  3. RESEARCH · CL_09826 ·

    New TSP strategy folds tensor and sequence parallelism for memory-efficient training

    Researchers have introduced a new parallel execution strategy called Tensor and Sequence Parallelism (TSP) designed to enhance memory efficiency during the training and inference of Transformer models. TSP combines tens…