ENTITY GQA

GQA

PulseAugur coverage of GQA — every cluster mentioning GQA across labs, papers, and developer communities, ranked by signal.

Total · 30d

8 over 90d

Releases · 30d

0 over 90d

Papers · 30d

8 over 90d

TIER MIX · 90D

RELATIONSHIPS

competes with Montessori Lyceum Amsterdam 60%

SENTIMENT · 30D

1 day(s) with sentiment data

RECENT · PAGE 1/1 · 6 TOTAL

TOOL · CL_26875 · May 11 · 16:20

Transformer LLM Architectures Converge on Standard Stack

A recent analysis of 53 large language models from 2017 to 2025 reveals a significant convergence in transformer architectures. Key elements of this de facto standard include pre-normalization (RMSNorm), Rotary Position…
RESEARCH · CL_09211 · Apr 29 · 15:01

IBM releases Granite 4.1 LLMs with 512K context and Apache 2.0 license

IBM has released the Granite 4.1 family of large language models, comprising 3B, 8B, and 30B parameter versions. These models were trained on approximately 15 trillion tokens through a five-stage pre-training process th…
RESEARCH · CL_08619 · Apr 29 · 04:00

BLASST paper introduces dynamic sparse attention for faster LLM inference

Researchers have developed BLASST, a novel sparse attention mechanism designed to accelerate inference for large language models with long contexts. This drop-in solution dynamically skips attention blocks using a simpl…
RESEARCH · CL_06270 · Apr 27 · 12:59

Kwai Summary Attention compresses historical contexts for efficient long-context LLMs

Researchers have introduced Kwai Summary Attention (KSA), a novel attention mechanism designed to address the quadratic time complexity of standard softmax attention in large language models. KSA aims to maintain a line…
RESEARCH · CL_04553 · Apr 27 · 00:29

DeepSeek benchmarks MLA vs GQA on A100, revealing bandwidth-quality tradeoff

A technical analysis explores DeepSeek's decision to utilize MLA (Multi-Head Linear Attention) over GQA (Grouped-Query Attention) in their models. The author highlights this choice as a strategic trade-off between compu…
RESEARCH · CL_03769 · Apr 26 · 04:31

DeepSeek-V4, LoRA, and other LLM techniques detailed in new blogs

A series of six blog posts has been published on Outcome School, detailing fundamental components of contemporary large language models. The posts cover technical concepts such as RMSNorm, DeepSeek-V4, LoRA, RoPE, GQA, …

Transformer LLM Architectures Converge on Standard Stack

IBM releases Granite 4.1 LLMs with 512K context and Apache 2.0 license

BLASST paper introduces dynamic sparse attention for faster LLM inference

Kwai Summary Attention compresses historical contexts for efficient long-context LLMs

DeepSeek benchmarks MLA vs GQA on A100, revealing bandwidth-quality tradeoff

DeepSeek-V4, LoRA, and other LLM techniques detailed in new blogs