实体 GSM8K

GSM8K

PulseAugur coverage of GSM8K — every cluster mentioning GSM8K across labs, papers, and developer communities, ranked by signal.

总计 · 30天

32

90 天内 32

发布 · 30天

0

90 天内 0

论文 · 30天

31

90 天内 31

层级分布 · 90 天

关系

情绪 · 30 天

9 天有情绪数据

最近 · 第 2/2 页 · 共 32 条

RESEARCH · CL_18265 · May 5 · 01:13

Researchers find Transformers know counts but struggle to output them

A new paper identifies a specific bottleneck in Transformer models that hinders their ability to perform counting tasks. Researchers found that while models like Pythia, Qwen3, and Mistral store count information accura…
RESEARCH · CL_11818 · May 1 · 04:00

New LenVM model offers token-level length control for LLMs

Researchers have developed a new framework called the Length Value Model (LenVM) that predicts the remaining generation length for tokens in large language models. This token-level approach models length as a value esti…
RESEARCH · CL_11738 · May 1 · 04:00

BoostLoRA method grows adapter rank to surpass full fine-tuning

Researchers have introduced BoostLoRA, a novel parameter-efficient fine-tuning method designed to enhance model expressivity without increasing inference overhead. This technique iteratively trains and merges small adap…
RESEARCH · CL_14144 · Apr 30 · 20:30

State Stream Transformer V2 enhances LLM reasoning with parallel training and latent state streaming

Researchers have developed the State Stream Transformer (SST) V2, an architectural innovation designed to enhance latent space reasoning in language models. Unlike standard transformers that reset context at each step, …
RESEARCH · CL_10517 · Apr 30 · 10:24

IBM's new 8B Granite 4.1 model outperforms older 32B MoE version

IBM has released Granite 4.1, a family of open-source language models designed for enterprise use, featuring three sizes (3B, 8B, and 30B parameters). Notably, the 8B dense model demonstrates performance matching or exc…
RESEARCH · CL_06627 · Apr 28 · 04:00

New research reveals hidden states in LLMs contain task-solving information

Researchers have investigated the information encoded within the hidden states of language models during chain-of-thought (CoT) reasoning. By using activation patching on the GSM8K dataset, they found that individual Co…
RESEARCH · CL_06321 · Apr 27 · 13:45

Researchers launch Gammaf, an open-source framework for benchmarking LLM multi-agent system security

Researchers have introduced GAMMAF, an open-source framework designed to benchmark anomaly detection methods in Large Language Model (LLM) multi-agent systems. This platform addresses the lack of standardized evaluation…
RESEARCH · CL_05211 · Apr 27 · 04:00

Language agents use auction to cut communication costs and boost reasoning

Researchers have developed a new framework called DALA (Dynamic Auction-based Language Agent) to improve communication efficiency in multi-agent systems powered by large language models. This system treats communication…
RESEARCH · CL_05134 · Apr 27 · 04:00

Multi-Token Prediction via Self-Distillation

Researchers have developed a novel self-distillation technique to accelerate language model inference. This method transforms a standard autoregressive model into a faster multi-token predictor without needing auxiliary…
RESEARCH · CL_05034 · Apr 24 · 06:34

New research suggests LLM self-correction can degrade performance if not carefully managed.

A new research paper introduces a control-theoretic framework to analyze when iterative self-correction in large language models (LLMs) is beneficial or detrimental. The study proposes a diagnostic based on error correc…
RESEARCH · CL_04999 · Apr 24 · 00:20

Researchers explore optimal LoRA placement in hybrid language models

A new paper explores the optimal placement of LoRA adapters in hybrid language models, which combine attention and recurrent components. The research demonstrates that adapting the attention pathway is more effective th…
RESEARCH · CL_01620 · Oct 10 · 00:00

Google DeepMind releases T5Gemma encoder-decoder LLMs adapted from Gemma

Google DeepMind has introduced T5Gemma, a new family of encoder-decoder large language models derived from their existing Gemma 2 models. This adaptation technique allows for flexible combinations of encoder and decoder…