PulseAugur
实时 22:03:29
实体 GSM8K

GSM8K

PulseAugur coverage of GSM8K — every cluster mentioning GSM8K across labs, papers, and developer communities, ranked by signal.

Show in brief
总计 · 30天
32
90 天内 32
发布 · 30天
0
90 天内 0
论文 · 30天
31
90 天内 31
层级分布 · 90 天
关系
情绪 · 30 天

9 天有情绪数据

最近 · 第 2/2 页 · 共 32 条
  1. RESEARCH · CL_18265 ·

    Researchers find Transformers know counts but struggle to output them

    A new paper identifies a specific bottleneck in Transformer models that hinders their ability to perform counting tasks. Researchers found that while models like Pythia, Qwen3, and Mistral store count information accura…

  2. RESEARCH · CL_11818 ·

    New LenVM model offers token-level length control for LLMs

    Researchers have developed a new framework called the Length Value Model (LenVM) that predicts the remaining generation length for tokens in large language models. This token-level approach models length as a value esti…

  3. RESEARCH · CL_11738 ·

    BoostLoRA method grows adapter rank to surpass full fine-tuning

    Researchers have introduced BoostLoRA, a novel parameter-efficient fine-tuning method designed to enhance model expressivity without increasing inference overhead. This technique iteratively trains and merges small adap…

  4. RESEARCH · CL_14144 ·

    State Stream Transformer V2 enhances LLM reasoning with parallel training and latent state streaming

    Researchers have developed the State Stream Transformer (SST) V2, an architectural innovation designed to enhance latent space reasoning in language models. Unlike standard transformers that reset context at each step, …

  5. RESEARCH · CL_10517 ·

    IBM's new 8B Granite 4.1 model outperforms older 32B MoE version

    IBM has released Granite 4.1, a family of open-source language models designed for enterprise use, featuring three sizes (3B, 8B, and 30B parameters). Notably, the 8B dense model demonstrates performance matching or exc…

  6. RESEARCH · CL_06627 ·

    New research reveals hidden states in LLMs contain task-solving information

    Researchers have investigated the information encoded within the hidden states of language models during chain-of-thought (CoT) reasoning. By using activation patching on the GSM8K dataset, they found that individual Co…

  7. RESEARCH · CL_06321 ·

    Researchers launch Gammaf, an open-source framework for benchmarking LLM multi-agent system security

    Researchers have introduced GAMMAF, an open-source framework designed to benchmark anomaly detection methods in Large Language Model (LLM) multi-agent systems. This platform addresses the lack of standardized evaluation…

  8. RESEARCH · CL_05211 ·

    Language agents use auction to cut communication costs and boost reasoning

    Researchers have developed a new framework called DALA (Dynamic Auction-based Language Agent) to improve communication efficiency in multi-agent systems powered by large language models. This system treats communication…

  9. RESEARCH · CL_05134 ·

    Multi-Token Prediction via Self-Distillation

    Researchers have developed a novel self-distillation technique to accelerate language model inference. This method transforms a standard autoregressive model into a faster multi-token predictor without needing auxiliary…

  10. RESEARCH · CL_05034 ·

    New research suggests LLM self-correction can degrade performance if not carefully managed.

    A new research paper introduces a control-theoretic framework to analyze when iterative self-correction in large language models (LLMs) is beneficial or detrimental. The study proposes a diagnostic based on error correc…

  11. RESEARCH · CL_04999 ·

    Researchers explore optimal LoRA placement in hybrid language models

    A new paper explores the optimal placement of LoRA adapters in hybrid language models, which combine attention and recurrent components. The research demonstrates that adapting the attention pathway is more effective th…

  12. RESEARCH · CL_01620 ·

    Google DeepMind releases T5Gemma encoder-decoder LLMs adapted from Gemma

    Google DeepMind has introduced T5Gemma, a new family of encoder-decoder large language models derived from their existing Gemma 2 models. This adaptation technique allows for flexible combinations of encoder and decoder…