PulseAugur
实时 22:02:36
实体 Mathematics Dataset

Mathematics Dataset

PulseAugur coverage of Mathematics Dataset — every cluster mentioning Mathematics Dataset across labs, papers, and developer communities, ranked by signal.

Show in brief
总计 · 30天
9
90 天内 9
发布 · 30天
0
90 天内 0
论文 · 30天
9
90 天内 9
层级分布 · 90 天
情绪 · 30 天

3 天有情绪数据

最近 · 第 1/1 页 · 共 9 条
  1. RESEARCH · CL_43919 ·

    New 'Distillation Game' framework reveals model imitation risks

    Researchers have developed a new framework called "The Distillation Game" to study the trade-off between model utility and imitation risk. This framework models the interaction as a minimax game between a teacher model …

  2. RESEARCH · CL_41786 ·

    New RL methods tackle LLM training issues

    Two new research papers introduce methods to improve the training of large language models using reinforcement learning. One paper addresses the issue of "advantage collapse" in Group Relative Policy Optimization (GRPO)…

  3. TOOL · CL_41828 ·

    HRM-Text model drastically cuts LLM pretraining costs

    Researchers have developed HRM-Text, a novel Hierarchical Recurrent Model that significantly reduces the computational resources and training data required for pretraining large language models. By decoupling computatio…

  4. TOOL · CL_36559 ·

    New VSPO method enhances language model behavioral control

    Researchers have developed a new method called Vector-Steered Policy Optimization (VSPO) to help language models better control specific behaviors while maintaining accuracy. VSPO uses a steering vector to adjust the in…

  5. RESEARCH · CL_28917 ·

    New RL method teaches LLMs to self-correct answers

    Researchers have developed SCoRe, a novel two-stage reinforcement learning technique that enables language models to refine their own responses using self-generated data. This method significantly improves performance o…

  6. TOOL · CL_28283 ·

    AI reasoning studies flawed by focus on final answer, not computation

    A new research paper identifies a significant flaw in chain-of-thought (CoT) corruption studies, which are used to evaluate the faithfulness of AI reasoning. The study found that these evaluations often mistakenly ident…

  7. TOOL · CL_16250 ·

    The Master Key Hypothesis: Unlocking Cross-Model Capability Transfer via Linear Subspace Alignment

    Researchers have introduced the Master Key Hypothesis, suggesting that model capabilities reside in transferable latent subspaces that can be aligned across different model scales. They developed a framework called UNLO…

  8. RESEARCH · CL_05034 ·

    New research suggests LLM self-correction can degrade performance if not carefully managed.

    A new research paper introduces a control-theoretic framework to analyze when iterative self-correction in large language models (LLMs) is beneficial or detrimental. The study proposes a diagnostic based on error correc…

  9. COMMENTARY · CL_01323 ·

    How good are LLMs at fixing their mistakes? A chatbot arena experiment with Keras and TPUs

    Current methods for evaluating large language models, such as MMLU and HumanEval, may be insufficient as they do not capture the nuances of interactive, goal-oriented conversations. A more effective approach would invol…