PulseAugur
实时 23:50:24
实体 MBPP

MBPP

PulseAugur coverage of MBPP — every cluster mentioning MBPP across labs, papers, and developer communities, ranked by signal.

Show in brief
总计 · 30天
9
90 天内 9
发布 · 30天
0
90 天内 0
论文 · 30天
9
90 天内 9
层级分布 · 90 天
关系
情绪 · 30 天

2 天有情绪数据

最近 · 第 1/1 页 · 共 9 条
  1. TOOL · CL_44879 ·

    New method steers LLM attention to correct reasoning errors

    Researchers have developed Manifold-Guided Attention Steering (MAGS), a novel method to improve the reasoning capabilities of large language models. MAGS identifies deviations from a 'correctness manifold' in the model'…

  2. RESEARCH · CL_36940 ·

    CANTANTE framework optimizes LLM multi-agent systems via credit attribution

    Researchers have developed CANTANTE, a new framework designed to optimize the configuration of large language model-based multi-agent systems. This system addresses the challenge of assigning credit for performance when…

  3. RESEARCH · CL_30616 ·

    New AI wrapper guides release decisions for iterative workflows

    Researchers have developed a new statistical method to determine when AI workflows should release their outputs, particularly for systems that use iterative generate-evaluate-revise loops. This "always-valid release wra…

  4. TOOL · CL_27577 ·

    Neuroevolution framework boosts LLM output diversity via prompt embedding evolution

    Researchers have developed QD-LLM, a novel framework that uses parameter-efficient neuroevolution to enhance the diversity of outputs from large language models. This method evolves compact prompt embeddings, which act …

  5. TOOL · CL_18865 ·

    ReCode framework enhances AI code generation by rewarding reasoning processes

    Researchers have developed ReCode, a novel reinforcement learning framework designed to improve code generation by focusing on the reasoning process. This framework uses Contrastive Reasoning-Process Reward Learning (CR…

  6. RESEARCH · CL_11738 ·

    BoostLoRA method grows adapter rank to surpass full fine-tuning

    Researchers have introduced BoostLoRA, a novel parameter-efficient fine-tuning method designed to enhance model expressivity without increasing inference overhead. This technique iteratively trains and merges small adap…

  7. RESEARCH · CL_10517 ·

    IBM's new 8B Granite 4.1 model outperforms older 32B MoE version

    IBM has released Granite 4.1, a family of open-source language models designed for enterprise use, featuring three sizes (3B, 8B, and 30B parameters). Notably, the 8B dense model demonstrates performance matching or exc…

  8. RESEARCH · CL_06927 ·

    Think Anywhere in Code Generation

    Researchers have introduced "Think-Anywhere," a new reasoning mechanism for large language models that allows them to generate code by thinking at any point during the process, rather than just upfront. This approach ha…

  9. RESEARCH · CL_00258 ·

    LLMs advance code editing, generation, and bug detection with new techniques

    Researchers are exploring various methods to enhance Large Language Models (LLMs) for code-related tasks. One study evaluates locally deployed LLMs like LLaMA 3.2 and Mistral for Python bug detection, finding they can i…