PulseAugur
EN
LIVE 12:10:32
ENTITY MBPP

MBPP

PulseAugur coverage of MBPP — every cluster mentioning MBPP across labs, papers, and developer communities, ranked by signal.

Show in brief
Total · 30d
15
15 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
14
14 over 90d
TIER MIX · 90D
TOPICS
RELATIONSHIPS
SENTIMENT · 30D

3 day(s) with sentiment data

RECENT · PAGE 1/1 · 15 TOTAL
  1. TOOL · CL_98129 ·

    New signature filtering method boosts LLM watermark detection accuracy

    Researchers have developed a new method called signature filtering to improve the detection of statistical watermarks in large language models. This technique enhances existing watermark detection without altering the e…

  2. RESEARCH · CL_93587 ·

    Study finds most post-hoc operators fail to improve frozen code model accuracy

    A new study published on arXiv investigates post-hoc falsification operators for small, frozen code models, finding that most operators do not improve accuracy over standard methods like Best-of-N. The research highligh…

  3. TOOL · CL_62660 ·

    Qwen2.5-Coder and DeepSeek-Coder V2 lead local coding LLM race

    For users with 8GB of VRAM, the Qwen2.5-Coder 7B model is the top choice for coding tasks, offering impressive benchmark scores and a large context window. Those with 12-16GB of VRAM face a trade-off between a dense 14B…

  4. TOOL · CL_58838 ·

    New BrahmicTokenizer-131K improves Indic language tokenization efficiency

    Researchers have developed BrahmicTokenizer-131K, a new tokenizer designed to improve efficiency for Indic languages while maintaining performance on English and code. This tokenizer achieves a 26.7% reduction in token …

  5. TOOL · CL_56429 ·

    New 'Poison-with-Style' Attack Targets Code LLMs with Subtle Triggers

    Researchers have developed a novel data poisoning attack called Poison-with-Style (PwS) that targets code large language models (CLLMs). This attack subtly embeds trigger code styles within developers' prompts, causing …

  6. TOOL · CL_51356 ·

    New Bilevel Approach Enhances LLM Learning with Textual Feedback

    Researchers have developed a novel bilevel approach for reinforcement learning with textual feedback, aiming to improve sample efficiency in LLMs. This new method, called Bilevel Natural Language Actor-Critic (Bi-NAC), …

  7. TOOL · CL_44879 ·

    New method steers LLM attention to correct reasoning errors

    Researchers have developed Manifold-Guided Attention Steering (MAGS), a novel method to improve the reasoning capabilities of large language models. MAGS identifies deviations from a 'correctness manifold' in the model'…

  8. RESEARCH · CL_36940 ·

    CANTANTE framework optimizes LLM multi-agent systems via credit attribution

    Researchers have developed CANTANTE, a new framework designed to optimize the configuration of large language model-based multi-agent systems. This system addresses the challenge of assigning credit for performance when…

  9. RESEARCH · CL_30616 ·

    New AI wrapper guides release decisions for iterative workflows

    Researchers have developed a new statistical method to determine when AI workflows should release their outputs, particularly for systems that use iterative generate-evaluate-revise loops. This "always-valid release wra…

  10. TOOL · CL_27577 ·

    Neuroevolution framework boosts LLM output diversity via prompt embedding evolution

    Researchers have developed QD-LLM, a novel framework that uses parameter-efficient neuroevolution to enhance the diversity of outputs from large language models. This method evolves compact prompt embeddings, which act …

  11. TOOL · CL_18865 ·

    ReCode framework enhances AI code generation by rewarding reasoning processes

    Researchers have developed ReCode, a novel reinforcement learning framework designed to improve code generation by focusing on the reasoning process. This framework uses Contrastive Reasoning-Process Reward Learning (CR…

  12. RESEARCH · CL_11738 ·

    BoostLoRA method grows adapter rank to surpass full fine-tuning

    Researchers have introduced BoostLoRA, a novel parameter-efficient fine-tuning method designed to enhance model expressivity without increasing inference overhead. This technique iteratively trains and merges small adap…

  13. RESEARCH · CL_10517 ·

    IBM's new 8B Granite 4.1 model outperforms older 32B MoE version

    IBM has released Granite 4.1, a family of open-source language models designed for enterprise use, featuring three sizes (3B, 8B, and 30B parameters). Notably, the 8B dense model demonstrates performance matching or exc…

  14. RESEARCH · CL_06927 ·

    Think Anywhere in Code Generation

    Researchers have introduced "Think-Anywhere," a new reasoning mechanism for large language models that allows them to generate code by thinking at any point during the process, rather than just upfront. This approach ha…

  15. RESEARCH · CL_00258 ·

    LLMs advance code editing, generation, and bug detection with new techniques

    Researchers are exploring various methods to enhance Large Language Models (LLMs) for code-related tasks. One study evaluates locally deployed LLMs like LLaMA 3.2 and Mistral for Python bug detection, finding they can i…