PulseAugur
实时 23:19:28
实体 LiveCodeBench

LiveCodeBench

PulseAugur coverage of LiveCodeBench — every cluster mentioning LiveCodeBench across labs, papers, and developer communities, ranked by signal.

Show in brief
总计 · 30天
14
90 天内 14
发布 · 30天
0
90 天内 0
论文 · 30天
11
90 天内 11
层级分布 · 90 天
情绪 · 30 天

5 天有情绪数据

最近 · 第 1/1 页 · 共 14 条
  1. TOOL · CL_44823 ·

    New STAND technique slashes LLM reasoning latency by 65%

    Researchers have developed STAND (STochastic Adaptive N-gram Drafting), a new model-free speculative decoding technique designed to accelerate language model reasoning. This method leverages the redundancy in reasoning …

  2. TOOL · CL_30793 ·

    LLMs learn to actively seek external info for better task adaptation

    Researchers have developed a new method for adapting large language models (LLMs) by enabling them to actively seek information from external sources like Wikipedia and web browsers. This approach, termed "active inform…

  3. TOOL · CL_29426 ·

    New framework StepCodeReasoner boosts code reasoning with execution traces

    Researchers have developed StepCodeReasoner, a new framework designed to improve code reasoning by focusing on intermediate execution states rather than just final outputs. This approach uses structured print statements…

  4. TOOL · CL_20541 ·

    New Conductor model learns to orchestrate LLMs for better performance

    Researchers have developed a "Conductor" model trained with reinforcement learning to coordinate multiple large language models. This Conductor model learns to establish communication pathways and craft specific instruc…

  5. TOOL · CL_24799 ·

    New CoREB benchmark and model advance code search capabilities

    Researchers have introduced CoREB, a new benchmark and model designed to improve code search beyond simple retrieval. CoREB addresses limitations in existing benchmarks, such as data contamination and noisy labels, by f…

  6. TOOL · CL_20651 ·

    New CoREB benchmark and reranker improve code search beyond retrieval

    Researchers have introduced CoREB, a new benchmark designed to evaluate code search systems beyond simple retrieval. This benchmark addresses limitations in existing datasets, such as data contamination and noisy labels…

  7. TOOL · CL_18865 ·

    ReCode framework enhances AI code generation by rewarding reasoning processes

    Researchers have developed ReCode, a novel reinforcement learning framework designed to improve code generation by focusing on the reasoning process. This framework uses Contrastive Reasoning-Process Reward Learning (CR…

  8. TOOL · CL_13981 ·

    DeepClaude slashes coding agent costs by 17x using DeepSeek V4 Pro

    An open-source tool called DeepClaude has gained significant traction by allowing developers to use the Claude Code agent loop with DeepSeek V4 Pro instead of Anthropic's models. This swap drastically reduces costs, wit…

  9. RESEARCH · CL_11452 ·

    ScaleBox system enhances LLM code verification accuracy and efficiency

    Researchers have developed ScaleBox, a new system designed to improve the accuracy and efficiency of code verification for large language models. Existing code sandboxes struggle with high-concurrency workloads, leading…

  10. RESEARCH · CL_47651 ·

    DeepSeek-V4 Pro model with 1.6T parameters now on Together AI

    DeepSeek-V4 Pro, a large Mixture-of-Experts model with 1.6 trillion parameters, is now accessible on the Together AI platform. This model is designed for long-context reasoning, supporting up to a 512K-token context win…

  11. RESEARCH · CL_07021 ·

    AI benchmark contamination signal sensitive to question format, study finds

    A new paper questions the reliability of temporal signals in detecting benchmark contamination for large language models. Researchers found that the way benchmark questions are phrased significantly impacts whether perf…

  12. RESEARCH · CL_06927 ·

    Think Anywhere in Code Generation

    Researchers have introduced "Think-Anywhere," a new reasoning mechanism for large language models that allows them to generate code by thinking at any point during the process, rather than just upfront. This approach ha…

  13. TOOL · CL_47691 ·

    Together AI launches API to execute LLM-generated code

    Together AI has launched Together Code Interpreter (TCI), an API designed to securely execute code generated by large language models. This tool addresses the limitation of LLMs being unable to run the code they produce…

  14. RESEARCH · CL_05788 ·

    Kwai AI's SRPO achieves DeepSeek-R1-Zero performance with 10x fewer training steps

    Researchers from Kuaishou's Kwaipilot team have developed a novel reinforcement learning framework called SRPO, designed to improve the efficiency and performance of large language models. This new method addresses limi…