ENTITY LiveCodeBench V6

LiveCodeBench V6

PulseAugur coverage of LiveCodeBench V6 — every cluster mentioning LiveCodeBench V6 across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

5 over 90d

Releases · 30d

0 over 90d

Papers · 30d

4 over 90d

TIER MIX · 90D

significant 1
research 2
tool 2

TOPICS

SENTIMENT · 30D

1 day(s) with sentiment data

RECENT · PAGE 1/1 · 5 TOTAL

RESEARCH · CL_94915 · Jun 16 · 13:44

New 3B model VibeThinker matches frontier math & coding performance

Researchers have developed VibeThinker-3B, a compact 3-billion parameter model that achieves performance comparable to much larger models in mathematics and coding tasks. This model, built upon Qwen2.5-Coder-3B and util…
RESEARCH · CL_53559 · May 26 · 13:21

New CPPO method boosts code generation by exploring multiple strategies

Researchers have introduced Coordinated Pass@K Policy Optimization (CPPO), a novel method to enhance code generation by exploring multiple distinct algorithmic strategies simultaneously. Unlike standard approaches that …
RESEARCH · CL_40825 · May 19 · 06:46

New self-distillation methods boost LLM performance on reasoning tasks

Researchers have developed new self-distillation techniques for large language models to improve their performance without relying on external feedback. AVSD (Adaptive-View Self-Distillation) balances consensus signals …
RESEARCH · CL_02960 · Apr 23 · 12:36

Process Supervision via Verbal Critique Improves Reasoning in Large Language Models

Researchers have developed a new framework called Verbal Process Supervision (VPS) that enhances the reasoning capabilities of large language models without requiring gradient updates. This method utilizes structured na…
FRONTIER RELEASE · CL_01735 · Oct 23 · 18:54

Google DeepMind launches Deep Think for Gemini Ultra subscribers

Google DeepMind has released a new AI capability called Deep Think, now available to Google AI Ultra subscribers via the Gemini app. This feature utilizes parallel thinking techniques, allowing the model to explore mult…

New 3B model VibeThinker matches frontier math & coding performance

New CPPO method boosts code generation by exploring multiple strategies

New self-distillation methods boost LLM performance on reasoning tasks

Process Supervision via Verbal Critique Improves Reasoning in Large Language Models

Google DeepMind launches Deep Think for Gemini Ultra subscribers