AlphaZero
PulseAugur coverage of AlphaZero — every cluster mentioning AlphaZero across labs, papers, and developer communities, ranked by signal.
-
New MCTS policies improve Monte Carlo Tree Search with variance awareness
Researchers have developed a new methodology called Inverse-RPO to systematically derive prior-based tree policies for Monte Carlo Tree Search (MCTS). This approach builds upon framing MCTS as a regularized policy optim…
-
Claude Opus 4.7 leads frontier agents in AI research acceleration benchmark
A new research paper proposes a benchmark to assess AI's ability to autonomously implement machine learning pipelines, aiming to detect early signs of recursive self-improvement. Frontier coding agents were tasked with …
-
DeepMind founder David Silver raises $1.1B for AI that learns without human data
Ineffable Intelligence, a new AI lab founded by former DeepMind researcher David Silver, has secured $1.1 billion in funding. The company aims to develop a "superlearner" that can acquire knowledge and skills autonomous…
-
DeepMind's AlphaGo lead David Silver launches Ineffable Intelligence with Sequoia funding
David Silver, a key figure behind DeepMind's AlphaGo and other AI projects, has launched a new research lab called Ineffable Intelligence. The lab aims to create a "superlearner" that acquires knowledge through direct e…
-
LLMs struggle to play video games, despite coding prowess, experts say
Despite rapid advancements in areas like coding, large language models (LLMs) demonstrate significant limitations when it comes to playing video games. While some models have achieved success in specific games, their pe…
-
Andrej Karpathy discusses Sutton's critique of LLMs as not 'bitter lesson pilled'
Andrej Karpathy discusses a podcast featuring Geoffrey Hinton, who questions the widely held belief that Large Language Models (LLMs) fully embody his "Bitter Lesson" principle. Hinton argues that LLMs rely heavily on f…
-
Amateurs aim to win Trackmania's Cup of the Day using machine learning
This article details a project aiming to develop a machine learning program capable of winning Division 1 of Trackmania's "Cup of the Day" without prior map knowledge. The authors are motivated by the desire to explore …