AlphaZero
PulseAugur coverage of AlphaZero — every cluster mentioning AlphaZero across labs, papers, and developer communities, ranked by signal.
5 day(s) with sentiment data
-
AI Agent 'WallZero' Masters Complex Board Game WallGo
Researchers have developed WallZero, an AI agent based on AlphaZero, designed to master the strategic board game WallGo. This game, popularized by a Netflix series, presents significant complexity despite its small boar…
-
AI discovers superior lattice reduction strategies, outperforming LLL algorithm
Researchers have developed a deep reinforcement learning approach to discover new strategies for lattice basis reduction, outperforming the traditional Lenstra-Lenstra-Lovász (LLL) algorithm. By framing lattice reductio…
-
AI discovers superior lattice reduction strategies, outperforming LLL algorithm
Researchers have developed a new method using deep reinforcement learning to discover superior strategies for the Lenstra-Lenstra-Lovász (LLL) algorithm, a fundamental tool in computer science for lattice basis reductio…
-
David Silver's Ineffable Intelligence raises $11B on anti-LLM bet
Ineffable Intelligence, a two-month-old company founded by DeepMind's David Silver, has secured a historic $11 billion seed funding round at a $51 billion valuation. The company's core bet challenges the prevailing LLM …
-
AlphaZero Othello training struggles prompt hyperparameter analysis
A user is training an AlphaZero model for Othello on a 6x6 board and encountering issues with performance. Despite models improving against each other, they are not significantly better than benchmark agents, with a win…
-
Mechanistic interpretability reveals LLM reasoning processes
Researchers are making significant progress in understanding the internal workings of large language models through mechanistic interpretability. Techniques like Anthropic's circuit tracing allow for the identification …
-
Developer starts MCP internship, eyes potential paper
A developer has started a new role focusing on Model Context Protocol (MCP) at a startup, aiming to integrate it with LLMs and a custom simulator. This work has the potential to lead to a published paper. The developer …
-
Google sunsets Gemini CLI; AlphaZero defeats Stockfish
Google is sunsetting its Gemini CLI tool on June 18th, urging users to migrate to the Anti Gravity CLI. Separately, DeepMind's AlphaZero demonstrated a significant chess-playing capability by defeating Stockfish after e…
-
New AI methods tackle imperfect-information games
Researchers are developing new methods to tackle complex games with imperfect information. One paper introduces Recurrent Structural Policy Gradient (RSPG), a novel approach for partially observable mean field games tha…
-
RL framework automates security protocol analysis in Tamarin
Researchers have developed a reinforcement learning (RL) framework to automate and shorten the process of analyzing security protocols using the Tamarin tool. This new method, inspired by AlphaZero, employs a neural heu…
-
Demis Hassabis's AI work earns Nobel Prize in Chemistry
Demis Hassabis's pioneering work, including AlphaGo, AlphaZero, and AlphaFold, has significantly advanced artificial intelligence and its applications in science. His contributions were recognized with the Nobel Prize i…
-
New MCTS policies improve Monte Carlo Tree Search with variance awareness
Researchers have developed a new methodology called Inverse-RPO to systematically derive prior-based tree policies for Monte Carlo Tree Search (MCTS). This approach builds upon framing MCTS as a regularized policy optim…
-
Claude Opus 4.7 leads frontier agents in AI research acceleration benchmark
A new research paper proposes a benchmark to assess AI's ability to autonomously implement machine learning pipelines, aiming to detect early signs of recursive self-improvement. Frontier coding agents were tasked with …
-
DeepMind founder David Silver raises $1.1B for AI that learns without human data
Ineffable Intelligence, a new AI lab founded by former DeepMind researcher David Silver, has secured $1.1 billion in funding. The company aims to develop a "superlearner" that can acquire knowledge and skills autonomous…
-
DeepMind's AlphaGo lead David Silver launches Ineffable Intelligence with Sequoia funding
David Silver, a key figure behind DeepMind's AlphaGo and other AI projects, has launched a new research lab called Ineffable Intelligence. The lab aims to create a "superlearner" that acquires knowledge through direct e…
-
LLMs struggle to play video games, despite coding prowess, experts say
Despite rapid advancements in areas like coding, large language models (LLMs) demonstrate significant limitations when it comes to playing video games. While some models have achieved success in specific games, their pe…
-
Andrej Karpathy discusses Sutton's critique of LLMs as not 'bitter lesson pilled'
Andrej Karpathy discusses a podcast featuring Geoffrey Hinton, who questions the widely held belief that Large Language Models (LLMs) fully embody his "Bitter Lesson" principle. Hinton argues that LLMs rely heavily on f…
-
Amateurs aim to win Trackmania's Cup of the Day using machine learning
This article details a project aiming to develop a machine learning program capable of winning Division 1 of Trackmania's "Cup of the Day" without prior map knowledge. The authors are motivated by the desire to explore …