OpenAI has developed a new method called Video PreTraining (VPT) for training AI agents on large amounts of unlabeled online video. An inverse dynamics model is first trained on a small set of labeled videos to predict the action taken at each step; that model then labels a much larger unlabeled dataset, and an agent is trained on the result. Demonstrated in Minecraft, the resulting agent can perform complex tasks such as crafting diamond tools, a step toward general AI agents that can interact with computer interfaces.
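The label-then-imitate pipeline described above can be sketched on a toy problem. This is only an illustration under stated assumptions, not OpenAI's implementation: the one-dimensional world, the `expert_action` rule, and every function name here are hypothetical, and simple lookup tables stand in for the neural networks that a real system would train on video frames.

```python
from collections import Counter, defaultdict

GOAL = 5  # hypothetical toy world: an integer position, actions are -1, 0, +1


def expert_action(state):
    """Stand-in for the humans in the videos: always step toward GOAL."""
    if state < GOAL:
        return 1
    if state > GOAL:
        return -1
    return 0


def rollout(start, steps):
    """Return (states, actions) for one expert trajectory."""
    states, actions = [start], []
    for _ in range(steps):
        action = expert_action(states[-1])
        actions.append(action)
        states.append(states[-1] + action)
    return states, actions


def train_idm(labeled):
    """Inverse dynamics model: map an observed state change to the action
    that most often produced it in the small labeled set."""
    votes = defaultdict(Counter)
    for states, actions in labeled:
        for s, s_next, a in zip(states, states[1:], actions):
            votes[s_next - s][a] += 1
    return {delta: c.most_common(1)[0][0] for delta, c in votes.items()}


def pseudo_label(idm, states):
    """Infer the action between each pair of consecutive observations."""
    return [idm[s_next - s] for s, s_next in zip(states, states[1:])]


def behavioral_cloning(trajectories):
    """Policy: for each state, pick the most frequent (pseudo-)labeled action."""
    votes = defaultdict(Counter)
    for states, actions in trajectories:
        for s, a in zip(states, actions):
            votes[s][a] += 1
    return {s: c.most_common(1)[0][0] for s, c in votes.items()}


# 1. Small labeled set: observations with recorded actions.
labeled = [rollout(3, 4), rollout(7, 4)]
idm = train_idm(labeled)

# 2. Large "unlabeled" set: observations only, actions discarded.
unlabeled = [rollout(start, 12)[0] for start in range(11)]
pseudo = [(states, pseudo_label(idm, states)) for states in unlabeled]

# 3. Train the agent on the pseudo-labeled data.
policy = behavioral_cloning(pseudo)
print(policy[0], policy[5], policy[10])  # prints "1 0 -1"
```

The design point the sketch tries to capture is that inferring an action from a before-and-after pair is an easier problem than choosing an action from the present alone, which is why a small labeled set can be enough to label a much larger one.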