PulseAugur

OpenAI releases RL-Teacher for AI training with human feedback

OpenAI has released RL-Teacher, an open-source tool for training AI agents from occasional human feedback rather than hand-crafted reward functions. The underlying technique was developed with AI safety in mind: a reward predictor learns human preferences and can be integrated into various reinforcement learning agents. The release includes a web application through which humans provide feedback, which is then used to train the predictor, and is implemented in under 1,000 lines of Python code.
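To make the idea of a reward predictor learned from human preferences concrete, here is a minimal sketch in plain Python. It is not RL-Teacher's code or API: the function names, the linear reward model, the Bradley-Terry style preference probability, and the simulated "human" in `make_synthetic_comparisons` are all assumptions chosen for illustration. A human comparison is modeled as a pair of short trajectory segments plus a label saying which one was preferred.

```python
import math
import random


def segment_features(segment):
    """Sum the per-step feature vectors of a segment (a list of vectors)."""
    return [sum(column) for column in zip(*segment)]


def predicted_return(weights, segment):
    """Predicted total reward of a segment under a linear reward model (assumed)."""
    return sum(w * f for w, f in zip(weights, segment_features(segment)))


def preference_probability(weights, seg_a, seg_b):
    """Bradley-Terry style probability that seg_a is preferred over seg_b."""
    diff = predicted_return(weights, seg_a) - predicted_return(weights, seg_b)
    diff = max(-30.0, min(30.0, diff))  # clamp to keep exp() well behaved
    return 1.0 / (1.0 + math.exp(-diff))


def train_reward_predictor(comparisons, dim, lr=0.1, epochs=200):
    """Fit weights by stochastic gradient ascent on the preference log-likelihood.

    comparisons: list of (seg_a, seg_b, label), label 1.0 if seg_a was preferred,
    0.0 if seg_b was preferred.
    """
    weights = [0.0] * dim
    for _ in range(epochs):
        for seg_a, seg_b, label in comparisons:
            p = preference_probability(weights, seg_a, seg_b)
            feats_a = segment_features(seg_a)
            feats_b = segment_features(seg_b)
            for i in range(dim):
                weights[i] += lr * (label - p) * (feats_a[i] - feats_b[i])
    return weights


def make_synthetic_comparisons(rng, n, true_weights=(1.0, -1.0), steps=3):
    """Hypothetical stand-in for human feedback: prefer the higher hidden return."""
    def random_segment():
        return [[rng.uniform(-1.0, 1.0) for _ in true_weights] for _ in range(steps)]

    data = []
    for _ in range(n):
        seg_a, seg_b = random_segment(), random_segment()
        better_a = predicted_return(true_weights, seg_a) > predicted_return(true_weights, seg_b)
        data.append((seg_a, seg_b, 1.0 if better_a else 0.0))
    return data
```

In a setup like the one described, the learned predictor would stand in for the environment's reward signal while an ordinary reinforcement learning agent trains against it, and the synthetic labels would be replaced by comparisons collected from people.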

AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

  1. OpenAI News · TIER_1 · English (EN)

    Gathering human feedback

    RL-Teacher is an open-source implementation of our interface to train AIs via occasional human feedback rather than hand-crafted reward functions. The underlying technique was developed as a step towards safe AI systems, but also applies to reinforcement learning problems with re…