Pulse

last 48h

[4/3254] 98 sources

What AI is actually talking about — clusters surfacing on Bluesky, Reddit, HN, Mastodon and Lobsters, re-ranked to elevate originality and crush noise.

RESEARCH · OpenAI News English(EN) · 109mo · [2 sources] · BLOG

Learning from human preferences

OpenAI and DeepMind have developed a new algorithm that learns desired behaviors from human feedback, reducing the need for explicit goal functions. This method uses a three-step cycle where humans compare two agent behaviors, allowing the AI to infer the reward function and improve its performance. The approach has shown promising sample efficiency, requiring minimal human input to learn complex tasks like a backflip, and has achieved strong results in simulated robotics and Atari games, sometimes surpassing performance with standard reward functions. However, the system can be susceptible to agents that trick human evaluators, a problem being addressed with additional visual cues. AI
RESEARCH · OpenAI News English(EN) · 113mo · [32 sources] · BLOG

Transfer of adversarial robustness between perturbation types

OpenAI researchers are exploring the transferability of adversarial robustness across different types of perturbations in neural networks. Their findings indicate that robustness against one perturbation type does not always guarantee robustness against others and can sometimes be detrimental. They recommend evaluating adversarial defenses using a diverse range of perturbation types and sizes to ensure comprehensive security. Additionally, OpenAI is investigating adversarial examples as a concrete AI safety problem, noting their potential to cause significant issues, such as tricking autonomous vehicles. AI

IMPACT Highlights the ongoing challenges in securing AI systems against sophisticated adversarial attacks, necessitating robust evaluation and defense strategies.
RESEARCH · OpenAI News English(EN) · 122mo · [800 sources] · MASTOBLOGX

RL²: Fast reinforcement learning via slow reinforcement learning

OpenAI has published a series of research papers detailing advancements in reinforcement learning. These include achieving superhuman performance in Dota 2 with OpenAI Five, developing benchmarks for safe exploration in RL, and quantifying generalization capabilities with the CoinRun environment. The company also explored novel methods like prediction-based rewards for curiosity-driven exploration, learning policy representations in multiagent systems, and an experimental metalearning approach called Evolved Policy Gradients for faster training on new tasks. Further research addresses variance reduction in policy gradients and the equivalence between policy gradients and soft Q-learning, alongside challenging robotics environments for multi-goal RL. AI

IMPACT Demonstrates significant progress in RL capabilities, including superhuman performance, safety, generalization, and exploration, pushing the boundaries of AI.
TOOL · OpenAI News English(EN) · 127mo · [4478 sources] · HNLOBSTERSMASTOBLOGREDDITX

Introducing OpenAI

OpenAI has launched a preview of its Codex coding assistant within the ChatGPT mobile app, allowing users to manage coding tasks remotely across devices. The company is also highlighting how various organizations, including Ramp, NVIDIA, and AutoScout24, are leveraging Codex and GPT-5.5 for accelerated code review, faster development cycles, and AI-assisted research. Meanwhile, Anthropic's Project Glasswing initiative has identified over ten thousand high-severity vulnerabilities in essential software, emphasizing the need for the industry to adapt to AI-driven security analysis. AI

IMPACT Expands accessibility of AI coding assistants and highlights AI's role in identifying software vulnerabilities, potentially accelerating development and improving security.

Pulse

Learning from human preferences

Transfer of adversarial robustness between perturbation types

RL²: Fast reinforcement learning via slow reinforcement learning

Introducing OpenAI