PulseAugur
LIVE 00:10:30
ENTITY Proximal Policy Optimization

Proximal Policy Optimization

PulseAugur coverage of Proximal Policy Optimization — every cluster mentioning Proximal Policy Optimization across labs, papers, and developer communities, ranked by signal.

Total · 30d
10
10 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
10
10 over 90d
TIER MIX · 90D
RELATIONSHIPS
RECENT · PAGE 1/1 · 10 TOTAL
  1. TOOL · CL_16702 ·

    Author demystifies reinforcement learning math with new blog series

    A new blog series aims to demystify the mathematics behind reinforcement learning, starting with foundational concepts and progressing towards advanced algorithms like Proximal Policy Optimization (PPO). The initial pos…

  2. RESEARCH · CL_16149 ·

    AI agents leverage reinforcement learning to enhance software test case generation and code coverage

    Researchers have developed two novel approaches for automated test case generation using large language models (LLMs) and reinforcement learning. The first method, PPO-LLM, employs Proximal Policy Optimization (PPO) to …

  3. TOOL · CL_16153 ·

    Multi-agent RL ensures drone fleet separation but may favor stronger configurations

    Researchers have developed a multi-agent reinforcement learning framework to ensure safe separation between fleets of small unmanned aerial systems (sUASs). The proposed attention-enhanced Proximal Policy Optimization-b…

  4. TOOL · CL_16220 ·

    Deep Reinforcement Learning Optimizes Data Center Energy Use

    This paper introduces a new Deep Reinforcement Learning (DRL) framework to manage energy consumption in data centers. The system dynamically coordinates solar, wind, battery storage, and grid power to reduce costs and c…

  5. TOOL · CL_16276 ·

    DeepStage uses AI to learn autonomous defense against multi-stage cyberattacks

    Researchers have developed DeepStage, a new framework utilizing deep reinforcement learning to create autonomous defense policies against multi-stage cyberattacks. The system models enterprise environments as partially …

  6. RESEARCH · CL_11403 ·

    New Kernelized Advantage Estimation improves LLM reasoning with nonparametric statistics

    Researchers have introduced Kernelized Advantage Estimation (KAE) to enhance the reasoning capabilities of large language models (LLMs) through reinforcement learning. KAE addresses limitations in existing methods like …

  7. RESEARCH · CL_10167 ·

    Robots navigate using AI-powered depth estimation, ditching LiDAR

    Researchers have developed a novel teacher-student framework for robot navigation that replaces traditional LiDAR sensors with vision-based monocular depth estimation. A teacher policy, trained with privileged LiDAR dat…

  8. RESEARCH · CL_06928 ·

    AI framework optimizes land use for ecosystem services in Lake Malawi Basin

    Researchers have developed a deep reinforcement learning framework to optimize land-use allocation in the Lake Malawi Basin, aiming to enhance ecosystem service value. The system uses a Proximal Policy Optimization agen…

  9. RESEARCH · CL_01553 ·

    OpenAI releases Proximal Policy Optimization for simpler, effective reinforcement learning

    OpenAI has released Proximal Policy Optimization (PPO), a new reinforcement learning algorithm that offers comparable or superior performance to existing methods while being simpler to implement and tune. PPO strikes a …

  10. RESEARCH · CL_00324 ·

    OpenAI advances reinforcement learning with new benchmarks and methods

    OpenAI has published a series of research papers detailing advancements in reinforcement learning (RL). These include achieving superhuman performance in the game Dota 2 using large-scale deep RL, developing benchmarks …