Deep Deterministic Policy Gradient
PulseAugur coverage of Deep Deterministic Policy Gradient — every cluster mentioning Deep Deterministic Policy Gradient across labs, papers, and developer communities, ranked by signal.
3 day(s) with sentiment data
-
New backdoor attack methods target speech classification models
Researchers have developed new methods for creating sophisticated backdoor attacks on speech classification models. One approach, DRL-CLBA, uses reinforcement learning to embed triggers that cause misclassification with…
-
New RL framework enhances multi-fuel engine combustion control
Researchers have developed a new reinforcement learning framework to improve combustion phasing control in multi-fuel compression-ignition engines. This system addresses the challenge of uncertain and time-varying fuel …
-
Model-free RL controllers enhance cyber-physical system resilience against attacks · arXiv paper
A new research paper published on arXiv explores the effectiveness of model-free reinforcement learning (RL) controllers in enhancing the resilience of cyber-physical systems against cyberattacks. The study analyzes fou…
-
Deep MARL pricing models show failure modes, partial fix proposed
Researchers have identified two failure modes in deep multi-agent reinforcement learning (MARL) applied to asynchronous pricing markets. These modes include tacit cartel formation among competing agents and actor-critic…
-
DRL algorithms struggle to outperform calibrated baselines in resource control benchmarks
A new benchmark study, RLScale-Bench, has been developed to evaluate deep reinforcement learning (DRL) algorithms for adaptive resource control. The research found that a properly calibrated rule-based autoscaler often …
-
New YANN-RL method speeds up AI control for chemical processes
Researchers have developed a new reinforcement learning (RL) approach called Y-wise Affine Neural Network (YANN-RL) designed for control in chemical process systems. This method aims to overcome the typical challenges o…
-
Deep learning model achieves 95% accuracy in criminal identification
Researchers have developed a new deep learning method using the Deep Deterministic Policy Gradient (DDPG) algorithm to identify culprits in criminal investigations. This approach trains the DDPG model on crime scene dat…
-
Reinforcement learning uses symmetry and data augmentation for faster aircraft control
Researchers have developed a new method for offline reinforcement learning that leverages the symmetry of dynamical systems to improve sample efficiency. This approach uses symmetric data augmentation to enhance the sta…