Youssef Mahran
PulseAugur coverage of Youssef Mahran — every cluster mentioning Youssef Mahran across labs, papers, and developer communities, ranked by signal.
1 天有情绪数据
-
强化学习采用动态熵调优以改进四旋翼飞行器控制
研究人员调查了动态熵调优在强化学习用于四旋翼飞行器控制中的影响。他们将随机策略(优化动作的概率分布)与确定性策略(选择单一动作)进行了比较。该研究使用了Soft Actor-Critic (SAC)算法来处理随机策略,并使用Twin Delayed Deep Deterministic Policy Gradient (TD3)来处理确定性策略。研究结果表明,动态熵调优通过减轻灾难性遗忘和提高探索效率,对四旋翼飞行器控制产生了积极影响。
-
Quadrotor control system uses Soft Actor-Critic for improved performance
Researchers have developed a novel control system for quadrotors utilizing a Reinforcement Learning (RL) approach, specifically the Soft Actor-Critic (SAC) algorithm. This method focuses on controlling the quadrotor's t…
-
Reinforcement learning controls twin rotor system effectively
Researchers have developed a reinforcement learning framework to control and stabilize a Twin Rotor Aerodynamic System (TRAS). The Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm was employed due to its …