English(EN) Learning optimal policies from event logs through reinforcement learning: a comparison of deep and MDP-based approaches

比较强化学习方法以优化业务流程

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-15 04:00

一篇新的研究论文探讨了使用强化学习为规范性流程监控学习最优策略。该研究将使用马尔可夫决策过程（MDP）的基于模型的方法与无模型深度强化学习（DRL）方法进行了比较。这两种技术都旨在直接从历史事件数据中学习干预策略，最大限度地减少对领域知识的需求，并处理涉及外部参与者的情况。虽然两种方法在改进关键绩效指标（KPI）方面都显示出相似的有效性，但发现基于MDP的方法在计算效率方面更高。 AI

影响这项研究可能通过直接从历史数据中学习干预策略，从而实现更具数据驱动性和自动化的业务流程优化。

排序理由该集群包含一篇详细介绍新研究方法的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Stefano Branchi, Andrei Buliga, Chiara Di Francescomarino, Chiara Ghidini, Riccardo Graziosi, Francesca Meneghello, Massimiliano Ronzani · 2026-06-15 04:00

Learning optimal policies from event logs through reinforcement learning: a comparison of deep and MDP-based approaches

arXiv:2303.09209v2 Announce Type: replace Abstract: Prescriptive Process Monitoring is an emerging area within Process Mining that focuses on recommending actions to optimize business outcomes. Most existing works prescribe pre-defined interventions, i.e., sets of actions applied…

报道来源 [1]

Learning optimal policies from event logs through reinforcement learning: a comparison of deep and MDP-based approaches

相关实体

相关话题