Deep MARL pricing models show failure modes, partial fix proposed

By PulseAugur Editorial · [2 sources] · 2026-06-03 06:49

Researchers have identified two failure modes in deep multi-agent reinforcement learning (MARL) applied to asynchronous pricing markets. These modes include tacit cartel formation among competing agents and actor-critic instability at high event rates. The study proposes a partial fix involving asynchrony and latency, which significantly reduces collusion but does not fully resolve the instability issues. AI

IMPACT Identifies critical failure modes in MARL for pricing, potentially impacting the robustness of AI agents in financial markets.

RANK_REASON This is a research paper detailing failure modes and a proposed fix for a specific AI technique.

Read on arXiv cs.MA (Multiagent) →

paper
other

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

Deep MARL pricing models show failure modes, partial fix proposed

COVERAGE [2]

arXiv cs.AI TIER_1 English(EN) · Shree Murthy, Rohan Pandey · 2026-06-10 04:00

Failure Modes of Deep Multi-Agent RL in Asynchronous Pricing: Reproducible Triggers, Trace Diagnostics, and a Partial Fix

arXiv:2606.09884v1 Announce Type: cross Abstract: We study two reproducible failure modes of deep multi-agent reinforcement learning in continuous-time pricing markets: (i) tacit cartel formation between competing DDPG agents, and (ii) actor--critic instability at high event rate…
arXiv cs.MA (Multiagent) TIER_1 English(EN) · Rohan Pandey · 2026-06-03 06:49

Failure Modes of Deep Multi-Agent RL in Asynchronous Pricing: Reproducible Triggers, Trace Diagnostics, and a Partial Fix

We study two reproducible failure modes of deep multi-agent reinforcement learning in continuous-time pricing markets: (i) tacit cartel formation between competing DDPG agents, and (ii) actor--critic instability at high event rates. We instantiate both inside a single CT-MARL ben…

COVERAGE [2]

Failure Modes of Deep Multi-Agent RL in Asynchronous Pricing: Reproducible Triggers, Trace Diagnostics, and a Partial Fix

Failure Modes of Deep Multi-Agent RL in Asynchronous Pricing: Reproducible Triggers, Trace Diagnostics, and a Partial Fix

RELATED ENTITIES

RELATED TOPICS