Brief

last 24h

[2/2] 224 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

TOOL · arXiv cs.AI English(EN) · 8h

Offline Diffusion Policy for Multi-User Delay-Constrained Scheduling

Researchers have developed a new offline reinforcement learning algorithm called SOCD for delay-constrained scheduling in multi-user systems. This method utilizes a diffusion policy and a critic network to learn scheduling strategies solely from pre-collected data, avoiding the need for real-time system interaction. Experiments show SOCD effectively handles various system dynamics and outperforms existing scheduling approaches. AI

IMPACT This new algorithm could improve resource allocation in AI systems requiring real-time decision-making under delay constraints.
- Zhuoran Li
TOOL · arXiv cs.AI English(EN) · 8h

Diffusing to Coordinate: Efficient Online Multi-Agent Diffusion Policies

Researchers have introduced OMAD, a novel framework for online multi-agent reinforcement learning (MARL) that utilizes diffusion policies to enhance agent coordination. This approach addresses the challenge of intractable likelihoods in diffusion models, which typically hinder exploration in online MARL settings. OMAD employs a relaxed policy objective that maximizes scaled joint entropy and a joint distributional value function for decentralized policy optimization, leading to significant improvements in sample efficiency. AI

IMPACT Introduces a novel approach to multi-agent reinforcement learning, potentially improving coordination and sample efficiency in complex AI systems.
- Zhuoran Li

Brief

Offline Diffusion Policy for Multi-User Delay-Constrained Scheduling

Diffusing to Coordinate: Efficient Online Multi-Agent Diffusion Policies