Technology, from arXiv cs.AI

Improving Generalization and Data Efficiency with Diffusion in Offline Multi-agent RL

Zhuoran Li, Ling Pan, Jiatai Huang, Longbo Huang
Thursday at 04:00

arXiv:2307.01472v2 Announce Type: replace

Abstract: We present a novel Diffusion Offline Multi-agent Model (DOM2) for offline Multi-Agent Reinforcement Learning (MARL). Unlike existing algorithms, which rely mainly on conservatism in policy design, DOM2 enhances policy expressiveness and diversity based on a diffusion model. Specifically, we...

Read the full article at the source.
