Multi-Agent Reinforcement Learning Developments

The field of multi-agent reinforcement learning (MARL) is moving towards more complex and realistic environments, with a focus on cooperation, communication, and credit assignment. Researchers are developing new benchmarks and frameworks to test the limits of current methods and drive innovation. Notable advancements include the development of more efficient and expressive algorithms for multi-agent coordination, as well as the application of MARL to real-world problems such as understanding animal behavior.

Some papers are particularly noteworthy, including: Multi-Agent Craftax, which introduces a new benchmark for open-ended MARL and demonstrates its potential to drive long-term research in the field. MAC-Flow, which presents a simple yet expressive framework for multi-agent coordination that achieves fast inference and good performance across various benchmarks. A Historical Interaction-Enhanced Shapley Policy Gradient Algorithm, which proposes a hybrid credit assignment mechanism that enhances the agent's ability to perceive its own contribution and retains training stability. TIGER-MARL, which captures temporal information through graph-based embeddings and representations to enhance MARL and consistently outperforms diverse baselines in task performance and sample efficiency. Learning Efficient Communication Protocols, which introduces a generalized framework for learning multi-round communication protocols that are both effective and efficient, and demonstrates significant enhancements in communication efficiency and cooperation performance.

Sources

Multi-Agent Craftax: Benchmarking Open-Ended Multi-Agent Reinforcement Learning at the Hyperscale

Multi-agent Coordination via Flow Matching

Cooperation Under Network-Constrained Communication

A Historical Interaction-Enhanced Shapley Policy Gradient Algorithm for Multi-Agent Credit Assignment

ARAC: Adaptive Regularized Multi-Agent Soft Actor-Critic in Graph-Structured Adversarial Games

Understanding Electro-communication and Electro-sensing in Weakly Electric Fish using Multi-Agent Deep Reinforcement Learning

TIGER-MARL: Enhancing Multi-Agent Reinforcement Learning with Temporal Information through Graph-based Embeddings and Representations

Learning Efficient Communication Protocols for Multi-Agent Reinforcement Learning

Built with on top of