marl
Multi-Agent Reinforcement Learning (MARL) is the study of how multiple agents learn to act within a shared environment, where each agent seeks to maximize its own cumulative reward. MARL covers a range of settings from fully cooperative to competitive or mixed interactions. Unlike single-agent reinforcement learning, the environment in MARL is non-stationary from the perspective of any individual learner because other agents are continually adapting their policies.
Formalization usually uses the framework of Markov games (also called stochastic games). A Markov game generalizes
Approaches in MARL frequently involve centralized training with decentralized execution (CTDE), especially in deep learning contexts.
Challenges in MARL include non-stationarity due to simultaneous learning, credit assignment across agents, partial observability, heterogeneity