marl - Infinite Lexicon - Infinite Lexicon

marl

Multi-Agent Reinforcement Learning (MARL) is the study of how multiple agents learn to act within a shared environment, where each agent seeks to maximize its own cumulative reward. MARL covers a range of settings from fully cooperative to competitive or mixed interactions. Unlike single-agent reinforcement learning, the environment in MARL is non-stationary from the perspective of any individual learner because other agents are continually adapting their policies.

Formalization usually uses the framework of Markov games (also called stochastic games). A Markov game generalizes

Approaches in MARL frequently involve centralized training with decentralized execution (CTDE), especially in deep learning contexts.

Challenges in MARL include non-stationarity due to simultaneous learning, credit assignment across agents, partial observability, heterogeneity

i

a

=

...,

|

i

a

a

policy-gradient

non-stationarity.