Partial Action Replacement: Tackling Distribution Shift in Offline MARL
arXiv:2511.07629v1 Announce Type: new Abstract: Offline multi-agent reinforcement learning (MARL) is severely hampered by the challenge of evaluating out-of-distribution (OOD) joint actions. Our core finding is that when the behavior policy is factorized – a common scenario where agents act…
