Robust Adversarial Reinforcement Learning in Stochastic Games via Sequence Modeling
arXiv:2510.11877v1
Abstract: The Transformer, a highly expressive architecture for sequence modeling, has recently been adapted to sequential decision-making problems, most notably through the Decision Transformer (DT), which learns policies by conditioning on desired returns. Yet, the adversarial…
