ADARL: Adaptive Low-Rank Structures for Robust Policy Learning under Uncertainty
arXiv:2510.11899v1 Announce Type: new Abstract: Robust reinforcement learning (Robust RL) seeks to handle epistemic uncertainty in environment dynamics, but existing approaches often rely on nested min–max optimization, which is computationally expensive and yields overly conservative policies. We propose Adaptive Rank…
