Continuous-Time Value Iteration for Multi-Agent Reinforcement Learning
arXiv:2509.09135v3 Announce Type: replace Abstract: Existing reinforcement learning (RL) methods struggle with complex dynamical systems that demand interactions at high frequencies or irregular time intervals. Continuous-time RL (CTRL) has emerged as a promising alternative by replacing discrete-time Bellman recursion with…
