Archives AI News

Two-Player Zero-Sum Games with Bandit Feedback

arXiv:2506.14518v2 Announce Type: replace Abstract: We study a two-player zero-sum game in which the row player aims to maximize their payoff against an adversarial column player, under an unknown payoff matrix estimated through bandit feedback. We propose three algorithms based…

Continuous-time Riemannian SGD and SVRG Flows on Wasserstein Probabilistic Space

arXiv:2401.13530v4 Announce Type: replace Abstract: Recently, optimization on the Riemannian manifold have provided valuable insights to the optimization community. In this regard, extending these methods to to the Wasserstein space is of particular interest, since optimization on Wasserstein space is…