Variance-Aware Prior-Based Tree Policies for Monte Carlo Tree Search
arXiv:2512.21648v1 Announce Type: new Abstract: Monte Carlo Tree Search (MCTS) has profoundly influenced reinforcement learning (RL) by integrating planning and learning in tasks requiring long-horizon reasoning, exemplified by the AlphaZero family of algorithms. Central to MCTS is the search strategy,…
