Archives AI News

Reinforcement Learning Methods for Neighborhood Selection in Local Search

arXiv:2601.07948v1 Announce Type: new Abstract: Reinforcement learning has recently gained traction as a means to improve combinatorial optimization methods, yet its effectiveness within local search metaheuristics specifically remains comparatively underexamined. In this study, we evaluate a range of reinforcement learning-based…

YRC-Bench: A Benchmark for Learning to Coordinate with Experts

arXiv:2502.09583v3 Announce Type: replace Abstract: When deployed in the real world, AI agents will inevitably face challenges that exceed their individual capabilities. A critical component of AI safety is an agent’s ability to recognize when it is likely to fail…

Spike-timing-dependent Hebbian learning as noisy gradient descent

arXiv:2505.10272v3 Announce Type: replace Abstract: Hebbian learning is a key principle underlying learning in biological neural networks. We relate a Hebbian spike-timing-dependent plasticity rule to noisy gradient descent with respect to a non-convex loss function on the probability simplex. Despite…

InfGraND: An Influence-Guided GNN-to-MLP Knowledge Distillation

arXiv:2601.08033v1 Announce Type: new Abstract: Graph Neural Networks (GNNs) are the go-to model for graph data analysis. However, GNNs rely on two key operations – aggregation and update, which can pose challenges for low-latency inference tasks or resource-constrained scenarios. Simple…

On the Sample Complexity of Differentially Private Policy Optimization

arXiv:2510.21060v2 Announce Type: replace Abstract: Policy optimization (PO) is a cornerstone of modern reinforcement learning (RL), with diverse applications spanning robotics, healthcare, and large language model training. The increasing deployment of PO in sensitive domains, however, raises significant privacy concerns.…