Archives AI News

YRC-Bench: A Benchmark for Learning to Coordinate with Experts

arXiv:2502.09583v3 Announce Type: replace Abstract: When deployed in the real world, AI agents will inevitably face challenges that exceed their individual capabilities. A critical component of AI safety is an agent’s ability to recognize when it is likely to fail…

Spike-timing-dependent Hebbian learning as noisy gradient descent

arXiv:2505.10272v3 Announce Type: replace Abstract: Hebbian learning is a key principle underlying learning in biological neural networks. We relate a Hebbian spike-timing-dependent plasticity rule to noisy gradient descent with respect to a non-convex loss function on the probability simplex. Despite…

InfGraND: An Influence-Guided GNN-to-MLP Knowledge Distillation

arXiv:2601.08033v1 Announce Type: new Abstract: Graph Neural Networks (GNNs) are the go-to model for graph data analysis. However, GNNs rely on two key operations – aggregation and update, which can pose challenges for low-latency inference tasks or resource-constrained scenarios. Simple…

On the Sample Complexity of Differentially Private Policy Optimization

arXiv:2510.21060v2 Announce Type: replace Abstract: Policy optimization (PO) is a cornerstone of modern reinforcement learning (RL), with diverse applications spanning robotics, healthcare, and large language model training. The increasing deployment of PO in sensitive domains, however, raises significant privacy concerns.…

Learning Steerable Clarification Policies with Collaborative Self-play

arXiv:2512.04068v2 Announce Type: replace Abstract: To handle underspecified or ambiguous queries, AI assistants need a policy for managing their uncertainty to determine (a) when to guess the user intent and answer directly, (b) when to enumerate and answer multiple possible…

LUT-Compiled Kolmogorov-Arnold Networks for Lightweight DoS Detection on IoT Edge Devices

arXiv:2601.08044v1 Announce Type: new Abstract: Denial-of-Service (DoS) attacks pose a critical threat to Internet of Things (IoT) ecosystems, yet deploying effective intrusion detection on resource-constrained edge devices remains challenging. Kolmogorov-Arnold Networks (KANs) offer a compact alternative to Multi-Layer Perceptrons (MLPs)…