Archives AI News

YRC-Bench: A Benchmark for Learning to Coordinate with Experts

arXiv:2502.09583v3 Announce Type: replace Abstract: When deployed in the real world, AI agents will inevitably face challenges that exceed their individual capabilities. A critical component of AI safety is an agent’s ability to recognize when it is likely to fail…

January 14, 2026

DataScribe: An AI-Native, Policy-Aligned Web Platform for Multi-Objective Materials Design and Discovery

arXiv:2601.07966v1 Announce Type: new Abstract: The acceleration of materials discovery requires digital platforms that go beyond data repositories to embed learning, optimization, and decision-making directly into research workflows. We introduce DataScribe, an AI-native, cloud-based materials discovery platform that unifies heterogeneous…

January 14, 2026

Spike-timing-dependent Hebbian learning as noisy gradient descent

arXiv:2505.10272v3 Announce Type: replace Abstract: Hebbian learning is a key principle underlying learning in biological neural networks. We relate a Hebbian spike-timing-dependent plasticity rule to noisy gradient descent with respect to a non-convex loss function on the probability simplex. Despite…

January 14, 2026

Beyond the Next Port: A Multi-Task Transformer for Forecasting Future Voyage Segment Durations

arXiv:2601.08013v1 Announce Type: new Abstract: Accurate forecasts of segment-level sailing durations are fundamental to enhancing maritime schedule reliability and optimizing long-term port operations. However, conventional estimated time of arrival (ETA) models are primarily designed for the immediate next port of…

January 14, 2026

Regression-adjusted Monte Carlo Estimators for Shapley Values and Probabilistic Values

arXiv:2506.11849v2 Announce Type: replace Abstract: With origins in game theory, probabilistic values like Shapley values, Banzhaf values, and semi-values have emerged as a central tool in explainable AI. They are used for feature attribution, data attribution, data valuation, and more.…

January 14, 2026

InfGraND: An Influence-Guided GNN-to-MLP Knowledge Distillation

arXiv:2601.08033v1 Announce Type: new Abstract: Graph Neural Networks (GNNs) are the go-to model for graph data analysis. However, GNNs rely on two key operations, aggregation and update, which can pose challenges for low-latency inference tasks or resource-constrained scenarios. Simple…

January 14, 2026

On the Sample Complexity of Differentially Private Policy Optimization

arXiv:2510.21060v2 Announce Type: replace Abstract: Policy optimization (PO) is a cornerstone of modern reinforcement learning (RL), with diverse applications spanning robotics, healthcare, and large language model training. The increasing deployment of PO in sensitive domains, however, raises significant privacy concerns.…

January 14, 2026

Riemannian Zeroth-Order Gradient Estimation with Structure-Preserving Metrics for Geodesically Incomplete Manifolds

arXiv:2601.08039v1 Announce Type: new Abstract: In this paper, we study Riemannian zeroth-order optimization in settings where the underlying Riemannian metric $g$ is geodesically incomplete, and the goal is to approximate stationary points with respect to this incomplete metric. To address…

January 14, 2026

Learning Steerable Clarification Policies with Collaborative Self-play

arXiv:2512.04068v2 Announce Type: replace Abstract: To handle underspecified or ambiguous queries, AI assistants need a policy for managing their uncertainty to determine (a) when to guess the user intent and answer directly, (b) when to enumerate and answer multiple possible…

January 14, 2026

LUT-Compiled Kolmogorov-Arnold Networks for Lightweight DoS Detection on IoT Edge Devices

arXiv:2601.08044v1 Announce Type: new Abstract: Denial-of-Service (DoS) attacks pose a critical threat to Internet of Things (IoT) ecosystems, yet deploying effective intrusion detection on resource-constrained edge devices remains challenging. Kolmogorov-Arnold Networks (KANs) offer a compact alternative to Multi-Layer Perceptrons (MLPs)…

January 14, 2026