Archives: AI News

A Simple Method for PMF Estimation on Large Supports

arXiv:2510.15132v1 Announce Type: new Abstract: We study nonparametric estimation of a probability mass function (PMF) on a large discrete support, where the PMF is multi-modal and heavy-tailed. The core idea is to treat the empirical PMF as a signal on…

October 20, 2025

Learning to Interpret Weight Differences in Language Models

arXiv:2510.05092v2 Announce Type: replace Abstract: Finetuning (pretrained) language models is a standard approach for updating their internal parametric knowledge and specializing them to new tasks and domains. However, the corresponding model weight changes (“weight diffs”) are not generally interpretable. While…

October 20, 2025

Predicting the Unpredictable: Reproducible BiLSTM Forecasting of Incident Counts in the Global Terrorism Database (GTD)

arXiv:2510.15136v1 Announce Type: new Abstract: We study short-horizon forecasting of weekly terrorism incident counts using the Global Terrorism Database (GTD, 1970–2016). We build a reproducible pipeline with fixed time-based splits and evaluate a Bidirectional LSTM (BiLSTM) against strong classical anchors…

October 20, 2025

Improving Intrusion Detection with Domain-Invariant Representation Learning in Latent Space

arXiv:2312.17300v5 Announce Type: replace-cross Abstract: Zero-day anomaly detection is critical in industrial applications where novel, unforeseen threats can compromise system integrity and safety. Traditional detection systems often fail to identify these unseen anomalies due to their reliance on in-distribution data.…

October 20, 2025

Policy Transfer Ensures Fast Learning for Continuous-Time LQR with Entropy Regularization

arXiv:2510.15165v1 Announce Type: new Abstract: Reinforcement Learning (RL) enables agents to learn optimal decision-making strategies through interaction with an environment, yet training from scratch on complex tasks can be highly inefficient. Transfer learning (TL), widely successful in large language models…

October 20, 2025

Uncertainty Quantification for Prior-Data Fitted Networks using Martingale Posteriors

arXiv:2505.11325v2 Announce Type: replace-cross Abstract: Prior-data fitted networks (PFNs) have emerged as promising foundation models for prediction from tabular data sets, achieving state-of-the-art performance on small to moderate data sizes without tuning. While PFNs are motivated by Bayesian ideas, they…

October 20, 2025

A simple mean field model of feature learning

arXiv:2510.15174v1 Announce Type: new Abstract: Feature learning (FL), where neural networks adapt their internal representations during training, remains poorly understood. Using methods from statistical physics, we derive a tractable, self-consistent mean-field (MF) theory for the Bayesian posterior of two-layer non-linear…

October 20, 2025

Bayesian Ego-graph inference for Networked Multi-Agent Reinforcement Learning

arXiv:2509.16606v2 Announce Type: replace-cross Abstract: In networked multi-agent reinforcement learning (Networked-MARL), decentralized agents must act under local observability and constrained communication over fixed physical graphs. Existing methods often assume static neighborhoods, limiting adaptability to dynamic or heterogeneous environments. While centralized…

October 20, 2025

Finding geodesics with the Deep Ritz method

arXiv:2510.15177v1 Announce Type: new Abstract: Geodesic problems involve computing trajectories between prescribed initial and final states to minimize a user-defined measure of distance, cost, or energy. They arise throughout physics and engineering — for instance, in determining optimal paths through…

October 20, 2025

Stochastic Optimization with Random Search

arXiv:2510.15610v1 Announce Type: cross Abstract: We revisit random search for stochastic optimization, where only noisy function evaluations are available. We show that the method works under weaker smoothness assumptions than previously considered, and that stronger assumptions enable improved guarantees. In…

October 20, 2025