Archives AI News

Safety-Biased Policy Optimisation: Towards Hard-Constrained Reinforcement Learning via Trust Regions

arXiv:2512.23770v1 Announce Type: new Abstract: Reinforcement learning (RL) in safety-critical domains requires agents to maximise rewards while strictly adhering to safety constraints. Existing approaches, such as Lagrangian and projection-based methods, often either fail to ensure near-zero safety violations or sacrifice…

January 1, 2026

Deep sequence models tend to memorize geometrically; it is unclear why

arXiv:2510.26745v2 Announce Type: replace Abstract: Deep sequence models are said to store atomic facts predominantly in the form of associative memory: a brute-force lookup of co-occurring entities. We identify a dramatically different form of storage of atomic facts that we…

January 1, 2026

Machine learning for option pricing: an empirical investigation of network architectures

arXiv:2307.07657v2 Announce Type: replace-cross Abstract: We consider the supervised learning problem of learning the price of an option or the implied volatility given appropriate input data (model parameters) and corresponding output data (option prices or implied volatilities). The majority of…

January 1, 2026

Tazza: Shuffling Neural Network Parameters for Secure and Private Federated Learning

arXiv:2412.07454v3 Announce Type: replace Abstract: Federated learning enables decentralized model training without sharing raw data, preserving data privacy. However, its vulnerability towards critical security threats, such as gradient inversion and model poisoning by malicious clients, remain unresolved. Existing solutions often…

January 1, 2026

Learning Network Dismantling Without Handcrafted Inputs

arXiv:2508.00706v2 Announce Type: replace Abstract: The application of message-passing Graph Neural Networks has been a breakthrough for important network science problems. However, the competitive performance often relies on using handcrafted structural features as inputs, which increases computational cost and introduces…

January 1, 2026

Nonlinear Noise2Noise for Efficient Monte Carlo Denoiser Training

arXiv:2512.24794v1 Announce Type: cross Abstract: The Noise2Noise method allows for training machine learning-based denoisers with pairs of input and target images where both the input and target can be noisy. This removes the need for training with clean target images,…

January 1, 2026

Optimal Approximation — Smoothness Tradeoffs for Soft-Max Functions

arXiv:2010.11450v2 Announce Type: replace Abstract: A soft-max function has two main efficiency measures: (1) approximation – which corresponds to how well it approximates the maximum function, (2) smoothness – which shows how sensitive it is to changes of its input.…

January 1, 2026

Learning Coupled System Dynamics under Incomplete Physical Constraints and Missing Data

arXiv:2512.23761v1 Announce Type: new Abstract: Advances in data acquisition and computational methods have accelerated the use of differential equation based modelling for complex systems. Such systems are often described by coupled (or more) variables, yet governing equation is typically available…

January 1, 2026

Training Language Models to Explain Their Own Computations

arXiv:2511.08579v2 Announce Type: replace-cross Abstract: Can language models (LMs) learn to faithfully describe their internal computations? Are they better able to describe themselves than other models? We study the extent to which LMs’ privileged access to their own internals can…

January 1, 2026

Generalized Regularized Evidential Deep Learning Models: Theory and Comprehensive Evaluation

arXiv:2512.23753v1 Announce Type: new Abstract: Evidential deep learning (EDL) models, based on Subjective Logic, introduce a principled and computationally efficient way to make deterministic neural networks uncertainty-aware. The resulting evidential models can quantify fine-grained uncertainty using learned evidence. However, the…

January 1, 2026