Archives AI News

Machine learning for option pricing: an empirical investigation of network architectures

arXiv:2307.07657v2 Announce Type: replace-cross Abstract: We consider the supervised learning problem of learning the price of an option or the implied volatility given appropriate input data (model parameters) and corresponding output data (option prices or implied volatilities). The majority of…

January 1, 2026

Tazza: Shuffling Neural Network Parameters for Secure and Private Federated Learning

arXiv:2412.07454v3 Announce Type: replace Abstract: Federated learning enables decentralized model training without sharing raw data, preserving data privacy. However, its vulnerability towards critical security threats, such as gradient inversion and model poisoning by malicious clients, remains unresolved. Existing solutions often…

January 1, 2026

Learning Network Dismantling Without Handcrafted Inputs

arXiv:2508.00706v2 Announce Type: replace Abstract: The application of message-passing Graph Neural Networks has been a breakthrough for important network science problems. However, the competitive performance often relies on using handcrafted structural features as inputs, which increases computational cost and introduces…

January 1, 2026

Nonlinear Noise2Noise for Efficient Monte Carlo Denoiser Training

arXiv:2512.24794v1 Announce Type: cross Abstract: The Noise2Noise method allows for training machine learning-based denoisers with pairs of input and target images where both the input and target can be noisy. This removes the need for training with clean target images,…

January 1, 2026

Optimal Approximation — Smoothness Tradeoffs for Soft-Max Functions

arXiv:2010.11450v2 Announce Type: replace Abstract: A soft-max function has two main efficiency measures: (1) approximation – which corresponds to how well it approximates the maximum function, (2) smoothness – which shows how sensitive it is to changes of its input.…

January 1, 2026

Learning Coupled System Dynamics under Incomplete Physical Constraints and Missing Data

arXiv:2512.23761v1 Announce Type: new Abstract: Advances in data acquisition and computational methods have accelerated the use of differential equation based modelling for complex systems. Such systems are often described by two (or more) coupled variables, yet the governing equation is typically available…

January 1, 2026

Training Language Models to Explain Their Own Computations

arXiv:2511.08579v2 Announce Type: replace-cross Abstract: Can language models (LMs) learn to faithfully describe their internal computations? Are they better able to describe themselves than other models? We study the extent to which LMs’ privileged access to their own internals can…

January 1, 2026

Generalized Regularized Evidential Deep Learning Models: Theory and Comprehensive Evaluation

arXiv:2512.23753v1 Announce Type: new Abstract: Evidential deep learning (EDL) models, based on Subjective Logic, introduce a principled and computationally efficient way to make deterministic neural networks uncertainty-aware. The resulting evidential models can quantify fine-grained uncertainty using learned evidence. However, the…

January 1, 2026

HINTS: Extraction of Human Insights from Time-Series Without External Sources

arXiv:2512.23755v1 Announce Type: new Abstract: Human decision-making, emotions, and collective psychology are complex factors that shape the temporal dynamics observed in financial and economic systems. Many recent time series forecasting models leverage external sources (e.g., news and social media) to…

January 1, 2026

Geometric Scaling of Bayesian Inference in LLMs

arXiv:2512.23752v1 Announce Type: new Abstract: Recent work has shown that small transformers trained in controlled “wind-tunnel” settings can implement exact Bayesian inference, and that their training dynamics produce a geometric substrate — low-dimensional value manifolds and progressively orthogonal keys —…

January 1, 2026