Archives AI News

Geometric Scaling of Bayesian Inference in LLMs

arXiv:2512.23752v1 Announce Type: new Abstract: Recent work has shown that small transformers trained in controlled “wind-tunnel” settings can implement exact Bayesian inference, and that their training dynamics produce a geometric substrate — low-dimensional value manifolds and progressively orthogonal keys —…

January 1, 2026

Coordinate Matrix Machine: A Human-level Concept Learning to Classify Very Similar Documents

arXiv:2512.23749v1 Announce Type: new Abstract: Human-level concept learning argues that humans typically learn new concepts from a single example, whereas machine learning algorithms typically require hundreds of samples to learn a single concept. Our brain subconsciously identifies important features and…

January 1, 2026

A Review of Diffusion-based Simulation-Based Inference: Foundations and Applications in Non-Ideal Data Scenarios

arXiv:2512.23748v1 Announce Type: new Abstract: For complex simulation problems, inferring parameters of scientific interest often precludes the use of classical likelihood-based techniques due to intractable likelihood functions. Simulation-based inference (SBI) methods forego the need for explicit likelihoods by directly utilizing…

January 1, 2026

NeuroPMD: Neural Fields for Density Estimation on Product Manifolds

arXiv:2501.02994v2 Announce Type: replace-cross Abstract: We propose a novel deep neural network methodology for density estimation on product Riemannian manifold domains. In our approach, the network directly parameterizes the unknown density function and is trained using a penalized maximum likelihood…

January 1, 2026

Drift-Based Dataset Stability Benchmark

arXiv:2512.23762v1 Announce Type: new Abstract: Machine learning (ML) represents an efficient and popular approach for network traffic classification. However, network traffic classification is a challenging domain, and trained models may degrade soon after deployment due to the obsolete datasets and…

January 1, 2026

PERK: Long-Context Reasoning as Parameter-Efficient Test-Time Learning

arXiv:2507.06415v2 Announce Type: replace-cross Abstract: Long-context reasoning requires accurately identifying relevant information in extensive, noisy input contexts. Previous research shows that using test-time learning to encode context directly into model parameters can effectively enable reasoning over noisy information. However, meta-learning…

January 1, 2026

Neural Optimal Design of Experiment for Inverse Problems

arXiv:2512.23763v1 Announce Type: new Abstract: We introduce Neural Optimal Design of Experiments, a learning-based framework for optimal experimental design in inverse problems that avoids classical bilevel optimization and indirect sparsity regularization. NODE jointly trains a neural reconstruction model and a…

January 1, 2026

A Comprehensive Study of Deep Learning Model Fixing Approaches

arXiv:2512.23745v1 Announce Type: new Abstract: Deep Learning (DL) has been widely adopted in diverse industrial domains, including autonomous driving, intelligent healthcare, and aided programming. Like traditional software, DL systems are also prone to faults, whose malfunctioning may expose users to…

January 1, 2026

Exploring Cumulative Effects in Survival Data Using Deep Learning Networks

arXiv:2512.23764v1 Announce Type: new Abstract: In epidemiological research, modeling the cumulative effects of time-dependent exposures on survival outcomes presents a challenge due to their intricate temporal dynamics. Conventional spline-based statistical methods, though effective, require repeated data transformation for each spline…

January 1, 2026

RAIR: A Rule-Aware Benchmark Uniting Challenging Long-Tail and Visual Salience Subset for E-commerce Relevance Assessment

arXiv:2512.24943v1 Announce Type: cross Abstract: Search relevance plays a central role in web e-commerce. While large language models (LLMs) have shown significant results on relevance task, existing benchmarks lack sufficient complexity for comprehensive model assessment, resulting in an absence of…

January 1, 2026