Archives AI News

Yggdrasil: Bridging Dynamic Speculation and Static Runtime for Latency-Optimal Tree-Based LLM Decoding

arXiv:2512.23858v1 Announce Type: new Abstract: Speculative decoding improves LLM inference by generating and verifying multiple tokens in parallel, but existing systems suffer from suboptimal performance due to a mismatch between dynamic speculation and static runtime assumptions. We present Yggdrasil, a…

January 1, 2026

Minibatch Optimal Transport and Perplexity Bound Estimation in Discrete Flow Matching

arXiv:2411.00759v3 Announce Type: replace Abstract: Discrete flow matching, a recent framework for modeling categorical data, has shown competitive performance with autoregressive models. However, unlike continuous flow matching, the rectification strategy cannot be applied due to the stochasticity of discrete paths,…

January 1, 2026

Probing the Limits of Compressive Memory: A Study of Infini-Attention in Small-Scale Pretraining

arXiv:2512.23862v1 Announce Type: new Abstract: This study investigates small-scale pretraining for Small Language Models (SLMs) to enable efficient use of limited data and compute, improve accessibility in low-resource settings and reduce costs. To enhance long-context extrapolation in compact models, we…

January 1, 2026

Local-Cloud Inference Offloading for LLMs in Multi-Modal, Multi-Task, Multi-Dialogue Settings

arXiv:2502.11007v4 Announce Type: replace Abstract: Compared to traditional machine learning models, recent large language models (LLMs) can exhibit multi-task-solving capabilities through multiple dialogues and multi-modal data sources. These unique characteristics of LLMs, together with their large model size, make their…

January 1, 2026

Max-Entropy Reinforcement Learning with Flow Matching and A Case Study on LQR

arXiv:2512.23870v1 Announce Type: new Abstract: Soft actor-critic (SAC) is a popular algorithm for max-entropy reinforcement learning. In practice, the energy-based policies in SAC are often approximated using simple policy classes for efficiency, sacrificing the expressiveness and robustness. In this paper,…

January 1, 2026

Deep sequence models tend to memorize geometrically; it is unclear why

arXiv:2510.26745v2 Announce Type: replace Abstract: Deep sequence models are said to store atomic facts predominantly in the form of associative memory: a brute-force lookup of co-occurring entities. We identify a dramatically different form of storage of atomic facts that we…

January 1, 2026

Machine learning for option pricing: an empirical investigation of network architectures

arXiv:2307.07657v2 Announce Type: replace-cross Abstract: We consider the supervised learning problem of learning the price of an option or the implied volatility given appropriate input data (model parameters) and corresponding output data (option prices or implied volatilities). The majority of…

January 1, 2026

Tazza: Shuffling Neural Network Parameters for Secure and Private Federated Learning

arXiv:2412.07454v3 Announce Type: replace Abstract: Federated learning enables decentralized model training without sharing raw data, preserving data privacy. However, its vulnerability towards critical security threats, such as gradient inversion and model poisoning by malicious clients, remain unresolved. Existing solutions often…

January 1, 2026

Learning Network Dismantling Without Handcrafted Inputs

arXiv:2508.00706v2 Announce Type: replace Abstract: The application of message-passing Graph Neural Networks has been a breakthrough for important network science problems. However, the competitive performance often relies on using handcrafted structural features as inputs, which increases computational cost and introduces…

January 1, 2026

Nonlinear Noise2Noise for Efficient Monte Carlo Denoiser Training

arXiv:2512.24794v1 Announce Type: cross Abstract: The Noise2Noise method allows for training machine learning-based denoisers with pairs of input and target images where both the input and target can be noisy. This removes the need for training with clean target images,…

January 1, 2026