Archives AI News

Information Shapes Koopman Representation

arXiv:2510.13025v1 Announce Type: new Abstract: The Koopman operator provides a powerful framework for modeling dynamical systems and has attracted growing interest from the machine learning community. However, its infinite-dimensional nature makes identifying suitable finite-dimensional subspaces challenging, especially for deep architectures.…

Conditional Distribution Compression via the Kernel Conditional Mean Embedding

arXiv:2504.10139v3 Announce Type: replace-cross Abstract: Existing distribution compression methods, like Kernel Herding (KH), were originally developed for unlabelled data. However, no existing approach directly compresses the conditional distribution of labelled data. To address this gap, we first introduce the Average…

Bayesian Double Descent

arXiv:2507.07338v3 Announce Type: replace-cross Abstract: Double descent is a phenomenon of over-parameterized statistical models such as deep neural networks which have a re-descending property in their risk function. As the complexity of the model increases, risk exhibits a U-shaped region…

Randomness and Interpolation Improve Gradient Descent

arXiv:2510.13040v1 Announce Type: new Abstract: Based on Stochastic Gradient Descent (SGD), the paper introduces two optimizers, named Interpolational Accelerating Gradient Descent (IAGD) as well as Noise-Regularized Stochastic Gradient Descent (NRSGD). IAGD leverages second-order Newton Interpolation to expedite the convergence process…

On Pretraining for Project-Level Code Completion

arXiv:2510.13697v1 Announce Type: cross Abstract: Repository-level pretraining is commonly used to enable large language models for code to leverage codebase-wide context. This enhances their ability to generate accurate and context-aware code completions. In this work, we investigate how different repository-processing…

Time-Varying Optimization for Streaming Data Via Temporal Weighting

arXiv:2510.13052v1 Announce Type: new Abstract: Classical optimization theory deals with fixed, time-invariant objective functions. However, time-varying optimization has emerged as an important subject for decision-making in dynamic environments. In this work, we study the problem of learning from streaming data…

PriorGuide: Test-Time Prior Adaptation for Simulation-Based Inference

arXiv:2510.13763v1 Announce Type: cross Abstract: Amortized simulator-based inference offers a powerful framework for tackling Bayesian inference in computational fields such as engineering or neuroscience, increasingly leveraging modern generative methods like diffusion models to map observed data to model parameters or…