Archives AI News

Transformers know more than they can tell — Learning the Collatz sequence

arXiv:2511.10811v1 Announce Type: new Abstract: We investigate transformer prediction of long Collatz steps, a complex arithmetic function that maps odd integers to their distant successors in the Collatz sequence ( $u_{n+1}=u_n/2$ if $u_n$ is even, $u_{n+1}=(3u_n+1)/2$ if $u_n$ is odd).…

Towards Uncertainty Quantification in Generative Model Learning

arXiv:2511.10710v1 Announce Type: new Abstract: While generative models have become increasingly prevalent across various domains, fundamental concerns regarding their reliability persist. A crucial yet understudied aspect of these models is the uncertainty quantification surrounding their distribution approximation capabilities. Current evaluation…

Bias-Restrained Prefix Representation Finetuning for Mathematical Reasoning

arXiv:2511.10707v1 Announce Type: new Abstract: Parameter-Efficient finetuning (PEFT) enhances model performance on downstream tasks by updating a minimal subset of parameters. Representation finetuning (ReFT) methods further improve efficiency by freezing model weights and optimizing internal representations with fewer parameters than…

Differentiable Sparse Identification of Lagrangian Dynamics

arXiv:2511.10706v1 Announce Type: new Abstract: Data-driven discovery of governing equations from data remains a fundamental challenge in nonlinear dynamics. Although sparse regression techniques have advanced system identification, they struggle with rational functions and noise sensitivity in complex mechanical systems. The…

Towards Universal Neural Operators through Multiphysics Pretraining

arXiv:2511.10829v1 Announce Type: new Abstract: Although neural operators are widely used in data-driven physical simulations, their training remains computationally expensive. Recent advances address this issue via downstream learning, where a model pretrained on simpler problems is fine-tuned on more complex…

Evolutionary Retrofitting

arXiv:2410.11330v2 Announce Type: replace Abstract: AfterLearnER (After Learning Evolutionary Retrofitting) consists in applying evolutionary optimization to refine fully trained machine learning models by optimizing a set of carefully chosen parameters or hyperparameters of the model, with respect to some actual,…

Benchmarking Quantum Kernels Across Diverse and Complex Data

arXiv:2511.10831v1 Announce Type: new Abstract: Quantum kernel methods are a promising branch of quantum machine learning, yet their practical advantage on diverse, high-dimensional, real-world data remains unverified. Current research has largely been limited to low-dimensional or synthetic datasets, preventing a…