Archives AI News

Calibrating and Rotating: A Unified Framework for Weight Conditioning in PEFT

arXiv:2511.00051v1 Announce Type: new Abstract: Parameter-Efficient Fine-Tuning (PEFT) methods are crucial for adapting large pre-trained models. Among these, LoRA is considered a foundational approach. Building on this, the influential DoRA method enhances performance by decomposing weight updates into magnitude and…

Scientific Machine Learning with Kolmogorov-Arnold Networks

arXiv:2507.22959v2 Announce Type: replace Abstract: The field of scientific machine learning, which originally utilized multilayer perceptrons (MLPs), is increasingly adopting Kolmogorov-Arnold Networks (KANs) for data encoding. This shift is driven by the limitations of MLPs, including poor interpretability, fixed activation…

Amortized Active Generation of Pareto Sets

arXiv:2510.21052v2 Announce Type: replace Abstract: We introduce active generation of Pareto sets (A-GPS), a new framework for online discrete black-box multi-objective optimization (MOO). A-GPS learns a generative model of the Pareto set that supports a-posteriori conditioning on user preferences. The…

Gymnasium: A Standard Interface for Reinforcement Learning Environments

arXiv:2407.17032v4 Announce Type: replace Abstract: Reinforcement Learning (RL) is a continuously growing field that has the potential to revolutionize many areas of artificial intelligence. However, despite its promise, RL research is often hindered by the lack of standardization in environment…

Stochastic Subspace Descent Accelerated via Bi-fidelity Line Search

arXiv:2505.00162v2 Announce Type: replace Abstract: Efficient optimization remains a fundamental challenge across numerous scientific and engineering domains, especially when objective function and gradient evaluations are computationally expensive. While zeroth-order optimization methods offer effective approaches when gradients are inaccessible, their practical…

ReLaX-Net: Reusing Layers for Parameter-Efficient Physical Neural Networks

arXiv:2511.00044v1 Announce Type: new Abstract: Physical Neural Networks (PNN) are promising platforms for next-generation computing systems. However, recent advances in digital neural network performance are largely driven by the rapid growth in the number of trainable parameters and, so far,…

Partial Trace-Class Bayesian Neural Networks

arXiv:2511.01628v1 Announce Type: cross Abstract: Bayesian neural networks (BNNs) allow rigorous uncertainty quantification in deep learning, but often come at a prohibitive computational cost. We propose three different innovative architectures of partial trace-class Bayesian neural networks (PaTraC BNNs) that enable…