Archives AI News

XGBoost Forecasting of NEPSE Index Log Returns with Walk Forward Validation

arXiv:2601.08896v1 Announce Type: new Abstract: This study develops a robust machine learning framework for one-step-ahead forecasting of daily log-returns in the Nepal Stock Exchange (NEPSE) Index using the XGBoost regressor. A comprehensive feature set is engineered, including lagged log-returns (up…

Meta-learning to Address Data Shift in Time Series Classification

arXiv:2601.09018v1 Announce Type: new Abstract: Across engineering and scientific domains, traditional deep learning (TDL) models perform well when training and test data share the same distribution. However, the dynamic nature of real-world data, broadly termed textit{data shift}, renders TDL models…

Dynamics-Aligned Latent Imagination in Contextual World Models for Zero-Shot Generalization

arXiv:2508.20294v2 Announce Type: replace Abstract: Real-world reinforcement learning demands adaptation to unseen environmental conditions without costly retraining. Contextual Markov Decision Processes (cMDP) model this challenge, but existing methods often require explicit context variables (e.g., friction, gravity), limiting their use when…

Layer-Parallel Training for Transformers

arXiv:2601.09026v1 Announce Type: new Abstract: We present a new training methodology for transformers using a multilevel, layer-parallel approach. Through a neural ODE formulation of transformers, our application of a multilevel parallel-in-time algorithm for the forward and backpropagation phases of training…

When do spectral gradient updates help in deep learning?

arXiv:2512.04299v2 Announce Type: replace Abstract: Spectral gradient methods, such as the recently popularized Muon optimizer, are a promising alternative to standard Euclidean gradient descent for training deep neural networks and transformers, but it is still unclear in which regimes they…

SCaLE: Switching Cost aware Learning and Exploration

arXiv:2601.09042v1 Announce Type: new Abstract: This work addresses the fundamental problem of unbounded metric movement costs in bandit online convex optimization, by considering high-dimensional dynamic quadratic hitting costs and $ell_2$-norm switching costs in a noisy bandit feedback model. For a…

Deep Incomplete Multi-View Clustering via Hierarchical Imputation and Alignment

arXiv:2601.09051v1 Announce Type: new Abstract: Incomplete multi-view clustering (IMVC) aims to discover shared cluster structures from multi-view data with partial observations. The core challenges lie in accurately imputing missing views without introducing bias, while maintaining semantic consistency across views and…