Archives AI News

DriftGuard: A Hierarchical Framework for Concept Drift Detection and Remediation in Supply Chain Forecasting

arXiv:2601.08928v1 Announce Type: new Abstract: Supply chain forecasting models degrade over time as real-world conditions change. Promotions shift, consumer preferences evolve, and supply disruptions alter demand patterns, causing what is known as concept drift. This silent degradation leads to stockouts…

January 15, 2026

XGBoost Forecasting of NEPSE Index Log Returns with Walk Forward Validation

arXiv:2601.08896v1 Announce Type: new Abstract: This study develops a robust machine learning framework for one-step-ahead forecasting of daily log-returns in the Nepal Stock Exchange (NEPSE) Index using the XGBoost regressor. A comprehensive feature set is engineered, including lagged log-returns (up…

January 15, 2026

Improving AI-Based Canine Heart Disease Diagnosis with Expert-Consensus Auscultation Labeling

arXiv:2507.05950v2 Announce Type: replace Abstract: Noisy labels pose significant challenges for AI model training in veterinary medicine. This study examines expert assessment ambiguity in canine auscultation data, highlights the negative impact of label noise on classification performance, and introduces methods…

January 15, 2026

Meta-learning to Address Data Shift in Time Series Classification

arXiv:2601.09018v1 Announce Type: new Abstract: Across engineering and scientific domains, traditional deep learning (TDL) models perform well when training and test data share the same distribution. However, the dynamic nature of real-world data, broadly termed textit{data shift}, renders TDL models…

January 15, 2026

Dynamics-Aligned Latent Imagination in Contextual World Models for Zero-Shot Generalization

arXiv:2508.20294v2 Announce Type: replace Abstract: Real-world reinforcement learning demands adaptation to unseen environmental conditions without costly retraining. Contextual Markov Decision Processes (cMDP) model this challenge, but existing methods often require explicit context variables (e.g., friction, gravity), limiting their use when…

January 15, 2026

Layer-Parallel Training for Transformers

arXiv:2601.09026v1 Announce Type: new Abstract: We present a new training methodology for transformers using a multilevel, layer-parallel approach. Through a neural ODE formulation of transformers, our application of a multilevel parallel-in-time algorithm for the forward and backpropagation phases of training…

January 15, 2026

When do spectral gradient updates help in deep learning?

arXiv:2512.04299v2 Announce Type: replace Abstract: Spectral gradient methods, such as the recently popularized Muon optimizer, are a promising alternative to standard Euclidean gradient descent for training deep neural networks and transformers, but it is still unclear in which regimes they…

January 15, 2026

SCaLE: Switching Cost aware Learning and Exploration

arXiv:2601.09042v1 Announce Type: new Abstract: This work addresses the fundamental problem of unbounded metric movement costs in bandit online convex optimization, by considering high-dimensional dynamic quadratic hitting costs and $ell_2$-norm switching costs in a noisy bandit feedback model. For a…

January 15, 2026

Template-Based Probes Are Imperfect Lenses for Counterfactual Bias Evaluation in LLMs

arXiv:2404.03471v5 Announce Type: replace-cross Abstract: Bias in large language models (LLMs) has many forms, from overt discrimination to implicit stereotypes. Counterfactual bias evaluation is a widely used approach to quantifying bias and often relies on template-based probes that explicitly state…

January 15, 2026

Deep Incomplete Multi-View Clustering via Hierarchical Imputation and Alignment

arXiv:2601.09051v1 Announce Type: new Abstract: Incomplete multi-view clustering (IMVC) aims to discover shared cluster structures from multi-view data with partial observations. The core challenges lie in accurately imputing missing views without introducing bias, while maintaining semantic consistency across views and…

January 15, 2026