Archives AI News

Split-on-Share: Mixture of Sparse Experts for Task-Agnostic Continual Learning

arXiv:2601.17616v2 Announce Type: replace Abstract: Continual learning in Large Language Models (LLMs) is hindered by the plasticity-stability dilemma, where acquiring new capabilities often leads to catastrophic forgetting of previous knowledge. Existing methods typically treat parameters uniformly, failing to distinguish between…

Time-series forecasting through the lens of dynamics

arXiv:2507.15774v2 Announce Type: replace Abstract: While deep learning is undergoing a homogenization across modalities led by Transformers, these models are still challenged by shallow linear models in the time-series forecasting task. Our hypothesis is that models should learn a direct link…