Archives AI News

Ada-MoGE: Adaptive Mixture of Gaussian Expert Model for Time Series Forecasting

arXiv:2512.02061v1 Announce Type: new Abstract: Multivariate time series forecasts are widely used, such as industrial, transportation and financial forecasts. However, the dominant frequencies in time series may shift with the evolving spectral distribution of the data. Traditional Mixture of Experts…

DPWMixer: Dual-Path Wavelet Mixer for Long-Term Time Series Forecasting

arXiv:2512.02070v1 Announce Type: new Abstract: Long-term time series forecasting (LTSF) is a critical task in computational intelligence. While Transformer-based models effectively capture long-range dependencies, they often suffer from quadratic complexity and overfitting due to data sparsity. Conversely, efficient linear models…

PIBNet: a Physics-Inspired Boundary Network for Multiple Scattering Simulations

arXiv:2512.02049v1 Announce Type: new Abstract: The boundary element method (BEM) provides an efficient numerical framework for solving multiple scattering problems in unbounded homogeneous domains, since it reduces the discretization to the domain boundaries, thereby condensing the computational complexity. The procedure…

Cross-View Topology-Aware Graph Representation Learning

arXiv:2512.02130v1 Announce Type: new Abstract: Graph classification has gained significant attention due to its applications in chemistry, social networks, and bioinformatics. While Graph Neural Networks (GNNs) effectively capture local structural patterns, they often overlook global topological features that are critical…

Efficient Turing Machine Simulation with Transformers

arXiv:2512.00003v2 Announce Type: replace-cross Abstract: Constant bit-size Transformers are known to be Turing complete, but existing constructions require $Omega(s(n))$ chain-of-thought (CoT) steps per simulated Turing machine (TM) step, leading to impractical reasoning lengths. In this paper, we significantly reduce this…

LSHBloom: Memory-efficient, Extreme-scale Document Deduplication

arXiv:2411.04257v3 Announce Type: replace Abstract: Contemporary large language model (LLM) training pipelines require the assembly of internet-scale databases full of text data from a variety of sources (e.g., web, academic, and publishers). Preprocessing these datasets via deduplication — detecting and…

CLEF: Clinically-Guided Contrastive Learning for Electrocardiogram Foundation Models

arXiv:2512.02180v1 Announce Type: new Abstract: The electrocardiogram (ECG) is a key diagnostic tool in cardiovascular health. Single-lead ECG recording is integrated into both clinical-grade and consumer wearables. While self-supervised pretraining of foundation models on unlabeled ECGs improves diagnostic performance, existing…