
Small Vocabularies, Big Gains: Pretraining and Tokenization in Time Series Models

arXiv:2511.11622v1 Announce Type: new Abstract: Tokenization and transfer learning are two critical components in building state-of-the-art time series foundation models for forecasting. In this work, we systematically study the effect of tokenizer design, specifically scaling and quantization…

November 18, 2025

PERTINENCE: Input-based Opportunistic Neural Network Dynamic Execution

arXiv:2507.01695v2 Announce Type: replace Abstract: Deep neural networks (DNNs) have become ubiquitous thanks to their remarkable ability to model complex patterns across domains such as computer vision, speech recognition, and robotics. While large DNN models are often more accurate…

November 18, 2025

Early GVHD Prediction in Liver Transplantation via Multi-Modal Deep Learning on Imbalanced EHR Data

arXiv:2511.11623v1 Announce Type: new Abstract: Graft-versus-host disease (GVHD) is a rare but often fatal complication in liver transplantation, with a very high mortality rate. By harnessing multi-modal deep learning methods to integrate heterogeneous and imbalanced electronic health records (EHR), we…

November 18, 2025

Online Mixture of Experts: No-Regret Learning for Optimal Collective Decision-Making

arXiv:2510.21788v2 Announce Type: replace Abstract: We explore the use of expert-guided bandit learning, which we refer to as online mixture-of-experts (OMoE). In this setting, given a context, a candidate committee of experts must determine how to aggregate their outputs to…

November 18, 2025

MedFedPure: A Medical Federated Framework with MAE-based Detection and Diffusion Purification for Inference-Time Attacks

arXiv:2511.11625v1 Announce Type: new Abstract: Artificial intelligence (AI) has shown great potential in medical imaging, particularly for brain tumor detection using Magnetic Resonance Imaging (MRI). However, the models remain vulnerable at inference time when they are trained collaboratively through Federated…

November 18, 2025

Bigger datasets aren’t always better

MIT researchers developed a way to identify the smallest dataset that guarantees optimal solutions to complex problems.

November 18, 2025

Appa: Bending Weather Dynamics with Latent Diffusion Models for Global Data Assimilation

arXiv:2504.18720v2 Announce Type: replace Abstract: Deep learning has advanced weather forecasting, but accurate predictions first require identifying the current state of the atmosphere from observational data. In this work, we introduce Appa, a score-based data assimilation model generating global atmospheric…

November 18, 2025

The Shape of Data: Topology Meets Analytics. A Practical Introduction to Topological Analytics and the Stability Index (TSI) in Business

arXiv:2511.13503v1 Announce Type: cross Abstract: Modern business and economic datasets often exhibit nonlinear, multi-scale structures that traditional linear tools under-represent. Topological Data Analysis (TDA) offers a geometric lens for uncovering robust patterns, such as connected components, loops and voids, across…

November 18, 2025

Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint?

arXiv:2410.01623v3 Announce Type: replace Abstract: Low-rank training has emerged as a promising approach for reducing memory usage in training Large Language Models (LLMs). Previous methods either rely on decomposing weight matrices (e.g., LoRA), or seek to decompose gradient matrices (e.g.,…

November 18, 2025

Multi-Domain EEG Representation Learning with Orthogonal Mapping and Attention-based Fusion for Cognitive Load Classification

arXiv:2511.12394v1 Announce Type: cross Abstract: We propose a new representation learning solution for the classification of cognitive load based on Electroencephalogram (EEG). Our method integrates both time and frequency domains by first passing the raw EEG signals through the convolutional…

November 18, 2025