Archives AI News

Medical Test-free Disease Detection Based on Big Data

arXiv:2512.07856v1 Announce Type: new Abstract: Accurate disease detection is of paramount importance for effective medical treatment and patient care. However, the process of disease detection is often associated with extensive medical testing and considerable costs, making it impractical to perform…

GPU Memory Prediction for Multimodal Model Training

arXiv:2512.07853v1 Announce Type: new Abstract: As deep learning models in agentic AI systems grow in scale and complexity, GPU memory requirements increase and often exceed the available GPU memory capacity, so that out-of-memory (OoM) errors occur. It is well known…

HSTMixer: A Hierarchical MLP-Mixer for Large-Scale Traffic Forecasting

arXiv:2512.07854v1 Announce Type: new Abstract: Traffic forecasting task is significant to modern urban management. Recently, there is growing attention on large-scale forecasting, as it better reflects the complexity of real-world traffic networks. However, existing models often exhibit quadratic computational complexity,…

SABER: Small Actions, Big Errors – Safeguarding Mutating Steps in LLM Agents

arXiv:2512.07850v1 Announce Type: new Abstract: Despite rapid progress in LLM agents, performance on long-horizon, tool-using tasks remains fragile. To better understand this fragility, we ask a simple question: emph{do all actions contribute equally to failure?} Analyzing execution traces on $tau$-Bench…

Multicalibration for LLM-based Code Generation

arXiv:2512.08810v1 Announce Type: cross Abstract: As AI-based code generation becomes widespread, researchers are investigating the calibration of code LLMs – ensuring their confidence scores faithfully represent the true likelihood of code correctness. To do so, we investigate multicalibration, which can…

FAIM: Frequency-Aware Interactive Mamba for Time Series Classification

arXiv:2512.07858v1 Announce Type: new Abstract: Time series classification (TSC) is crucial in numerous real-world applications, such as environmental monitoring, medical diagnosis, and posture recognition. TSC tasks require models to effectively capture discriminative information for accurate class identification. Although deep learning…