Archives AI News

Pre-training Epidemic Time Series Forecasters with Compartmental Prototypes

arXiv:2502.03393v5 Announce Type: replace Abstract: Accurate epidemic forecasting is crucial for outbreak preparedness, but existing data-driven models are often brittle. Typically trained on a single pathogen, they struggle with data scarcity during new outbreaks and fail under distribution shifts caused…
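The "compartmental prototypes" in the title refer to classical compartment models of epidemic dynamics. As background (a standard textbook SIR model, not the paper's method), the dynamics can be simulated with a simple Euler step:

```python
def sir_step(s, i, r, beta, gamma, dt=1.0):
    """One Euler step of the classic SIR compartmental model.
    beta: transmission rate, gamma: recovery rate."""
    n = s + i + r
    new_inf = beta * s * i / n * dt   # S -> I flow
    new_rec = gamma * i * dt          # I -> R flow
    return s - new_inf, i + new_inf - new_rec, r + new_rec

# Simulate a small outbreak in a population of 1000.
s, i, r = 990.0, 10.0, 0.0
for _ in range(100):
    s, i, r = sir_step(s, i, r, beta=0.3, gamma=0.1)
```

The population total is conserved by construction, which is the defining property of a compartmental model: individuals only move between the S, I, and R compartments.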

On the Robustness of Kernel Goodness-of-Fit Tests

arXiv:2408.05854v5 Announce Type: replace-cross Abstract: Goodness-of-fit testing is often criticized for its lack of practical relevance: since “all models are wrong”, the null hypothesis that the data conform to our model is ultimately always rejected as the sample size grows.…
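For context, a common kernel goodness-of-fit statistic is the kernel Stein discrepancy (KSD), which needs only the score function of the null model. Below is a standard one-dimensional V-statistic KSD against a standard normal null with an RBF kernel (a textbook construction, not the robustified variant studied in the paper):

```python
import numpy as np

def ksd_gaussian_null(x, h=1.0):
    """V-statistic kernel Stein discrepancy of sample x against N(0,1),
    using an RBF kernel with bandwidth h."""
    x = np.asarray(x, dtype=float)
    d = x[:, None] - x[None, :]           # pairwise differences x_i - x_j
    k = np.exp(-d**2 / (2 * h**2))        # RBF kernel matrix
    sx = -x[:, None]                      # score of N(0,1): d/dx log p(x) = -x
    sy = -x[None, :]
    u = (sx * sy * k
         + sx * (d / h**2) * k            # s_p(x) * d/dy k(x, y)
         + sy * (-d / h**2) * k           # s_p(y) * d/dx k(x, y)
         + (1 / h**2 - d**2 / h**4) * k)  # d/dx d/dy k(x, y)
    return u.mean()

rng = np.random.default_rng(0)
on_model = ksd_gaussian_null(rng.standard_normal(300))        # data matches null
off_model = ksd_gaussian_null(rng.standard_normal(300) + 2.0) # shifted data
```

Data drawn from the null gives a statistic near zero, while mis-specified data inflates it; the abstract's point is that with enough samples even tiny model misspecifications eventually trigger rejection.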

Beyond the Ideal: Analyzing the Inexact Muon Update

arXiv:2510.19933v1 Announce Type: new Abstract: The Muon optimizer has rapidly emerged as a powerful, geometry-aware alternative to AdamW, demonstrating strong performance in large-scale training of neural networks. However, a critical theory-practice disconnect exists: Muon’s efficiency relies on fast, approximate orthogonalization,…
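The "fast, approximate orthogonalization" the abstract refers to is typically done with a Newton-Schulz iteration, which drives a matrix's singular values toward 1 using only matrix multiplies. A minimal illustrative sketch follows (plain cubic Newton-Schulz; Muon's tuned polynomial coefficients are not reproduced here):

```python
import numpy as np

def newton_schulz_orth(g, steps=5):
    """Approximately orthogonalize g via Newton-Schulz iteration.
    Frobenius normalization bounds the spectral norm by 1 so the
    iteration x <- 1.5*x - 0.5*x x^T x pushes singular values toward 1."""
    x = g / (np.linalg.norm(g) + 1e-7)
    for _ in range(steps):
        x = 1.5 * x - 0.5 * x @ x.T @ x
    return x

rng = np.random.default_rng(0)
g = rng.standard_normal((6, 6))
x = newton_schulz_orth(g)
```

A few steps suffice to make the update noticeably closer to orthogonal, which is exactly the inexactness the paper analyzes: the iteration is truncated rather than run to convergence.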

FINDER: Feature Inference on Noisy Datasets using Eigenspace Residuals

arXiv:2510.19917v1 Announce Type: new Abstract: "Noisy" datasets (regimes with low signal-to-noise ratios, small sample sizes, faulty data collection, etc.) remain a key research frontier for classification methods, with both theoretical and practical implications. We introduce FINDER, a rigorous…

FairGRPO: Fair Reinforcement Learning for Equitable Clinical Reasoning

arXiv:2510.19893v1 Announce Type: new Abstract: Medical artificial intelligence systems have achieved remarkable diagnostic capabilities, yet they consistently exhibit performance disparities across demographic groups, causing real-world harm to underrepresented populations. While recent multimodal reasoning foundation models have advanced clinical diagnosis through…
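The GRPO in the name is group relative policy optimization, whose core idea is replacing a learned value baseline with within-group reward standardization. A minimal sketch of that baseline step (core GRPO only; the paper's fairness-aware weighting is not reproduced here):

```python
import statistics

def group_relative_advantages(rewards):
    """Standardize each sampled response's reward against its group's
    mean and standard deviation, as in GRPO-style RL."""
    mu = statistics.fmean(rewards)
    sd = statistics.pstdev(rewards) or 1.0  # guard against zero variance
    return [(r - mu) / sd for r in rewards]

adv = group_relative_advantages([1.0, 0.0, 1.0, 0.0])
```

The advantages sum to zero within each group, so responses are rewarded only for outperforming their peers on the same prompt.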

From Large to Small: Transferring CUDA Optimization Expertise via Reasoning Graph

arXiv:2510.19873v1 Announce Type: new Abstract: Despite significant evolution of CUDA programming and domain-specific libraries, effectively utilizing GPUs with massively parallel engines remains difficult. Large language models (LLMs) show strong potential in generating optimized CUDA code from sequential code. However, using…