AMFT: Aligning LLM Reasoners by Meta-Learning the Optimal Imitation-Exploration Balance
arXiv:2508.06944v3 Announce Type: replace
Abstract: Large Language Models (LLMs) are typically fine-tuned for reasoning tasks through a two-stage pipeline of Supervised Fine-Tuning (SFT) followed by Reinforcement Learning (RL), a process fraught with catastrophic forgetting and suboptimal trade-offs between imitation and…
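The title suggests replacing the sequential SFT-then-RL pipeline with a single stage in which the weight on imitation versus exploration is itself learned. The sketch below illustrates that general idea only: one training step that blends a supervised cross-entropy loss with a REINFORCE-style RL loss through a learnable coefficient. All names (`blended_update`, `balance_logit`, the batch layouts) are illustrative assumptions, and the proper meta-update of the coefficient against a held-out objective is omitted; this is not the AMFT algorithm from the paper.

```python
# Hypothetical sketch of a blended imitation/exploration update, assuming a
# HuggingFace-style causal LM whose forward call returns `.logits`.
# This is NOT the AMFT method; it only shows a weighted combination of losses.
import torch
import torch.nn.functional as F


def blended_update(model, balance_logit, demo_batch, rollout_batch, optimizer):
    """One step mixing an SFT (imitation) loss and an RL (exploration) loss.

    balance_logit : scalar torch.nn.Parameter; sigmoid(balance_logit) weights SFT.
    demo_batch    : (input_ids, labels) from expert demonstrations.
    rollout_batch : (input_ids, sampled_tokens, sequence_rewards) from rollouts.
    """
    # Imitation term: token-level cross-entropy on demonstration data.
    demo_ids, demo_labels = demo_batch
    demo_logits = model(demo_ids).logits
    sft_loss = F.cross_entropy(
        demo_logits.view(-1, demo_logits.size(-1)),
        demo_labels.view(-1),
        ignore_index=-100,
    )

    # Exploration term: REINFORCE-style surrogate on sampled rollouts.
    roll_ids, roll_actions, rewards = rollout_batch
    roll_logits = model(roll_ids).logits
    logprobs = F.log_softmax(roll_logits, dim=-1)
    action_logprobs = logprobs.gather(-1, roll_actions.unsqueeze(-1)).squeeze(-1)
    rl_loss = -(action_logprobs.sum(dim=-1) * rewards).mean()

    # Blend the two objectives with a single learnable balance coefficient.
    # In a true meta-learning setup this coefficient would be optimized
    # against a separate meta/validation objective, not the blended loss.
    alpha = torch.sigmoid(balance_logit)
    loss = alpha * sft_loss + (1.0 - alpha) * rl_loss

    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item(), alpha.item()
```

A usage note under the same assumptions: `balance_logit` would be created as `torch.nn.Parameter(torch.zeros(()))` and registered with its own optimizer or update rule, so the imitation-exploration balance can shift over training rather than being fixed by hand.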
