Archives AI News

Collaborative Unlabeled Data Optimization

arXiv:2505.14117v2 Announce Type: replace Abstract: This paper pioneers a novel data-centric paradigm to maximize the utility of unlabeled data, tackling a critical question: How can we enhance the efficiency and sustainability of deep learning training by optimizing the data itself?…

Graph Diffusion Transformers are In-Context Molecular Designers

arXiv:2510.08744v1 Announce Type: new Abstract: In-context learning allows large models to adapt to new tasks from a few demonstrations, but it has shown limited success in molecular design. Existing databases such as ChEMBL contain molecular properties spanning millions of biological…

RFOD: Random Forest-based Outlier Detection for Tabular Data

arXiv:2510.08747v1 Announce Type: new Abstract: Outlier detection in tabular data is crucial for safeguarding data integrity in high-stakes domains such as cybersecurity, financial fraud detection, and healthcare, where anomalies can cause serious operational and economic impacts. Despite advances in both…

Conformal Risk Training: End-to-End Optimization of Conformal Risk Control

arXiv:2510.08748v1 Announce Type: new Abstract: While deep learning models often achieve high predictive accuracy, their predictions typically do not come with any provable guarantees on risk or reliability, which are critical for deployment in high-stakes applications. The framework of conformal…
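The conformal framework the abstract invokes can be illustrated with standard split conformal prediction (background only, not this paper's end-to-end training method): calibrate a quantile of residuals on held-out data, then issue intervals with a marginal coverage guarantee. The toy data and the stand-in `predict` function below are assumptions for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy 1-D regression data: y = 2x + noise.
x = rng.uniform(0, 1, 200)
y = 2 * x + rng.normal(0, 0.1, 200)

# Stand-in "model": the true mean function, playing the role of any fitted predictor.
predict = lambda x: 2 * x

# Split conformal: absolute residuals on a held-out calibration set.
x_cal, y_cal = x[:100], y[:100]
scores = np.abs(y_cal - predict(x_cal))

# Conformal quantile at miscoverage level alpha = 0.1.
alpha = 0.1
n = len(scores)
q = np.quantile(scores, np.ceil((n + 1) * (1 - alpha)) / n, method="higher")

# Intervals [predict(x) - q, predict(x) + q] cover y with probability >= 1 - alpha.
x_test, y_test = x[100:], y[100:]
covered = np.abs(y_test - predict(x_test)) <= q
```

The paper's contribution, per the abstract, is making such risk-control procedures differentiable end to end; this sketch only shows the vanilla calibration step they build on.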

Untangling Component Imbalance in Hybrid Linear Attention Conversion Methods

arXiv:2510.05901v2 Announce Type: replace Abstract: Transformers’ quadratic computational complexity limits their scalability despite remarkable performance. While linear attention reduces this to linear complexity, pre-training such models from scratch remains, in most cases, prohibitively expensive. Recent post-training linearisation methods convert pre-trained…
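The quadratic-vs-linear distinction in the abstract comes down to how attention is associated. A minimal sketch (no softmax, identity feature map, random toy tensors, all assumptions for illustration): standard attention materializes an n x n score matrix, costing O(n^2 d), while linear attention reassociates the same product as Q(K^T V), costing O(n d^2).

```python
import numpy as np

rng = np.random.default_rng(0)
n, d = 6, 4                        # sequence length, head dimension
Q = rng.normal(size=(n, d))
K = rng.normal(size=(n, d))
V = rng.normal(size=(n, d))

# Standard attention materializes the n x n matrix Q K^T: O(n^2 d).
quadratic = (Q @ K.T) @ V

# Linear attention reassociates the product: Q (K^T V) costs O(n d^2),
# linear in sequence length n.
linear = Q @ (K.T @ V)

# Without the softmax nonlinearity, the two orderings agree exactly;
# real linear-attention methods replace softmax with a feature map to
# make this reassociation valid.
assert np.allclose(quadratic, linear)
```

Post-training linearisation, as the abstract describes, converts a pre-trained softmax-attention model toward this cheaper form rather than training one from scratch.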

LOTION: Smoothing the Optimization Landscape for Quantized Training

arXiv:2510.08757v1 Announce Type: new Abstract: Optimizing neural networks for quantized objectives is fundamentally challenging because the quantizer is piecewise constant, yielding zero gradients everywhere except at quantization thresholds where the derivative is undefined. Most existing methods deal with this issue…
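The zero-gradient pathology the abstract describes is easy to see numerically with a uniform rounding quantizer (the quantizer and step size below are illustrative assumptions, not the paper's setup): a finite-difference derivative vanishes everywhere off the thresholds, so naive gradient descent gets no signal.

```python
import numpy as np

def quantize(w, step=0.5):
    """Uniform quantizer: piecewise constant in w, jumps at multiples of step/2."""
    return step * np.round(w / step)

# Central finite difference of the quantized value at a point between thresholds:
w, eps = 0.7, 1e-4
g = (quantize(w + eps) - quantize(w - eps)) / (2 * eps)
# g == 0.0: no descent direction anywhere except at the (measure-zero) thresholds.
```

This is the obstacle that smoothing-based approaches like the one announced here, or heuristics such as the straight-through estimator, are designed to get around.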