Archives AI News

Gradient Descent Algorithm Survey

arXiv:2511.20725v1 (new). Abstract: Focusing on the practical configuration needs of optimization algorithms in deep learning, this article concentrates on five major algorithms: SGD, Mini-batch SGD, Momentum, Adam, and Lion. It systematically analyzes the core advantages, limitations, and key…

Active Slice Discovery in Large Language Models

arXiv:2511.20713v1 (new). Abstract: Large Language Models (LLMs) often exhibit systematic errors on specific subsets of data, known as error slices. For instance, a slice can correspond to a certain demographic, where a model does poorly in identifying toxic…

Solving Diffusion Inverse Problems with Restart Posterior Sampling

arXiv:2511.20705v1 (new). Abstract: Inverse problems are fundamental to science and engineering, where the goal is to infer an underlying signal or state from incomplete or noisy measurements. Recent approaches employ diffusion models as powerful implicit priors for such…

Post-Pruning Accuracy Recovery via Data-Free Knowledge Distillation

arXiv:2511.20702v1 (new). Abstract: Model pruning is a widely adopted technique to reduce the computational complexity and memory footprint of Deep Neural Networks (DNNs). However, global unstructured pruning often leads to significant degradation in accuracy, typically necessitating fine-tuning on…