Archives AI News

Active Slice Discovery in Large Language Models

arXiv:2511.20713v1 Announce Type: new Abstract: Large Language Models (LLMs) often exhibit systematic errors on specific subsets of data, known as error slices. For instance, a slice can correspond to a certain demographic, where a model does poorly in identifying toxic…

Solving Diffusion Inverse Problems with Restart Posterior Sampling

arXiv:2511.20705v1 Announce Type: new Abstract: Inverse problems are fundamental to science and engineering, where the goal is to infer an underlying signal or state from incomplete or noisy measurements. Recent approaches employ diffusion models as powerful implicit priors for such…

Post-Pruning Accuracy Recovery via Data-Free Knowledge Distillation

arXiv:2511.20702v1 Announce Type: new Abstract: Model pruning is a widely adopted technique to reduce the computational complexity and memory footprint of Deep Neural Networks (DNNs). However, global unstructured pruning often leads to significant degradation in accuracy, typically necessitating fine-tuning on…

CHiQPM: Calibrated Hierarchical Interpretable Image Classification

arXiv:2511.20779v1 Announce Type: new Abstract: Globally interpretable models are a promising approach for trustworthy AI in safety-critical domains. Alongside global explanations, detailed local explanations are a crucial complement to effectively support human experts during inference. This work proposes the Calibrated…

TAB-DRW: A DFT-based Robust Watermark for Generative Tabular Data

arXiv:2511.21600v1 Announce Type: cross Abstract: The rise of generative AI has enabled the production of high-fidelity synthetic tabular data across fields such as healthcare, finance, and public policy, raising growing concerns about data provenance and misuse. Watermarking offers a promising…

Physics Steering: Causal Control of Cross-Domain Concepts in a Physics Foundation Model

arXiv:2511.20798v1 Announce Type: new Abstract: Recent advances in mechanistic interpretability have revealed that large language models (LLMs) develop internal representations corresponding not only to concrete entities but also distinct, human-understandable abstract concepts and behaviour. Moreover, these hidden features can be…