Archives AI News

Restoring Calibration for Aligned Large Language Models: A Calibration-Aware Fine-Tuning Approach

arXiv:2505.01997v3 Announce Type: replace Abstract: One of the key technologies for the success of Large Language Models (LLMs) is preference alignment. However, a notable side effect of preference alignment is poor calibration: while the pre-trained models are typically well-calibrated, LLMs…

October 17, 2025

MCbiF: Measuring Topological Autocorrelation in Multiscale Clusterings via 2-Parameter Persistent Homology

arXiv:2510.14710v1 Announce Type: cross Abstract: Datasets often possess an intrinsic multiscale structure with meaningful descriptions at different levels of coarseness. Such datasets are naturally described as multi-resolution clusterings, i.e., not necessarily hierarchical sequences of partitions across scales. To analyse and…

October 17, 2025

Stable but Miscalibrated: A Kantian View on Overconfidence from Filters to Large Language Models

arXiv:2510.14925v1 Announce Type: cross Abstract: We reinterpret Kant’s Critique of Pure Reason as a theory of feedback stability, viewing reason as a regulator that keeps inference within the bounds of possible experience. We formalize this intuition via a composite instability…

October 17, 2025

Weight Weaving: Parameter Pooling for Data-Free Model Merging

arXiv:2510.13921v1 Announce Type: new Abstract: Model merging provides a cost-effective and data-efficient combination of specialized deep neural networks through parameter integration. This technique leverages expert models across downstream tasks without requiring retraining. Most model merging approaches critically depend on scaling…

October 17, 2025

Symmetry-Aware GFlowNets

arXiv:2506.02685v3 Announce Type: replace-cross Abstract: Generative Flow Networks (GFlowNets) offer a powerful framework for sampling graphs in proportion to their rewards. However, existing approaches suffer from systematic biases due to inaccuracies in state transition probability computations. These biases, rooted in…

October 17, 2025

K-frames: Scene-Driven Any-k Keyframe Selection for long video understanding

arXiv:2510.13891v1 Announce Type: new Abstract: Multimodal Large Language Models (MLLMs) have demonstrated significant capabilities in image understanding, but long-video are constrained by context windows and computational cost. Uniform frame sampling often leads to substantial information loss. Meanwhile existing keyframe selection…

October 17, 2025

Multi-View Semi-Supervised Label Distribution Learning with Local Structure Complementarity

arXiv:2510.13917v1 Announce Type: new Abstract: Label distribution learning (LDL) is a paradigm that each sample is associated with a label distribution. At present, the existing approaches are proposed for the single-view LDL problem with labeled data, while the multi-view LDL…

October 17, 2025

Joint Discriminative-Generative Modeling via Dual Adversarial Training

arXiv:2510.13872v1 Announce Type: new Abstract: Simultaneously achieving robust classification and high-fidelity generative modeling within a single framework presents a significant challenge. Hybrid approaches, such as Joint Energy-Based Models (JEM), interpret classifiers as EBMs but are often limited by the instability…

October 17, 2025

CoLoR-GAN: Continual Few-Shot Learning with Low-Rank Adaptation in Generative Adversarial Networks

arXiv:2510.13869v1 Announce Type: new Abstract: Continual learning (CL) in the context of Generative Adversarial Networks (GANs) remains a challenging problem, particularly when it comes to learn from a few-shot (FS) samples without catastrophic forgetting. Current most effective state-of-the-art (SOTA) methods,…

October 17, 2025

Deep Edge Filter: Return of the Human-Crafted Layer in Deep Learning

arXiv:2510.13865v1 Announce Type: new Abstract: We introduce the Deep Edge Filter, a novel approach that applies high-pass filtering to deep neural network features to improve model generalizability. Our method is motivated by our hypothesis that neural networks encode task-relevant semantic…

October 17, 2025