Archives AI News

FedHFT: Efficient Federated Finetuning with Heterogeneous Edge Clients

arXiv:2510.14054v1 Announce Type: new Abstract: Fine-tuning pre-trained large language models (LLMs) has become a common practice for personalized natural language understanding (NLU) applications on downstream tasks and domain-specific datasets. However, there are two main challenges: (i) limited and/or heterogeneous data…
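The abstract truncates before FedHFT's actual method, so the following is only a generic sketch of the federated fine-tuning loop it builds on: clients compute local parameter updates, and a server aggregates them weighted by local dataset size (FedAvg-style), which is one common way to handle heterogeneous data volumes. All function names and the weighting scheme are illustrative assumptions, not FedHFT itself.

```python
# Hedged sketch: FedAvg-style aggregation of per-client fine-tuning
# updates, weighted by local dataset size. Not FedHFT's algorithm.
import numpy as np

def fedavg_aggregate(client_updates, client_sizes):
    """Average per-client parameter deltas, weighted by dataset size.

    client_updates: list of dicts, parameter name -> np.ndarray delta
    client_sizes:   list of local dataset sizes (clients may differ)
    """
    total = float(sum(client_sizes))
    aggregated = {}
    for name in client_updates[0]:
        aggregated[name] = sum(
            (n / total) * upd[name]
            for upd, n in zip(client_updates, client_sizes)
        )
    return aggregated

# Toy round: two clients, the second holding 3x more data.
updates = [
    {"w": np.array([1.0, 1.0])},
    {"w": np.array([3.0, 3.0])},
]
sizes = [1, 3]
merged = fedavg_aggregate(updates, sizes)
print(merged["w"])  # weighted mean: 0.25*1 + 0.75*3 = 2.5 per coordinate
```

In practice the aggregated delta would be applied to a shared pre-trained model (or to lightweight adapters) before the next round.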

LLM-guided Chemical Process Optimization with a Multi-Agent Approach

arXiv:2506.20921v2 Announce Type: replace Abstract: Chemical process optimization maximizes production efficiency and economic performance, but optimization algorithms, including gradient-based solvers, numerical methods, and parameter grid searches, become impractical when operating constraints are ill-defined or unavailable. We present a multi-agent LLM…

Beyond Linear Probes: Dynamic Safety Monitoring for Language Models

arXiv:2509.26238v2 Announce Type: replace Abstract: Monitoring the activations of large language models (LLMs) is an effective way to detect harmful requests before they lead to unsafe outputs. However, traditional safety monitors often require the same amount of compute for every query. This…
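The title's baseline, a linear probe on activations, plus the abstract's point about per-query compute, can be illustrated with a confidence-gated monitor: a cheap linear probe answers confidently easy queries, and only ambiguous ones are escalated to a heavier check. The probe weights, thresholds, and gating rule below are made-up assumptions for illustration, not the paper's method.

```python
# Hedged sketch: a linear probe over an activation vector, with a
# confidence gate so extra compute is spent only on ambiguous queries.
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def probe_score(activation, w, b):
    """Linear probe: predicted harmfulness probability for one query."""
    return sigmoid(activation @ w + b)

def monitor(activation, w, b, low=0.2, high=0.8):
    """Return a verdict when the probe is confident, else escalate."""
    p = probe_score(activation, w, b)
    if p >= high:
        return "unsafe"
    if p <= low:
        return "safe"
    return "escalate"  # hand off to a more expensive monitor

w = np.array([2.0, -1.0, 0.5])  # hypothetical trained probe weights
b = 0.0
print(monitor(np.array([3.0, 0.0, 0.0]), w, b))   # confidently unsafe
print(monitor(np.array([-3.0, 0.0, 0.0]), w, b))  # confidently safe
print(monitor(np.array([0.1, 0.0, 0.0]), w, b))   # ambiguous
```

The gate is what makes compute per query dynamic: most traffic exits at the linear probe, which is a single dot product.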

Weight Weaving: Parameter Pooling for Data-Free Model Merging

arXiv:2510.13921v1 Announce Type: new Abstract: Model merging provides a cost-effective and data-efficient combination of specialized deep neural networks through parameter integration. This technique leverages expert models across downstream tasks without requiring retraining. Most model merging approaches critically depend on scaling…
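The abstract cuts off at the dependence on scaling, but the family of methods it describes, merging experts by parameter integration with a scaling coefficient, can be sketched in the task-arithmetic style: pool each expert's delta from the shared base model, then add the scaled mean back. The `alpha` coefficient below is the scaling knob the abstract alludes to; everything else is an illustrative assumption, not Weight Weaving's procedure.

```python
# Hedged sketch: data-free model merging by pooling task vectors
# (expert minus base) and scaling the pooled delta by alpha.
import numpy as np

def merge_models(expert_params, base_params, alpha=1.0):
    """Merge experts without data: base + alpha * mean(expert - base)."""
    merged = {}
    for name, base in base_params.items():
        task_vectors = [p[name] - base for p in expert_params]
        merged[name] = base + alpha * np.mean(task_vectors, axis=0)
    return merged

# Toy example: two experts specialized along different directions.
base = {"w": np.array([0.0, 0.0])}
experts = [
    {"w": np.array([2.0, 0.0])},  # expert for task A
    {"w": np.array([0.0, 2.0])},  # expert for task B
]
merged = merge_models(experts, base, alpha=1.0)
print(merged["w"])  # mean task vector [1, 1] added to the base
```

Because no retraining data is involved, the choice of `alpha` is the main free parameter, which is why merging quality tends to hinge on how it is set.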

Symmetry-Aware GFlowNets

arXiv:2506.02685v3 Announce Type: replace-cross Abstract: Generative Flow Networks (GFlowNets) offer a powerful framework for sampling graphs in proportion to their rewards. However, existing approaches suffer from systematic biases due to inaccuracies in state transition probability computations. These biases, rooted in…
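The target a GFlowNet trains toward, sampling objects with probability proportional to their rewards, can be shown with a toy sampler over an enumerable set. A real GFlowNet learns this distribution through sequential graph construction (where the transition-probability biases the abstract mentions arise); here the normalization is done directly, and the reward values are invented for illustration.

```python
# Toy illustration of the GFlowNet sampling target: P(x) ~ R(x).
# Direct normalization over a small set; not a GFlowNet itself.
import random

def sample_proportional(rewards, rng):
    """Sample a key with probability reward / sum(rewards)."""
    total = sum(rewards.values())
    r = rng.random() * total
    cum = 0.0
    for obj, rew in rewards.items():
        cum += rew
        if r <= cum:
            return obj
    return obj  # guard against floating-point edge cases

rewards = {"g1": 1.0, "g2": 3.0}  # hypothetical graph rewards
rng = random.Random(0)
counts = {k: 0 for k in rewards}
for _ in range(10000):
    counts[sample_proportional(rewards, rng)] += 1
print(counts)  # empirical frequencies approach the 1:3 reward ratio
```

Symmetry-aware corrections matter precisely because, when states (e.g. isomorphic graphs) can be reached along multiple equivalent paths, a naive learned sampler deviates from this reward-proportional target.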