Archives AI News

Thompson Sampling via Fine-Tuning of LLMs

arXiv:2510.13328v2 Announce Type: replace Abstract: Bayesian optimization in large unstructured discrete spaces is often hindered by the computational cost of maximizing acquisition functions due to the absence of gradients. We propose a scalable alternative based on Thompson sampling that eliminates…
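The core appeal of Thompson sampling here is that choosing the next query reduces to drawing one posterior sample and taking an argmax over candidates, with no acquisition-function maximization. A minimal sketch of that loop step, using a Bayesian linear surrogate over a finite candidate set rather than the paper's LLM fine-tuning approach (function and parameter names are illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)

def thompson_step(X_cand, X_obs, y_obs, sigma2=0.1, tau2=1.0):
    """One Thompson-sampling step with a Bayesian linear surrogate.

    Draw a single weight vector from the Gaussian posterior and pick the
    candidate that maximizes the sampled linear model -- no gradient-based
    acquisition optimization is needed over the discrete space.
    """
    d = X_obs.shape[1]
    # Posterior over weights: N(mu, Sigma), with prior w ~ N(0, tau2 * I)
    # and observation noise variance sigma2.
    A = X_obs.T @ X_obs / sigma2 + np.eye(d) / tau2
    Sigma = np.linalg.inv(A)
    mu = Sigma @ X_obs.T @ y_obs / sigma2
    w = rng.multivariate_normal(mu, Sigma)   # single posterior sample
    return int(np.argmax(X_cand @ w))        # index of the chosen candidate
```

In a full loop, the chosen candidate is evaluated, appended to `(X_obs, y_obs)`, and the step repeats; the paper's contribution is making the sampling step scale to large unstructured discrete spaces via LLM fine-tuning.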

LTR-ICD: A Learning-to-Rank Approach for Automatic ICD Coding

arXiv:2510.13922v1 Announce Type: new Abstract: Clinical notes contain unstructured text provided by clinicians during patient encounters. These notes are usually accompanied by a sequence of diagnostic codes following the International Classification of Diseases (ICD). Correctly assigning and ordering ICD codes…
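Since the codes must be both assigned and ordered, a learning-to-rank formulation fits naturally. A toy pairwise hinge ranking loss, illustrating the general technique rather than the LTR-ICD model itself (names and the margin value are assumptions):

```python
def pairwise_ranking_loss(scores, relevance, margin=1.0):
    """Pairwise hinge loss for learning-to-rank (illustrative).

    For every pair where code i carries higher relevance than code j,
    penalize the model unless score[i] exceeds score[j] by the margin.
    """
    loss, pairs = 0.0, 0
    for i in range(len(scores)):
        for j in range(len(scores)):
            if relevance[i] > relevance[j]:
                loss += max(0.0, margin - (scores[i] - scores[j]))
                pairs += 1
    return loss / max(pairs, 1)   # average over ordered pairs
```

A perfect ranking whose score gaps meet the margin incurs zero loss; inverted rankings are penalized in proportion to how badly each pair is ordered.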

Uncertainty Quantification with the Empirical Neural Tangent Kernel

arXiv:2502.02870v2 Announce Type: replace-cross Abstract: While neural networks have demonstrated impressive performance across various tasks, accurately quantifying uncertainty in their predictions is essential to ensure their trustworthiness and enable widespread adoption in critical systems. Several Bayesian uncertainty quantification (UQ) methods…
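The empirical NTK underlying such methods is just the Gram matrix of per-example parameter gradients, K[i,j] = ∇θf(xᵢ)·∇θf(xⱼ). A small generic sketch, computing the gradients by central finite differences for clarity (not the paper's UQ procedure, and not how one would do this at scale):

```python
import numpy as np

def empirical_ntk(f, theta, X, eps=1e-5):
    """Empirical neural tangent kernel at parameters theta.

    K[i, j] = grad_theta f(x_i) . grad_theta f(x_j), with gradients
    approximated by central finite differences over each parameter.
    """
    n, p = len(X), len(theta)
    J = np.zeros((n, p))               # Jacobian: one gradient row per input
    for k in range(p):
        e = np.zeros(p)
        e[k] = eps
        J[:, k] = (np.array([f(x, theta + e) for x in X])
                   - np.array([f(x, theta - e) for x in X])) / (2 * eps)
    return J @ J.T                     # n x n kernel matrix
```

For a linear model f(x, θ) = θ·x the gradients are the inputs themselves, so the empirical NTK reduces to X Xᵀ, which makes a convenient sanity check.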

Distributional Consistency Loss: Beyond Pointwise Data Terms in Inverse Problems

arXiv:2510.13972v1 Announce Type: new Abstract: Recovering true signals from noisy measurements is a central challenge in inverse problems spanning medical imaging, geophysics, and signal processing. Current solutions balance prior assumptions regarding the true signal (regularization) with agreement to noisy measured…
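The idea of a distribution-level data term, as opposed to a pointwise one, can be illustrated with a toy loss that compares the empirical distribution of residuals against the assumed noise model. The statistic and parameter names below are illustrative assumptions, not the paper's loss:

```python
from math import erf, sqrt

import numpy as np

def distributional_loss(residuals, sigma=1.0):
    """Toy distribution-level data term (illustrative).

    Penalize mismatch between the empirical CDF of the residuals and the
    CDF of the assumed N(0, sigma^2) noise model, via a Cramer-von-Mises-
    style statistic, instead of penalizing each residual pointwise.
    """
    r = np.sort(np.asarray(residuals)) / sigma
    n = r.size
    # Standard normal CDF evaluated at each (sorted) normalized residual.
    F = np.array([0.5 * (1.0 + erf(x / sqrt(2.0))) for x in r])
    ecdf = (np.arange(1, n + 1) - 0.5) / n   # empirical CDF at those points
    return float(np.mean((F - ecdf) ** 2))
```

Residuals that genuinely look like the assumed noise give a small loss, while systematically biased residuals (which a pointwise term of the right average magnitude might tolerate) are heavily penalized.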

LLM-guided Chemical Process Optimization with a Multi-Agent Approach

arXiv:2506.20921v2 Announce Type: replace Abstract: Chemical process optimization maximizes production efficiency and economic performance, but optimization algorithms, including gradient-based solvers, numerical methods, and parameter grid searches, become impractical when operating constraints are ill-defined or unavailable. We present a multi-agent LLM…

Beyond Linear Probes: Dynamic Safety Monitoring for Language Models

arXiv:2509.26238v2 Announce Type: replace Abstract: Monitoring large language models’ (LLMs) activations is an effective way to detect harmful requests before they lead to unsafe outputs. However, traditional safety monitors often require the same amount of compute for every query. This…
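The fixed-compute limitation can be made concrete with a sketch of a two-stage monitor: a cheap linear probe on activations decides confidently scored queries outright and escalates only uncertain ones to a costlier check. This is an illustration of the general idea, assuming hypothetical probe weights and thresholds rather than the paper's method:

```python
import numpy as np

def probe_score(h, w, b):
    """Linear probe on an activation vector h: sigmoid(w . h + b)."""
    return 1.0 / (1.0 + np.exp(-(h @ w + b)))

def dynamic_monitor(h, w, b, expensive_check, low=0.2, high=0.8):
    """Two-stage safety monitor (illustrative).

    Confident probe scores resolve the query immediately; only scores in
    the uncertain band spend extra compute on a costlier check.
    """
    s = probe_score(h, w, b)
    if s >= high:
        return True              # confidently flagged as harmful
    if s <= low:
        return False             # confidently passed as safe
    return expensive_check(h)    # escalate only when the probe is unsure
```

Since most queries tend to be easy, the expensive stage runs rarely, so average monitoring cost stays close to that of the linear probe alone.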