Archives AI News

ADPO: Anchored Direct Preference Optimization

arXiv:2510.18913v1 Announce Type: new Abstract: Anchored Direct Preference Optimization (ADPO) is a unified framework that generalizes Direct Preference Optimization (DPO) with soft preferences, reference-policy anchoring, and groupwise extensions. While standard DPO assumes hard binary labels and pairwise comparisons, ADPO introduces:…

October 23, 2025

Disentanglement Beyond Static vs. Dynamic: A Benchmark and Evaluation Framework for Multi-Factor Sequential Representations

arXiv:2510.17313v2 Announce Type: replace Abstract: Learning disentangled representations in sequential data is a key goal in deep learning, with broad applications in vision, audio, and time series. While real-world data involves multiple interacting semantic factors over time, prior work has…

October 23, 2025

NeuroAda: Activating Each Neuron’s Potential for Parameter-Efficient Fine-Tuning

arXiv:2510.18940v1 Announce Type: new Abstract: Existing parameter-efficient fine-tuning (PEFT) methods primarily fall into two categories: addition-based and selective in-situ adaptation. The former, such as LoRA, introduce additional modules to adapt the model to downstream tasks, offering strong memory efficiency. However,…

October 23, 2025

A Comprehensive Benchmark for RNA 3D Structure-Function Modeling

arXiv:2503.21681v3 Announce Type: replace-cross Abstract: The relationship between RNA structure and function has recently attracted interest within the deep learning community, a trend expected to intensify as nucleic acid structure models advance. Despite this momentum, the lack of standardized, accessible…

October 23, 2025

Towards Universal Solvers: Using PGD Attack in Active Learning to Increase Generalizability of Neural Operators as Knowledge Distillation from Numerical PDE Solvers

arXiv:2510.18989v1 Announce Type: new Abstract: Nonlinear PDE solvers require fine space-time discretizations and local linearizations, leading to high memory cost and slow runtimes. Neural operators such as FNOs and DeepONets offer fast single-shot inference by learning function-to-function mappings and truncating…

October 23, 2025

ARM-FM: Automated Reward Machines via Foundation Models for Compositional Reinforcement Learning

arXiv:2510.14176v2 Announce Type: replace-cross Abstract: Reinforcement learning (RL) algorithms are highly sensitive to reward function specification, which remains a central challenge limiting their broad applicability. We present ARM-FM: Automated Reward Machines via Foundation Models, a framework for automated, compositional reward…

October 23, 2025

An Encode-then-Decompose Approach to Unsupervised Time Series Anomaly Detection on Contaminated Training Data–Extended Version

arXiv:2510.18998v1 Announce Type: new Abstract: Time series anomaly detection is important in modern large-scale systems and is applied in a variety of domains to analyze and monitor the operation of diverse systems. Unsupervised approaches have received widespread interest, as they…

October 23, 2025

LICO: Large Language Models for In-Context Molecular Optimization

arXiv:2406.18851v2 Announce Type: replace Abstract: Optimizing black-box functions is a fundamental problem in science and engineering. To solve this problem, many approaches learn a surrogate function that estimates the underlying objective from limited historical evaluations. Large Language Models (LLMs), with…

October 23, 2025

Prior-informed optimization of treatment recommendation via bandit algorithms trained on large language model-processed historical records

arXiv:2510.19014v1 Announce Type: new Abstract: Current medical practice depends on standardized treatment frameworks and empirical methodologies that neglect individual patient variations, leading to suboptimal health outcomes. We develop a comprehensive system integrating Large Language Models (LLMs), Conditional Tabular Generative Adversarial…

October 23, 2025

A recursive Bayesian neural network for constitutive modeling of sands under monotonic and cyclic loading

arXiv:2501.10088v2 Announce Type: replace Abstract: In geotechnical engineering, constitutive models are central to capturing soil behavior across diverse drainage conditions, stress paths, and loading histories. While data-driven deep learning (DL) approaches have shown promise as alternatives to traditional constitutive formulations,…

October 23, 2025