Archives AI News

Even Heads Fix Odd Errors: Mechanistic Discovery and Surgical Repair in Transformer Attention

Even Heads Fix Odd Errors: Mechanistic Discovery and Surgical Repair in Transformer Attention arXiv:2508.19414v1 Announce Type: new Abstract: We present a mechanistic case study of a format-dependent reasoning failure in Llama-3.1-8B-Instruct, where the model incorrectly judges “9.11” as larger than…

August 29, 2025

Generation of Geodesics with Actor-Critic Reinforcement Learning to Predict Midpoints

Generation of Geodesics with Actor-Critic Reinforcement Learning to Predict Midpoints arXiv:2407.01991v4 Announce Type: replace Abstract: To find the shortest paths for all pairs on manifolds with infinitesimally defined metrics, we introduce a framework to generate them by predicting midpoints recursively.…

August 29, 2025

Differentiable multiphase flow model for physics-informed machine learning in reservoir pressure management

Differentiable multiphase flow model for physics-informed machine learning in reservoir pressure management arXiv:2508.19419v1 Announce Type: new Abstract: Accurate subsurface reservoir pressure control is extremely challenging due to geological heterogeneity and multiphase fluid-flow dynamics. Predicting behavior in this setting relies on…

August 29, 2025

Stochastic Control for Fine-tuning Diffusion Models: Optimality, Regularity, and Convergence

Stochastic Control for Fine-tuning Diffusion Models: Optimality, Regularity, and Convergence arXiv:2412.18164v3 Announce Type: replace Abstract: Diffusion models have emerged as powerful tools for generative modeling, demonstrating exceptional capability in capturing target data distributions from large datasets. However, fine-tuning these massive…

August 29, 2025

MS-ConTab: Multi-Scale Contrastive Learning of Mutation Signatures for Pan Cancer Representation and Stratification

MS-ConTab: Multi-Scale Contrastive Learning of Mutation Signatures for Pan Cancer Representation and Stratification arXiv:2508.19424v1 Announce Type: new Abstract: Motivation. Understanding the pan-cancer mutational landscape offers critical insights into the molecular mechanisms underlying tumorigenesis. While patient-level machine learning techniques have been…

August 29, 2025

R-TPT: Improving Adversarial Robustness of Vision-Language Models through Test-Time Prompt Tuning

R-TPT: Improving Adversarial Robustness of Vision-Language Models through Test-Time Prompt Tuning arXiv:2504.11195v2 Announce Type: replace Abstract: Vision-language models (VLMs), such as CLIP, have gained significant popularity as foundation models, with numerous fine-tuning methods developed to enhance performance on downstream tasks.…

August 29, 2025

Data-Augmented Few-Shot Neural Stencil Emulation for System Identification of Computer Models

Data-Augmented Few-Shot Neural Stencil Emulation for System Identification of Computer Models arXiv:2508.19441v1 Announce Type: new Abstract: Partial differential equations (PDEs) underpin the modeling of many natural and engineered systems. It can be convenient to express such models as neural PDEs…

August 29, 2025

ProARD: progressive adversarial robustness distillation: provide wide range of robust students

ProARD: progressive adversarial robustness distillation: provide wide range of robust students arXiv:2506.07666v3 Announce Type: replace Abstract: Adversarial Robustness Distillation (ARD) has emerged as an effective method to enhance the robustness of lightweight deep neural networks against adversarial attacks. Current ARD…

August 29, 2025

Efficiently Generating Multidimensional Calorimeter Data with Tensor Decomposition Parameterization

Efficiently Generating Multidimensional Calorimeter Data with Tensor Decomposition Parameterization arXiv:2508.19443v1 Announce Type: new Abstract: Producing large complex simulation datasets can often be a time and resource consuming task. Especially when these experiments are very expensive, it is becoming more reasonable…

August 29, 2025

GTPO: Trajectory-Based Policy Optimization in Large Language Models

GTPO: Trajectory-Based Policy Optimization in Large Language Models arXiv:2508.03772v3 Announce Type: replace Abstract: Policy-based optimizations are widely adopted today for the training and alignment of language models, where one of the most recent and effective approaches is Group-relative Policy Optimization…

August 29, 2025