Archives AI News

Causal-Adapter: Taming Text-to-Image Diffusion for Faithful Counterfactual Generation

arXiv:2509.24798v3 Announce Type: replace-cross Abstract: We present Causal-Adapter, a modular framework that adapts frozen text-to-image diffusion backbones for counterfactual image generation. Our method enables causal interventions on target attributes, consistently propagating their effects to causal dependents without altering the core…

Take Goodhart Seriously: Principled Limit on General-Purpose AI Optimization

arXiv:2510.02840v1 Announce Type: new Abstract: A common but rarely examined assumption in machine learning is that training yields models that actually satisfy their specified objective function. We call this the Objective Satisfaction Assumption (OSA). Although deviations from OSA are acknowledged,…

Reward Model Routing in Alignment

arXiv:2510.02850v1 Announce Type: new Abstract: Reinforcement learning from human or AI feedback (RLHF / RLAIF) has become the standard paradigm for aligning large language models (LLMs). However, most pipelines rely on a single reward model (RM), limiting alignment quality and…

Distilled Protein Backbone Generation

arXiv:2510.03095v1 Announce Type: cross Abstract: Diffusion- and flow-based generative models have recently demonstrated strong performance in protein backbone generation tasks, offering unprecedented capabilities for de novo protein design. However, while achieving notable performance in generation quality, these models are limited…

Consolidating Reinforcement Learning for Multimodal Discrete Diffusion Models

arXiv:2510.02880v1 Announce Type: new Abstract: Optimizing discrete diffusion model (DDM) with rewards remains a challenge: the non-autoregressive paradigm makes importance sampling intractable and rollout complex, puzzling reinforcement learning methods such as Group Relative Policy Optimization (GRPO). In this study, we…

Onto-Epistemological Analysis of AI Explanations

arXiv:2510.02996v1 Announce Type: new Abstract: Artificial intelligence (AI) is being applied in almost every field. At the same time, the currently dominant deep learning methods are fundamentally black-box systems that lack explanations for their inferences, significantly limiting their trustworthiness and…