Archives AI News

Causal-Adapter: Taming Text-to-Image Diffusion for Faithful Counterfactual Generation

arXiv:2509.24798v3 Announce Type: replace-cross
Abstract: We present Causal-Adapter, a modular framework that adapts frozen text-to-image diffusion backbones for counterfactual image generation. Our method enables causal interventions on target attributes, consistently propagating their effects to causal dependents without altering the core…

October 6, 2025

Beyond the Final Answer: Evaluating the Reasoning Trajectories of Tool-Augmented Agents

arXiv:2510.02837v1 Announce Type: new
Abstract: Although recent tool-augmented benchmarks incorporate complex user requests and diverse tools, the evaluation methods for most of them remain limited to answer matching. However, as the number of steps required to resolve a user request…

October 6, 2025

Grounding Large Language Models in Clinical Evidence: A Retrieval-Augmented Generation System for Querying UK NICE Clinical Guidelines

arXiv:2510.02967v1 Announce Type: cross
Abstract: This paper presents the development and evaluation of a Retrieval-Augmented Generation (RAG) system for querying the United Kingdom’s National Institute for Health and Care Excellence (NICE) clinical guidelines using Large Language Models (LLMs). The extensive…

October 6, 2025

Take Goodhart Seriously: Principled Limit on General-Purpose AI Optimization

arXiv:2510.02840v1 Announce Type: new
Abstract: A common but rarely examined assumption in machine learning is that training yields models that actually satisfy their specified objective function. We call this the Objective Satisfaction Assumption (OSA). Although deviations from OSA are acknowledged,…

October 6, 2025

CHORD: Customizing Hybrid-precision On-device Model for Sequential Recommendation with Device-cloud Collaboration

arXiv:2510.03038v1 Announce Type: cross
Abstract: With the advancement of mobile device capabilities, deploying reranking models directly on devices has become feasible, enabling real-time contextual recommendations. When migrating models from cloud to devices, resource heterogeneity inevitably necessitates model compression. Recent quantization…

October 6, 2025

Reward Model Routing in Alignment

arXiv:2510.02850v1 Announce Type: new
Abstract: Reinforcement learning from human or AI feedback (RLHF / RLAIF) has become the standard paradigm for aligning large language models (LLMs). However, most pipelines rely on a single reward model (RM), limiting alignment quality and…

October 6, 2025

Distilled Protein Backbone Generation

arXiv:2510.03095v1 Announce Type: cross
Abstract: Diffusion- and flow-based generative models have recently demonstrated strong performance in protein backbone generation tasks, offering unprecedented capabilities for de novo protein design. However, while achieving notable performance in generation quality, these models are limited…

October 6, 2025

Consolidating Reinforcement Learning for Multimodal Discrete Diffusion Models

arXiv:2510.02880v1 Announce Type: new
Abstract: Optimizing a discrete diffusion model (DDM) with rewards remains a challenge: the non-autoregressive paradigm makes importance sampling intractable and rollout complex, puzzling reinforcement learning methods such as Group Relative Policy Optimization (GRPO). In this study, we…

October 6, 2025

Wave-GMS: Lightweight Multi-Scale Generative Model for Medical Image Segmentation

arXiv:2510.03216v1 Announce Type: cross
Abstract: For equitable deployment of AI tools in hospitals and healthcare facilities, we need Deep Segmentation Networks that offer high performance and can be trained on cost-effective GPUs with limited memory and large batch sizes. In…

October 6, 2025

Onto-Epistemological Analysis of AI Explanations

arXiv:2510.02996v1 Announce Type: new
Abstract: Artificial intelligence (AI) is being applied in almost every field. At the same time, the currently dominant deep learning methods are fundamentally black-box systems that lack explanations for their inferences, significantly limiting their trustworthiness and…

October 6, 2025