Archives AI News

Verbalized Sampling: How to Mitigate Mode Collapse and Unlock LLM Diversity

arXiv:2510.01171v2 Announce Type: replace-cross Abstract: Post-training alignment often reduces LLM diversity, leading to a phenomenon known as mode collapse. Unlike prior work that attributes this effect to algorithmic limitations, we identify a fundamental, pervasive data-level driver: typicality bias in preference…

Rare Text Semantics Were Always There in Your Diffusion Transformer

arXiv:2510.03886v1 Announce Type: new Abstract: Starting from flow- and diffusion-based transformers, Multi-modal Diffusion Transformers (MM-DiTs) have reshaped text-to-vision generation, gaining acclaim for exceptional visual fidelity. As these models advance, users continually push the boundary with imaginative or rare prompts, which…

Kantian-Utilitarian XAI: Meta-Explained

arXiv:2510.03892v1 Announce Type: new Abstract: We present a gamified explainable AI (XAI) system for ethically aware consumer decision-making in the coffee domain. Each session comprises six rounds with three options per round. Two symbolic engines provide real-time reasons: a Kantian…

Multilingual Routing in Mixture-of-Experts

arXiv:2510.04694v1 Announce Type: cross Abstract: Mixture-of-Experts (MoE) architectures have become the key to scaling modern LLMs, yet little is understood about how their sparse routing dynamics respond to multilingual data. In this work, we analyze expert routing patterns using parallel…

Quantifying Risks in Multi-turn Conversation with Large Language Models

arXiv:2510.03969v1 Announce Type: new Abstract: Large Language Models (LLMs) can produce catastrophic responses in conversational settings that pose serious risks to public safety and security. Existing evaluations often fail to fully reveal these vulnerabilities because they rely on fixed attack…

On Predicting Post-Click Conversion Rate via Counterfactual Inference

arXiv:2510.04816v1 Announce Type: cross Abstract: Accurately predicting conversion rate (CVR) is essential in various recommendation domains such as online advertising systems and e-commerce. These systems utilize user interaction logs, which consist of exposures, clicks, and conversions. CVR prediction models are…