Archives AI News

The Impossibility of Inverse Permutation Learning in Transformer Models

arXiv:2509.24125v3 Announce Type: replace Abstract: In this technical note, we study the problem of inverse permutation learning in decoder-only transformers. Given a permutation and a string to which that permutation has been applied, the model is tasked with producing the…

StructuredDNA: A Bio-Physical Framework for Energy-Aware Transformer Routing

arXiv:2512.08968v1 Announce Type: new Abstract: The rapid scaling of large computational models has led to a critical increase in energy and compute costs. Inspired by biological systems where structure and function emerge from low-energy configurations, we introduce StructuredDNA, a sparse…

Peek-a-Boo Reasoning: Contrastive Region Masking in MLLMs

arXiv:2512.08976v1 Announce Type: new Abstract: We introduce Contrastive Region Masking (CRM), a training free diagnostic that reveals how multimodal large language models (MLLMs) depend on specific visual regions at each step of chain-of-thought (CoT) reasoning. Unlike prior approaches limited to…

Imitative Membership Inference Attack

arXiv:2509.06796v2 Announce Type: replace-cross Abstract: A Membership Inference Attack (MIA) assesses how much a target machine learning model reveals about its training data by determining whether specific query instances were part of the training set. State-of-the-art MIAs rely on training…

Function-on-Function Bayesian Optimization

arXiv:2511.12783v2 Announce Type: replace-cross Abstract: Bayesian optimization (BO) has been widely used to optimize expensive and gradient-free objective functions across various domains. However, existing BO methods have not addressed the objective where both inputs and outputs are functions, which increasingly…

Improving Multi-Class Calibration through Normalization-Aware Isotonic Techniques

arXiv:2512.09054v1 Announce Type: new Abstract: Accurate and reliable probability predictions are essential for multi-class supervised learning tasks, where well-calibrated models enable rational decision-making. While isotonic regression has proven effective for binary calibration, its extension to multi-class problems via one-vs-rest calibration…