Archives AI News

Financial Instruction Following Evaluation (FIFE)

arXiv:2512.08965v1 Announce Type: new Abstract: Language Models (LMs) struggle with complex, interdependent instructions, particularly in high-stakes domains like finance where precision is critical. We introduce FIFE, a novel, high-difficulty benchmark designed to assess LM instruction-following capabilities for financial analysis tasks.…

CluCERT: Certifying LLM Robustness via Clustering-Guided Denoising Smoothing

arXiv:2512.08967v1 Announce Type: new Abstract: Recent advancements in Large Language Models (LLMs) have led to their widespread adoption in daily applications. Despite their impressive capabilities, they remain vulnerable to adversarial attacks, as even minor meaning-preserving changes such as synonym substitutions…

The Impossibility of Inverse Permutation Learning in Transformer Models

arXiv:2509.24125v3 Announce Type: replace Abstract: In this technical note, we study the problem of inverse permutation learning in decoder-only transformers. Given a permutation and a string to which that permutation has been applied, the model is tasked with producing the…

StructuredDNA: A Bio-Physical Framework for Energy-Aware Transformer Routing

arXiv:2512.08968v1 Announce Type: new Abstract: The rapid scaling of large computational models has led to a critical increase in energy and compute costs. Inspired by biological systems where structure and function emerge from low-energy configurations, we introduce StructuredDNA, a sparse…

Peek-a-Boo Reasoning: Contrastive Region Masking in MLLMs

arXiv:2512.08976v1 Announce Type: new Abstract: We introduce Contrastive Region Masking (CRM), a training free diagnostic that reveals how multimodal large language models (MLLMs) depend on specific visual regions at each step of chain-of-thought (CoT) reasoning. Unlike prior approaches limited to…

Imitative Membership Inference Attack

arXiv:2509.06796v2 Announce Type: replace-cross Abstract: A Membership Inference Attack (MIA) assesses how much a target machine learning model reveals about its training data by determining whether specific query instances were part of the training set. State-of-the-art MIAs rely on training…