Archives AI News

Mechanistic Interpretability with Sparse Autoencoder Neural Operators

arXiv:2509.03738v3 Announce Type: replace Abstract: We introduce sparse autoencoder neural operators (SAE-NOs), a new class of sparse autoencoders that operate directly in infinite-dimensional function spaces. We generalize the linear representation hypothesis to a functional representation hypothesis, enabling concept learning beyond…

February 24, 2026

Adaptive Time Series Reasoning via Segment Selection

arXiv:2602.18645v1 Announce Type: new Abstract: Time series reasoning tasks often start with a natural language question and require targeted analysis of a time series. Evidence may span the full series or appear in a few short intervals, so the model…

February 24, 2026

The Unseen Frontier: Pushing the Limits of LLM Sparsity with Surrogate-Free ADMM

arXiv:2510.01650v2 Announce Type: replace Abstract: Neural network pruning is a promising technique to mitigate the excessive computational and memory requirements of large language models (LLMs). Despite its promise, however, progress in this area has diminished, as conventional methods are seemingly…

February 24, 2026

Information-Guided Noise Allocation for Efficient Diffusion Training

arXiv:2602.18647v1 Announce Type: new Abstract: Training diffusion models typically relies on manually tuned noise schedules, which can waste computation on weakly informative noise regions and limit transfer across datasets, resolutions, and representations. We revisit noise schedule allocation through an information-theoretic…

February 24, 2026

Leak@$k$: Unlearning Does Not Make LLMs Forget Under Probabilistic Decoding

arXiv:2511.04934v2 Announce Type: replace Abstract: Unlearning in large language models (LLMs) is critical for regulatory compliance and for building ethical generative AI systems that avoid producing private, toxic, illegal, or copyrighted content. Despite rapid progress, in this work we show…

February 24, 2026

More trees where they matter, please

An international study reveals disparities in urban shade levels, exacerbating the “heat island” effect in big cities.

February 24, 2026

Efficient Discriminative Joint Encoders for Large Scale Vision-Language Reranking

arXiv:2510.06820v2 Announce Type: replace-cross Abstract: Multimodal retrieval still leans on embedding-based models like CLIP for fast vector search over pre-computed image embeddings. Yet, unlike text retrieval, where joint-encoder rerankers are standard, comparable vision-language rerankers are largely absent. We find that…

February 24, 2026

Interpretable Failure Analysis in Multi-Agent Reinforcement Learning Systems

arXiv:2602.08104v2 Announce Type: replace-cross Abstract: Multi-Agent Reinforcement Learning (MARL) is increasingly deployed in safety-critical domains, yet methods for interpretable failure detection and attribution remain underdeveloped. We introduce a two-stage gradient-based framework that provides interpretable diagnostics for three critical failure analysis…

February 24, 2026

PhysE-Inv: A Physics-Encoded Inverse Modeling approach for Arctic Snow Depth Prediction

arXiv:2601.17074v2 Announce Type: replace Abstract: The accurate estimation of Arctic snow depth remains a critical time-varying inverse problem due to the extreme scarcity and noise inherent in associated sea ice parameters. Existing process-based and data-driven models are either highly sensitive…

February 24, 2026

Efficient Context Propagating Perceiver Architectures for Auto-Regressive Language Modeling

arXiv:2412.06106v3 Announce Type: replace-cross Abstract: One of the key challenges in Transformer architectures is the quadratic complexity of the attention mechanism, which limits the efficient processing of long sequences. Many recent research works have attempted to provide a reduction from…

February 24, 2026