Archives AI News

The Unseen Frontier: Pushing the Limits of LLM Sparsity with Surrogate-Free ADMM

arXiv:2510.01650v2 Announce Type: replace Abstract: Neural network pruning is a promising technique to mitigate the excessive computational and memory requirements of large language models (LLMs). Despite its promise, however, progress in this area has diminished, as conventional methods are seemingly…

February 24, 2026

Information-Guided Noise Allocation for Efficient Diffusion Training

arXiv:2602.18647v1 Announce Type: new Abstract: Training diffusion models typically relies on manually tuned noise schedules, which can waste computation on weakly informative noise regions and limit transfer across datasets, resolutions, and representations. We revisit noise schedule allocation through an information-theoretic…

February 24, 2026

Leak@$k$: Unlearning Does Not Make LLMs Forget Under Probabilistic Decoding

arXiv:2511.04934v2 Announce Type: replace Abstract: Unlearning in large language models (LLMs) is critical for regulatory compliance and for building ethical generative AI systems that avoid producing private, toxic, illegal, or copyrighted content. Despite rapid progress, in this work we show…

February 24, 2026

Global Low-Rank, Local Full-Rank: The Holographic Encoding of Learned Algorithms

arXiv:2602.18649v1 Announce Type: new Abstract: Grokking — the abrupt transition from memorization to generalization after extended training — has been linked to the emergence of low-dimensional structure in learning dynamics. Yet neural network parameters inhabit extremely high-dimensional spaces. How can…

February 24, 2026

More trees where they matter, please

An international study reveals disparities in urban shade levels, exacerbating the “heat island” effect in big cities.

February 24, 2026

Efficient Discriminative Joint Encoders for Large Scale Vision-Language Reranking

arXiv:2510.06820v2 Announce Type: replace-cross Abstract: Multimodal retrieval still leans on embedding-based models like CLIP for fast vector search over pre-computed image embeddings. Yet, unlike text retrieval, where joint-encoder rerankers are standard, comparable vision-language rerankers are largely absent. We find that…

February 24, 2026

Interpretable Failure Analysis in Multi-Agent Reinforcement Learning Systems

arXiv:2602.08104v2 Announce Type: replace-cross Abstract: Multi-Agent Reinforcement Learning (MARL) is increasingly deployed in safety-critical domains, yet methods for interpretable failure detection and attribution remain underdeveloped. We introduce a two-stage gradient-based framework that provides interpretable diagnostics for three critical failure analysis…

February 24, 2026

PhysE-Inv: A Physics-Encoded Inverse Modeling approach for Arctic Snow Depth Prediction

arXiv:2601.17074v2 Announce Type: replace Abstract: The accurate estimation of Arctic snow depth remains a critical time-varying inverse problem due to the extreme scarcity and noise inherent in associated sea ice parameters. Existing process-based and data-driven models are either highly sensitive…

February 24, 2026

Efficient Context Propagating Perceiver Architectures for Auto-Regressive Language Modeling

arXiv:2412.06106v3 Announce Type: replace-cross Abstract: One of the key challenges in Transformer architectures is the quadratic complexity of the attention mechanism, which limits the efficient processing of long sequences. Many recent research works have attempted to provide a reduction from…

February 24, 2026

Graph Neural Networks Powered by Encoder Embedding for Improved Node Learning

arXiv:2507.11732v2 Announce Type: replace Abstract: Graph neural networks (GNNs) have emerged as a powerful framework for a wide range of node-level graph learning tasks. However, their performance typically depends on random or minimally informed initial feature representations, where poor initialization…

February 24, 2026