Archives AI News

On the Existence and Behaviour of Secondary Attention Sinks

arXiv:2512.22213v1 Announce Type: new Abstract: Attention sinks are tokens, often the beginning-of-sequence (BOS) token, that receive disproportionately high attention despite limited semantic relevance. In this work, we identify a class of attention sinks, which we term secondary sinks, that differ…

Multivariate Conformal Prediction via Conformalized Gaussian Scoring

arXiv:2507.20941v2 Announce Type: replace-cross Abstract: While achieving exact conditional coverage in conformal prediction is unattainable without making strong, untestable regularity assumptions, the promise of conformal prediction hinges on finding approximations to conditional guarantees that are realizable in practice. A promising…

M”untz-Sz’asz Networks: Neural Architectures with Learnable Power-Law Bases

arXiv:2512.22222v1 Announce Type: new Abstract: Standard neural network architectures employ fixed activation functions (ReLU, tanh, sigmoid) that are poorly suited for approximating functions with singular or fractional power behavior, a structure that arises ubiquitously in physics, including boundary layers, fracture…

ReGAIN: Retrieval-Grounded AI Framework for Network Traffic Analysis

arXiv:2512.22223v1 Announce Type: new Abstract: Modern networks generate vast, heterogeneous traffic that must be continuously analyzed for security and performance. Traditional network traffic analysis systems, whether rule-based or machine learning-driven, often suffer from high false positives and lack interpretability, limiting…

DiRL: An Efficient Post-Training Framework for Diffusion Language Models

arXiv:2512.22234v1 Announce Type: new Abstract: Diffusion Language Models (dLLMs) have emerged as promising alternatives to Auto-Regressive (AR) models. While recent efforts have validated their pre-training potential and accelerated inference speeds, the post-training landscape for dLLMs remains underdeveloped. Existing methods suffer…

Multi-Agent Framework for Threat Mitigation and Resilience in AI-Based Systems

arXiv:2512.23132v1 Announce Type: cross Abstract: Machine learning (ML) underpins foundation models in finance, healthcare, and critical infrastructure, making them targets for data poisoning, model extraction, prompt injection, automated jailbreaking, and preference-guided black-box attacks that exploit model comparisons. Larger models can…