Archives AI News

On the Existence and Behaviour of Secondary Attention Sinks

arXiv:2512.22213v1 Announce Type: new Abstract: Attention sinks are tokens, often the beginning-of-sequence (BOS) token, that receive disproportionately high attention despite limited semantic relevance. In this work, we identify a class of attention sinks, which we term secondary sinks, that differ…

December 30, 2025

Investigation of the Impact of Synthetic Training Data in the Industrial Application of Terminal Strip Object Detection

arXiv:2403.04809v2 Announce Type: replace-cross Abstract: In industrial manufacturing, deploying deep learning models for visual inspection is mostly hindered by the high and often intractable cost of collecting and annotating large-scale training datasets. While image synthesis from 3D CAD models is…

December 30, 2025

Interpretable and Adaptive Node Classification on Heterophilic Graphs via Combinatorial Scoring and Hybrid Learning

arXiv:2512.22221v1 Announce Type: new Abstract: Graph neural networks (GNNs) achieve strong performance on homophilic graphs but often struggle under heterophily, where adjacent nodes frequently belong to different classes. We propose an interpretable and adaptive framework for semi-supervised node classification based…

December 30, 2025

Multivariate Conformal Prediction via Conformalized Gaussian Scoring

arXiv:2507.20941v2 Announce Type: replace-cross Abstract: While achieving exact conditional coverage in conformal prediction is unattainable without making strong, untestable regularity assumptions, the promise of conformal prediction hinges on finding approximations to conditional guarantees that are realizable in practice. A promising…

December 30, 2025

M”untz-Sz’asz Networks: Neural Architectures with Learnable Power-Law Bases

arXiv:2512.22222v1 Announce Type: new Abstract: Standard neural network architectures employ fixed activation functions (ReLU, tanh, sigmoid) that are poorly suited for approximating functions with singular or fractional power behavior, a structure that arises ubiquitously in physics, including boundary layers, fracture…

December 30, 2025

Geometry-Aware Optimization for Respiratory Sound Classification: Enhancing Sensitivity with SAM-Optimized Audio Spectrogram Transformers

arXiv:2512.22564v1 Announce Type: cross Abstract: Respiratory sound classification is hindered by the limited size, high noise levels, and severe class imbalance of benchmark datasets like ICBHI 2017. While Transformer-based models offer powerful feature extraction capabilities, they are prone to overfitting…

December 30, 2025

ReGAIN: Retrieval-Grounded AI Framework for Network Traffic Analysis

arXiv:2512.22223v1 Announce Type: new Abstract: Modern networks generate vast, heterogeneous traffic that must be continuously analyzed for security and performance. Traditional network traffic analysis systems, whether rule-based or machine learning-driven, often suffer from high false positives and lack interpretability, limiting…

December 30, 2025

ByteLoom: Weaving Geometry-Consistent Human-Object Interactions through Progressive Curriculum Learning

arXiv:2512.22854v1 Announce Type: cross Abstract: Human-object interaction (HOI) video generation has garnered increasing attention due to its promising applications in digital humans, e-commerce, advertising, and robotics imitation learning. However, existing methods face two critical limitations: (1) a lack of effective…

December 30, 2025

DiRL: An Efficient Post-Training Framework for Diffusion Language Models

arXiv:2512.22234v1 Announce Type: new Abstract: Diffusion Language Models (dLLMs) have emerged as promising alternatives to Auto-Regressive (AR) models. While recent efforts have validated their pre-training potential and accelerated inference speeds, the post-training landscape for dLLMs remains underdeveloped. Existing methods suffer…

December 30, 2025

Multi-Agent Framework for Threat Mitigation and Resilience in AI-Based Systems

arXiv:2512.23132v1 Announce Type: cross Abstract: Machine learning (ML) underpins foundation models in finance, healthcare, and critical infrastructure, making them targets for data poisoning, model extraction, prompt injection, automated jailbreaking, and preference-guided black-box attacks that exploit model comparisons. Larger models can…

December 30, 2025