Efficient Offline Reinforcement Learning: First Imitate, then Improve
arXiv:2406.13376v2 Announce Type: replace

Abstract: Supervised imitation-based approaches are often favored over off-policy reinforcement learning for learning policies offline, since their straightforward optimization objective makes them computationally efficient and stable to train. However, their performance is fundamentally limited by…
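The "straightforward optimization objective" the abstract refers to can be sketched as plain supervised regression onto logged expert actions (behavior cloning). The linear policy, synthetic dataset, and learning rate below are illustrative assumptions, not the paper's actual setup:

```python
import numpy as np

# Minimal behavior-cloning sketch: fit a linear policy a = s @ W to logged
# (state, action) pairs with a mean-squared-error imitation objective.
rng = np.random.default_rng(0)
states = rng.normal(size=(128, 4))        # logged offline states (assumed)
expert_W = rng.normal(size=(4, 2))        # hypothetical expert policy
actions = states @ expert_W               # logged expert actions

W = np.zeros((4, 2))                      # policy parameters to learn

def bc_loss(W):
    # Supervised imitation objective: MSE between policy and logged actions.
    return float(np.mean((states @ W - actions) ** 2))

loss_before = bc_loss(W)
for _ in range(200):                      # plain gradient descent
    grad = 2.0 * states.T @ (states @ W - actions) / len(states)
    W -= 0.1 * grad
loss_after = bc_loss(W)
```

Because the loss is an ordinary supervised objective over a fixed dataset, training is as stable as standard regression, which is the computational appeal the abstract highlights; the limitation is that such a policy can at best match the behavior in the logged data.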
