Archives AI News

Mini-Batch Class Composition Bias in Link Prediction

arXiv:2604.25978v1 Announce Type: new Abstract: Prior work on node classification has shown that Graph Neural Networks (GNNs) can learn representations that transfer across graphs when the underlying graph properties are shared. For a fixed graph, one would then expect GNNs trained…

April 30, 2026

Rethinking KV Cache Eviction via a Unified Information-Theoretic Objective

arXiv:2604.25975v1 Announce Type: new Abstract: Key-value (KV) caching is essential for large language model inference, yet its memory overhead poses a critical bottleneck for long-context generation. Existing eviction policies predominantly rely on empirical heuristics, lacking a rigorous theoretical foundation. This…

April 30, 2026

A Survey of Multi-Agent Deep Reinforcement Learning with Graph Neural Network-Based Communication

arXiv:2604.25972v1 Announce Type: new Abstract: In multi-agent reinforcement learning (MARL), integrating a communication mechanism allows agents to better learn to coordinate their actions and converge on their objectives by sharing information. Based on an interaction graph, a subclass…

April 30, 2026

ComboStoc: Combinatorial Stochasticity for Diffusion Generative Models

arXiv:2405.13729v3 Announce Type: replace Abstract: In this paper, we study an under-explored but important factor of diffusion generative models, i.e., the combinatorial complexity. Data samples are generally high-dimensional, and for various structured generation tasks, additional attributes are combined to associate…

April 30, 2026

Privacy-Preserving Federated Learning Framework for Distributed Chemical Process Optimization

arXiv:2604.26073v1 Announce Type: new Abstract: Industrial chemical plants often operate under strict data confidentiality constraints, making centralized data-driven process modeling difficult. Federated learning (FL) provides a promising solution by enabling collaborative model training across distributed facilities without sharing raw operational…

April 30, 2026

A projection-based framework for gradient-free and parallel learning

arXiv:2506.05878v2 Announce Type: replace Abstract: We present a feasibility-seeking approach to neural network training. This mathematical optimization framework is distinct from conventional gradient-based loss minimization and uses projection operators and iterative projection algorithms. We reformulate training as a large-scale feasibility…

April 30, 2026

PPG-Based Affect Recognition with Long-Range Deep Models: A Measurement-Driven Comparison of CNN, Transformer, and Mamba Architectures

arXiv:2604.26078v1 Announce Type: new Abstract: Photoplethysmography (PPG) is increasingly used in wearable affective computing due to its low cost and ease of integration into consumer devices. Recent advances in deep learning have introduced long-range sequence models, such as Transformers, and…

April 30, 2026

Hybrid Quantum-Classical Ridgelet Neural Networks for Portfolio Optimization

arXiv:2601.03654v2 Announce Type: replace Abstract: In this study, we introduce a quantum computing method that incorporates Ridgelet transforms into quantum processing pipelines for financial time-series forecasting with Quantum Approximate Optimization Algorithm (QAOA)-based portfolio optimization. We propose a Quantum Ridgelet Neural…

April 30, 2026

Momentum-Conserving Graph Neural Networks for Deformable Objects

arXiv:2604.26097v1 Announce Type: new Abstract: Graph neural networks (GNNs) have emerged as a versatile and efficient option for modeling the dynamic behavior of deformable materials. While GNNs generalize readily to arbitrary shapes, mesh topologies, and material parameters, existing architectures struggle…

April 30, 2026

Awakening Dormant Experts: Counterfactual Routing to Mitigate MoE Hallucinations

arXiv:2604.14246v2 Announce Type: replace Abstract: Sparse Mixture-of-Experts (MoE) models have achieved remarkable scalability, yet they remain vulnerable to hallucinations, particularly when processing long-tail knowledge. We identify that this fragility stems from static Top-$k$ routing: routers tend to favor high-frequency patterns…

April 30, 2026