Archives AI News

RewriteNets: End-to-End Trainable String-Rewriting for Generative Sequence Modeling

arXiv:2601.07868v1 Announce Type: new Abstract: Dominant sequence models like the Transformer represent structure implicitly through dense attention weights, incurring quadratic complexity. We propose RewriteNets, a novel neural architecture built on an alternative paradigm: explicit, parallel string rewriting. Each layer in…

RMBRec: Robust Multi-Behavior Recommendation towards Target Behaviors

arXiv:2601.08705v1 Announce Type: cross Abstract: Multi-behavior recommendation faces a critical challenge in practice: auxiliary behaviors (e.g., clicks, carts) are often noisy, weakly correlated, or semantically misaligned with the target behavior (e.g., purchase), which leads to biased preference learning and suboptimal…

Max-Min Neural Network Operators For Approximation of Multivariate Functions

arXiv:2601.07886v1 Announce Type: new Abstract: In this paper, we develop a multivariate framework for approximation by max-min neural network operators. Building on the recent advances in approximation theory by neural network operators, particularly, the univariate max-min operators, we propose and…

Attacks on fairness in Federated Learning

arXiv:2311.12715v3 Announce Type: replace Abstract: Federated Learning is an important emerging distributed training paradigm that keeps data private on clients. It is now well understood that by controlling only a small subset of FL clients, it is possible to introduce…

KVzap: Fast, Adaptive, and Faithful KV Cache Pruning

arXiv:2601.07891v1 Announce Type: new Abstract: Growing context lengths in transformer-based language models have made the key-value (KV) cache a critical inference bottleneck. While many KV cache pruning methods have been proposed, they have not yet been adopted in major inference…

SLogic: Subgraph-Informed Logical Rule Learning for Knowledge Graph Completion

arXiv:2510.00279v2 Announce Type: replace Abstract: Logical rule-based methods offer an interpretable approach to knowledge graph completion (KGC) by capturing compositional relationships in the form of human-readable inference rules. While existing logical rule-based methods learn rule confidence scores, they typically assign…

Revealing the Attention Floating Mechanism in Masked Diffusion Models

arXiv:2601.07894v1 Announce Type: new Abstract: Masked diffusion models (MDMs), which leverage bidirectional attention and a denoising process, are narrowing the performance gap with autoregressive models (ARMs). However, their internal attention mechanisms remain under-explored. This paper investigates the attention behaviors in…