Archives AI News

HADSF: Aspect Aware Semantic Control for Explainable Recommendation

arXiv:2510.26994v1 Announce Type: new Abstract: Recent advances in large language models (LLMs) promise more effective information extraction for review-based recommender systems, yet current methods still (i) mine free-form reviews without scope control, producing redundant and noisy representations, (ii) lack principled…

November 3, 2025

An All-Reduce Compatible Top-K Compressor for Communication-Efficient Distributed Learning

arXiv:2510.26709v2 Announce Type: replace Abstract: Communication remains a central bottleneck in large-scale distributed machine learning, and gradient sparsification has emerged as a promising strategy to alleviate this challenge. However, existing gradient compressors face notable limitations: Rand-$K$ discards structural information and…

November 3, 2025

Gradient Descent as Loss Landscape Navigation: a Normative Framework for Deriving Learning Rules

arXiv:2510.26997v1 Announce Type: new Abstract: Learning rules — prescriptions for updating model parameters to improve performance — are typically assumed rather than derived. Why do some learning rules work better than others, and under what assumptions can a given rule…

November 3, 2025

Generative Adversarial Networks for High-Dimensional Item Factor Analysis: A Deep Adversarial Learning Algorithm

arXiv:2502.10650v3 Announce Type: replace-cross Abstract: Advances in deep learning and representation learning have transformed item factor analysis (IFA) in the item response theory (IRT) literature by enabling more efficient and accurate parameter estimation. Variational Autoencoders (VAEs) have been one of…

November 3, 2025

A Framework for Fair Evaluation of Variance-Aware Bandit Algorithms

arXiv:2510.27001v1 Announce Type: new Abstract: Multi-armed bandit (MAB) problems serve as a fundamental building block for more complex reinforcement learning algorithms. However, evaluating and comparing MAB algorithms remains challenging due to the lack of standardized conditions and replicability. This is…

November 3, 2025

Conformal Object Detection by Sequential Risk Control

arXiv:2505.24038v2 Announce Type: replace-cross Abstract: Recent advances in object detectors have led to their adoption for industrial uses. However, their deployment in safety-critical applications is hindered by the inherent lack of reliability of neural networks and the complex structure of…

November 3, 2025

Jasmine: A Simple, Performant and Scalable JAX-based World Modeling Codebase

arXiv:2510.27002v1 Announce Type: new Abstract: While world models are increasingly positioned as a pathway to overcoming data scarcity in domains such as robotics, open training infrastructure for world modeling remains nascent. We introduce Jasmine, a performant JAX-based world modeling codebase…

November 3, 2025

MMEdge: Accelerating On-device Multimodal Inference via Pipelined Sensing and Encoding

arXiv:2510.25327v3 Announce Type: replace-cross Abstract: Real-time multimodal inference on resource-constrained edge devices is essential for applications such as autonomous driving, human-computer interaction, and mobile health. However, prior work often overlooks the tight coupling between sensing dynamics and model execution, as…

November 3, 2025

Mixture-of-Transformers Learn Faster: A Theoretical Study on Classification Problems

arXiv:2510.27004v1 Announce Type: new Abstract: Mixture-of-Experts (MoE) models improve transformer efficiency but lack a unified theoretical explanation, especially when both feed-forward and attention layers are allowed to specialize. To this end, we study the Mixture-of-Transformers (MoT), a tractable theoretical framework…

November 3, 2025

Accelerated Rates between Stochastic and Adversarial Online Convex Optimization

arXiv:2303.03272v2 Announce Type: replace Abstract: Stochastic and adversarial data are two widely studied settings in online learning. But many optimization tasks are neither i.i.d. nor fully adversarial, which makes it of fundamental interest to get a better theoretical understanding of…

November 3, 2025