Archives AI News

Learning General Policies with Policy Gradient Methods

arXiv:2512.19366v1 Announce Type: cross Abstract: While reinforcement learning methods have delivered remarkable results in a number of settings, generalization, i.e., the ability to produce policies that generalize in a reliable and systematic way, has remained a challenge. The problem of…

Graph Transformers: A Survey

arXiv:2407.09777v2 Announce Type: replace Abstract: Graph transformers are a recent advancement in machine learning, offering a new class of neural network models for graph-structured data. The synergy between transformers and graph learning demonstrates strong performance and versatility across various graph-related…

FairExpand: Individual Fairness on Graphs with Partial Similarity Information

arXiv:2512.18180v1 Announce Type: new Abstract: Individual fairness, which requires that similar individuals should be treated similarly by algorithmic systems, has become a central principle in fair machine learning. Individual fairness has garnered traction in graph representation learning due to its…

Focus on Likely Classes for Test-Time Prediction

arXiv:2505.03819v3 Announce Type: replace Abstract: We ask: Can focusing on likely classes of a single, in-domain sample improve model predictions? Prior work argued “no”. We put forward a novel rationale in favor of “yes”: Sharedness of features among classes indicates…

When Does Learning Renormalize? Sufficient Conditions for Power Law Spectral Dynamics

arXiv:2512.18209v1 Announce Type: new Abstract: Empirical power–law scaling has been widely observed across modern deep learning systems, yet its theoretical origins and scope of validity remain incompletely understood. The Generalized Resolution–Shell Dynamics (GRSD) framework models learning as spectral energy transport…

Low-Regret and Low-Complexity Learning for Hierarchical Inference

arXiv:2508.08985v3 Announce Type: replace Abstract: This work focuses on Hierarchical Inference (HI) in edge intelligence systems, where a compact Local-ML model on an end-device works in conjunction with a high-accuracy Remote-ML model on an edge-server. HI aims to reduce latency,…

Stable and Efficient Single-Rollout RL for Multimodal Reasoning

arXiv:2512.18215v1 Announce Type: new Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) has become a key paradigm to improve the reasoning capabilities of Multimodal Large Language Models (MLLMs). However, prevalent group-based algorithms such as GRPO require multi-rollout sampling for each prompt.…