AI News Archives

EvoSyn: Generalizable Evolutionary Data Synthesis for Verifiable Learning

arXiv:2510.17928v1 Announce Type: new Abstract: Reliable verifiable data has become a key driver of capability gains in modern language models, enabling stable reinforcement learning with verifiable rewards and effective distillation that transfers competence across math, coding, and agentic tasks. Yet…

Understanding Differential Transformer Unchains Pretrained Self-Attentions

arXiv:2505.16333v3 Announce Type: replace Abstract: Differential Transformer has recently gained significant attention for its impressive empirical performance, often attributed to its ability to perform noise-canceled attention. However, precisely how differential attention achieves its empirical benefits remains poorly understood. Moreover,…
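For background on the mechanism this abstract refers to: differential attention, as defined in the original Differential Transformer paper, takes the difference of two softmax attention maps, with a learnable scalar λ weighting the second map. The minimal single-head sketch below illustrates that formula; tensor shapes and the fixed `lam` value are illustrative, not from this paper.

```python
import torch
import torch.nn.functional as F

def differential_attention(q1, k1, q2, k2, v, lam=0.5):
    """Single-head differential attention: the difference of two
    softmax attention maps, weighted by the scalar lam (lambda).
    q1, q2, k1, k2: (B, T, d); v: (B, T, d_v)."""
    d = q1.shape[-1]
    a1 = F.softmax(q1 @ k1.transpose(-2, -1) / d**0.5, dim=-1)
    a2 = F.softmax(q2 @ k2.transpose(-2, -1) / d**0.5, dim=-1)
    # Subtracting the second map is what gives the "noise-canceling" effect.
    return (a1 - lam * a2) @ v
```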

TPP-SD: Accelerating Transformer Point Process Sampling with Speculative Decoding

arXiv:2507.09252v3 Announce Type: replace Abstract: We propose TPP-SD, a novel approach that accelerates Transformer temporal point process (TPP) sampling by adapting speculative decoding (SD) techniques from language models. By identifying the structural similarities between thinning algorithms for TPPs and speculative…
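For context on the analogy the abstract draws, below is a minimal sketch of Ogata-style thinning, the classical TPP sampler whose propose-then-accept structure mirrors speculative decoding's draft-then-verify loop. The `intensity` callback and the constant upper bound `lam_bar` are illustrative assumptions, not details taken from the paper.

```python
import numpy as np

def ogata_thinning(intensity, lam_bar, t_end, rng=None):
    """Classic thinning: propose candidate event times from a homogeneous
    Poisson process with rate lam_bar (an upper bound on the true
    intensity), then accept each candidate with probability
    intensity(t, history) / lam_bar."""
    rng = rng or np.random.default_rng()
    t, events = 0.0, []
    while t < t_end:
        t += rng.exponential(1.0 / lam_bar)   # candidate arrival time
        if t >= t_end:
            break
        if rng.random() < intensity(t, events) / lam_bar:
            events.append(t)                  # candidate accepted as event
    return events
```

As in speculative decoding, cheap proposals are generated first and a more expensive criterion decides which ones survive, which is the structural similarity the paper exploits.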

Deep Edge Filter: Return of the Human-Crafted Layer in Deep Learning

arXiv:2510.13865v3 Announce Type: replace Abstract: We introduce the Deep Edge Filter, a novel approach that applies high-pass filtering to deep neural network features to improve model generalizability. Our method is motivated by our hypothesis that neural networks encode task-relevant semantic…
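The abstract says the method applies high-pass filtering to deep features. One plausible minimal rendering, assuming "high-pass" means subtracting a local average (low-pass) of the feature map, is sketched below; the paper's exact filter may differ.

```python
import torch
import torch.nn.functional as F

def high_pass_features(feat, kernel_size=3):
    """Hypothetical high-pass filter on a feature map: subtract a
    locally averaged (low-pass) copy, keeping the 'edge' component.
    feat: (B, C, H, W)."""
    low = F.avg_pool2d(feat, kernel_size, stride=1,
                       padding=kernel_size // 2)
    return feat - low
```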

Demystifying Transition Matching: When and Why It Can Beat Flow Matching

arXiv:2510.17991v1 Announce Type: new Abstract: Flow Matching (FM) underpins many state-of-the-art generative models, yet recent results indicate that Transition Matching (TM) can achieve higher quality with fewer sampling steps. This work answers the question of when and why TM outperforms…
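For reference, the Flow Matching baseline the paper compares against is commonly trained with the conditional objective below, using a linear interpolation path whose target velocity is x1 − x0. The model `v_theta` and batch conventions here are illustrative, not the paper's implementation.

```python
import torch

def flow_matching_loss(v_theta, x0, x1):
    """Conditional Flow Matching loss for the linear path
    x_t = (1 - t) * x0 + t * x1, whose target velocity is x1 - x0.
    v_theta(x, t) is the learned vector field; x0 is noise, x1 is data."""
    t = torch.rand(x0.shape[0], *([1] * (x0.dim() - 1)))  # per-sample time
    xt = (1 - t) * x0 + t * x1
    target = x1 - x0
    return ((v_theta(xt, t) - target) ** 2).mean()
```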