Archives AI News

Learning Tennis Strategy Through Curriculum-Based Dueling Double Deep Q-Networks

arXiv:2512.22186v1 Announce Type: new Abstract: Tennis strategy optimization is a challenging sequential decision-making problem involving hierarchical scoring, stochastic outcomes, long-horizon credit assignment, physical fatigue, and adaptation to opponent skill. I present a reinforcement learning framework that integrates a custom tennis…

Wireless Traffic Prediction with Large Language Model

arXiv:2512.22178v1 Announce Type: new Abstract: The growing demand for intelligent, adaptive resource management in next-generation wireless networks has underscored the importance of accurate and scalable wireless traffic prediction. While recent advancements in deep learning and foundation models such as large…

Regret-Based Federated Causal Discovery with Unknown Interventions

arXiv:2512.23626v1 Announce Type: cross Abstract: Most causal discovery methods recover a completed partially directed acyclic graph representing a Markov equivalence class from observational data. Recent work has extended these methods to federated settings to address data decentralization and privacy constraints,…

Efficient Offline Reinforcement Learning: First Imitate, then Improve

arXiv:2406.13376v2 Announce Type: replace Abstract: Supervised imitation-based approaches are often favored over off-policy reinforcement learning approaches for learning policies offline, since their straightforward optimization objective makes them computationally efficient and stable to train. However, their performance is fundamentally limited by…

Transformer Reconstructed with Dynamic Value Attention

arXiv:2512.22212v1 Announce Type: new Abstract: Since transformer was firstly published in 2017, several works have been proposed to optimize it. However, the major structure of transformer remains unchanged, ignoring one of its main intrinsic limitations, which is the same static…