Archives AI News

Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning

arXiv:2301.11321v3 Announce Type: replace Abstract: Off-policy learning from multistep returns is crucial for sample-efficient reinforcement learning, but counteracting off-policy bias without exacerbating variance is challenging. Classically, off-policy bias is corrected in a per-decision manner: past temporal-difference errors are re-weighted by…

December 23, 2025

Enhancing Multi-Agent Collaboration with Attention-Based Actor-Critic Policies

arXiv:2507.22782v3 Announce Type: replace-cross Abstract: This paper introduces Team-Attention-Actor-Critic (TAAC), a reinforcement learning algorithm designed to enhance multi-agent collaboration in cooperative environments. TAAC employs a Centralized Training/Centralized Execution scheme incorporating multi-headed attention mechanisms in both the actor and critic. This…

December 23, 2025

Gabliteration: Adaptive Multi-Directional Neural Weight Modification for Selective Behavioral Alteration in Large Language Models

arXiv:2512.18901v1 Announce Type: cross Abstract: We present Gabliteration, a novel neural weight modification technique that advances beyond traditional abliteration methods by implementing adaptive multi-directional projections with regularized layer selection. Our approach addresses the fundamental limitation of existing methods that compromise…

December 23, 2025

Preparation of Fractal-Inspired Computational Architectures for Advanced Large Language Model Analysis

arXiv:2511.07329v2 Announce Type: replace Abstract: It introduces FractalNet, a fractal-inspired computational architectures for advanced large language model analysis that mainly challenges model diversity on a large scale in an efficient manner. The new set-up involves a template-driven generator, runner, and…

December 23, 2025

Multiperiodic Processes: Ergodic Sources with a Sublinear Entropy

arXiv:2302.09049v3 Announce Type: replace-cross Abstract: We construct multiperiodic processes — a simple example of stationary ergodic (but not mixing) processes over natural numbers that enjoy the vanishing entropy rate under a mild condition. Multiperiodic processes are supported on randomly shifted…

December 23, 2025

FedOAED: Federated On-Device Autoencoder Denoiser for Heterogeneous Data under Limited Client Availability

arXiv:2512.17986v1 Announce Type: new Abstract: Over the last few decades, machine learning (ML) and deep learning (DL) solutions have demonstrated their potential across many applications by leveraging large amounts of high-quality data. However, strict data-sharing regulations such as the General…

December 23, 2025

A Dataset and Benchmarks for Atrial Fibrillation Detection from Electrocardiograms of Intensive Care Unit Patients

arXiv:2512.18031v1 Announce Type: new Abstract: Objective: Atrial fibrillation (AF) is the most common cardiac arrhythmia experienced by intensive care unit (ICU) patients and can cause adverse health effects. In this study, we publish a labelled ICU dataset and benchmarks for…

December 23, 2025

A Hybrid Inductive-Transductive Network for Traffic Flow Imputation on Unsampled Locations

arXiv:2512.17984v1 Announce Type: new Abstract: Accurately imputing traffic flow at unsensed locations is difficult: loop detectors provide precise but sparse measurements, speed from probe vehicles is widely available yet only weakly correlated with flow, and nearby links often exhibit strong…

December 23, 2025

MoE-TransMov: A Transformer-based Model for Next POI Prediction in Familiar & Unfamiliar Movements

arXiv:2512.17985v1 Announce Type: new Abstract: Accurate prediction of the next point of interest (POI) within human mobility trajectories is essential for location-based services, as it enables more timely and personalized recommendations. In particular, with the rise of these approaches, studies…

December 23, 2025

Parameter-Efficient Fine-Tuning for HAR: Integrating LoRA and QLoRA into Transformer Models

arXiv:2512.17983v1 Announce Type: new Abstract: Human Activity Recognition is a foundational task in pervasive computing. While recent advances in self-supervised learning and transformer-based architectures have significantly improved HAR performance, adapting large pretrained models to new domains remains a practical challenge…

December 23, 2025