Archives AI News

MoE-TransMov: A Transformer-based Model for Next POI Prediction in Familiar & Unfamiliar Movements

arXiv:2512.17985v1 Announce Type: new Abstract: Accurate prediction of the next point of interest (POI) within human mobility trajectories is essential for location-based services, as it enables more timely and personalized recommendations. In particular, with the rise of these approaches, studies…

December 23, 2025

Parameter-Efficient Fine-Tuning for HAR: Integrating LoRA and QLoRA into Transformer Models

arXiv:2512.17983v1 Announce Type: new Abstract: Human Activity Recognition is a foundational task in pervasive computing. While recent advances in self-supervised learning and transformer-based architectures have significantly improved HAR performance, adapting large pretrained models to new domains remains a practical challenge…

December 23, 2025

CodeGEMM: A Codebook-Centric Approach to Efficient GEMM in Quantized LLMs

arXiv:2512.17970v1 Announce Type: new Abstract: Weight-only quantization is widely used to mitigate the memory-bound nature of LLM inference. Codebook-based methods extend this trend by achieving strong accuracy in the extremely low-bit regime (e.g., 2-bit). However, current kernels rely on dequantization,…

December 23, 2025

Convolutional-neural-operator-based transfer learning for solving PDEs

arXiv:2512.17969v1 Announce Type: new Abstract: Convolutional neural operator is a CNN-based architecture recently proposed to enforce structure-preserving continuous-discrete equivalence and enable the genuine, alias-free learning of solution operators of PDEs. This neural operator was demonstrated to outperform for certain cases…

December 23, 2025

MOORL: A Framework for Integrating Offline-Online Reinforcement Learning

arXiv:2506.09574v2 Announce Type: replace Abstract: Sample efficiency and exploration remain critical challenges in Deep Reinforcement Learning (DRL), particularly in complex domains. Offline RL, which enables agents to learn optimal policies from static, pre-collected datasets, has emerged as a promising alternative.…

December 23, 2025

Towards Benchmarking Privacy Vulnerabilities in Selective Forgetting with Large Language Models

arXiv:2512.18035v1 Announce Type: new Abstract: The rapid advancements in artificial intelligence (AI) have primarily focused on the process of learning from data to acquire knowledgeable learning systems. As these systems are increasingly deployed in critical areas, ensuring their privacy and…

December 23, 2025

GenUQ: Predictive Uncertainty Estimates via Generative Hyper-Networks

arXiv:2509.21605v2 Announce Type: replace Abstract: Operator learning is a recently developed generalization of regression to mappings between functions. It promises to drastically reduce expensive numerical integration of PDEs to fast evaluations of mappings between functional states of a system, i.e.,…

December 23, 2025

Probabilistic Digital Twins of Users: Latent Representation Learning with Statistically Validated Semantics

arXiv:2512.18056v1 Announce Type: new Abstract: Understanding user identity and behavior is central to applications such as personalization, recommendation, and decision support. Most existing approaches rely on deterministic embeddings or black-box predictive models, offering limited uncertainty quantification and little insight into…

December 23, 2025

Renormalizable Spectral-Shell Dynamics as the Origin of Neural Scaling Laws

arXiv:2512.10427v3 Announce Type: replace Abstract: Neural scaling laws and double-descent phenomena suggest that deep-network training obeys a simple macroscopic structure despite highly nonlinear optimization dynamics. We derive such structure directly from gradient descent in function space. For mean-squared error loss,…

December 23, 2025

Microstructure-based Variational Neural Networks for Robust Uncertainty Quantification in Materials Digital Twins

arXiv:2512.18104v1 Announce Type: new Abstract: Aleatoric uncertainties – irremovable variability in microstructure morphology, constituent behavior, and processing conditions – pose a major challenge to developing uncertainty-robust digital twins. We introduce the Variational Deep Material Network (VDMN), a physics-informed surrogate model…

December 23, 2025