Archives AI News

FinTRec: Transformer Based Unified Contextual Ads Targeting and Personalization for Financial Applications

arXiv:2511.14865v1 Announce Type: new Abstract: Transformer-based architectures are widely adopted in sequential recommendation systems, yet their application in Financial Services (FS) presents distinct practical and modeling challenges for real-time recommendation. These include:a) long-range user interactions (implicit and explicit) spanning both…

November 20, 2025

Empowering Multi-Turn Tool-Integrated Reasoning with Group Turn Policy Optimization

arXiv:2511.14846v1 Announce Type: new Abstract: Training Large Language Models (LLMs) for multi-turn Tool-Integrated Reasoning (TIR) – where models iteratively reason, generate code, and verify through execution – remains challenging for existing reinforcement learning (RL) approaches. Current RL methods, exemplified by…

November 20, 2025

Dynamic Nested Hierarchies: Pioneering Self-Evolution in Machine Learning Architectures for Lifelong Intelligence

arXiv:2511.14823v1 Announce Type: new Abstract: Contemporary machine learning models, including large language models, exhibit remarkable capabilities in static tasks yet falter in non-stationary environments due to rigid architectures that hinder continual adaptation and lifelong learning. Building upon the nested learning…

November 20, 2025

Explaining Time Series Classification Predictions via Causal Attributions

arXiv:2405.15871v2 Announce Type: replace Abstract: Despite the excelling performance of machine learning models, understanding their decisions remains a long-standing goal. Although commonly used attribution methods from explainable AI attempt to address this issue, they typically rely on associational rather than…

November 20, 2025

Integrating Causal Inference with Graph Neural Networks for Alzheimer’s Disease Analysis

arXiv:2511.14922v1 Announce Type: new Abstract: Deep graph learning has advanced Alzheimer’s (AD) disease classification from MRI, but most models remain correlational, confounding demographic and genetic factors with disease specific features. We present Causal-GCN, an interventional graph convolutional framework that integrates…

November 20, 2025

Streaming Generation of Co-Speech Gestures via Accelerated Rolling Diffusion

arXiv:2503.10488v3 Announce Type: replace Abstract: Generating co-speech gestures in real time requires both temporal coherence and efficient sampling. We introduce a novel framework for streaming gesture generation that extends Rolling Diffusion models with structured progressive noise scheduling, enabling seamless long-sequence…

November 20, 2025

How to Train Private Clinical Language Models: A Comparative Study of Privacy-Preserving Pipelines for ICD-9 Coding

arXiv:2511.14936v1 Announce Type: new Abstract: Large language models trained on clinical text risk exposing sensitive patient information, yet differential privacy (DP) methods often severely degrade the diagnostic accuracy needed for deployment. Despite rapid progress in DP optimisation and text generation,…

November 20, 2025

Coresets from Trajectories: Selecting Data via Correlation of Loss Differences

arXiv:2508.20230v2 Announce Type: replace Abstract: Deep learning models achieve state-of-the-art performance across domains but face scalability challenges in real-time or resource-constrained scenarios. To address this, we propose Correlation of Loss Differences (CLD), a simple and scalable metric for coreset selection…

November 20, 2025

Knowledge Graphs as Structured Memory for Embedding Spaces: From Training Clusters to Explainable Inference

arXiv:2511.14961v1 Announce Type: new Abstract: We introduce Graph Memory (GM), a structured non-parametric framework that augments embedding-based inference with a compact, relational memory over region-level prototypes. Rather than treating each training instance in isolation, GM summarizes the embedding space into…

November 20, 2025

Uncertainty Makes It Stable: Curiosity-Driven Quantized Mixture-of-Experts

arXiv:2511.11743v2 Announce Type: replace Abstract: Deploying deep neural networks on resource-constrained devices faces two critical challenges: maintaining accuracy under aggressive quantization while ensuring predictable inference latency. We present a curiosity-driven quantized Mixture-of-Experts framework that addresses both through Bayesian epistemic uncertainty-based…

November 20, 2025