Archives AI News

FinTRec: Transformer Based Unified Contextual Ads Targeting and Personalization for Financial Applications

arXiv:2511.14865v1 Announce Type: new Abstract: Transformer-based architectures are widely adopted in sequential recommendation systems, yet their application in Financial Services (FS) presents distinct practical and modeling challenges for real-time recommendation. These include:a) long-range user interactions (implicit and explicit) spanning both…

November 20, 2025

Empowering Multi-Turn Tool-Integrated Reasoning with Group Turn Policy Optimization

arXiv:2511.14846v1 Announce Type: new Abstract: Training Large Language Models (LLMs) for multi-turn Tool-Integrated Reasoning (TIR) – where models iteratively reason, generate code, and verify through execution – remains challenging for existing reinforcement learning (RL) approaches. Current RL methods, exemplified by…

November 20, 2025

Dynamic Nested Hierarchies: Pioneering Self-Evolution in Machine Learning Architectures for Lifelong Intelligence

arXiv:2511.14823v1 Announce Type: new Abstract: Contemporary machine learning models, including large language models, exhibit remarkable capabilities in static tasks yet falter in non-stationary environments due to rigid architectures that hinder continual adaptation and lifelong learning. Building upon the nested learning…

November 20, 2025

Scientists get a first look at the innermost region of a white dwarf system

X-ray observations reveal surprising features of the dying star’s most energetic environment.

November 20, 2025

VisPlay: Self-Evolving Vision-Language Models from Images

arXiv:2511.15661v1 Announce Type: cross Abstract: Reinforcement learning (RL) provides a principled framework for improving Vision-Language Models (VLMs) on complex reasoning tasks. However, existing RL approaches often rely on human-annotated labels or task-specific heuristics to define verifiable rewards, both of which…

November 20, 2025

Differentiable, Bit-shifting, and Scalable Quantization without training neural network from scratch

arXiv:2510.16088v3 Announce Type: replace-cross Abstract: Quantization of neural networks provides benefits of inference in less compute and memory requirements. Previous work in quantization lack two important aspects which this work provides. First almost all previous work in quantization used a…

November 20, 2025

Decentralized Gaussian Process Classification and an Application in Subsea Robotics

arXiv:2511.15529v1 Announce Type: cross Abstract: Teams of cooperating autonomous underwater vehicles (AUVs) rely on acoustic communication for coordination, yet this communication medium is constrained by limited range, multi-path effects, and low bandwidth. One way to address the uncertainty associated with…

November 20, 2025

$pi^{*}_{0.6}$: a VLA That Learns From Experience

arXiv:2511.14759v2 Announce Type: replace Abstract: We study how vision-language-action (VLA) models can improve through real-world deployments via reinforcement learning (RL). We present a general-purpose method, RL with Experience and Corrections via Advantage-conditioned Policies (RECAP), that provides for RL training of…

November 20, 2025

Operator learning for energy-efficient building ventilation control with computational fluid dynamics simulation of a real-world classroom

arXiv:2504.21243v2 Announce Type: replace-cross Abstract: Energy-efficient ventilation control plays a vital role in reducing building energy consumption while ensuring occupant health and comfort. While Computational Fluid Dynamics (CFD) simulations provide detailed and physically accurate representation of indoor airflow, their high…

November 20, 2025

Energy-based generator matching: A neural sampler for general state space

arXiv:2505.19646v3 Announce Type: replace Abstract: We propose Energy-based generator matching (EGM), a modality-agnostic approach to train generative models from energy functions in the absence of data. Extending the recently proposed generator matching, EGM enables training of arbitrary continuous-time Markov processes,…

November 20, 2025