SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention
arXiv:2509.24006v2 Announce Type: replace

Abstract: In Diffusion Transformer (DiT) models, particularly for video generation, attention latency is a major bottleneck due to long sequence lengths and the quadratic complexity of attention. We find that attention weights can be separated into two…
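The abstract contrasts quadratic-cost attention with cheaper alternatives. As background (not the paper's SLA kernel, which combines sparse and linear components), here is a minimal NumPy sketch of why standard softmax attention is O(n^2) in sequence length while kernelized linear attention avoids materializing the n-by-n score matrix; the feature map `phi` is an illustrative choice:

```python
import numpy as np

def softmax_attention(Q, K, V):
    # Standard attention: materializes an (n, n) score matrix -> O(n^2) in sequence length n.
    scores = Q @ K.T / np.sqrt(Q.shape[-1])
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V

def linear_attention(Q, K, V, phi=lambda x: np.maximum(x, 0.0) + 1e-6):
    # Kernelized linear attention: reassociates to phi(Q) @ (phi(K)^T V),
    # so the only matrix formed is (d, d_v), independent of n -> O(n) overall.
    Qp, Kp = phi(Q), phi(K)
    KV = Kp.T @ V                     # (d, d_v)
    Z = Qp @ Kp.sum(axis=0)           # per-query normalizer, shape (n,)
    return (Qp @ KV) / Z[:, None]

rng = np.random.default_rng(0)
n, d = 8, 4
Q, K, V = rng.standard_normal((n, d)), rng.standard_normal((n, d)), rng.standard_normal((n, d))
out_soft = softmax_attention(Q, K, V)    # exact, quadratic
out_lin = linear_attention(Q, K, V)      # approximate, linear
```

Both paths return an (n, d) output; linear attention trades exactness of the softmax weights for cost that scales linearly with sequence length, which is the efficiency axis the abstract is addressing.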
