Archives AI News

Differentiable, Bit-shifting, and Scalable Quantization without training neural network from scratch

arXiv:2510.16088v3 Announce Type: replace-cross Abstract: Quantization of neural networks provides benefits of inference in less compute and memory requirements. Previous work in quantization lack two important aspects which this work provides. First almost all previous work in quantization used a…

November 20, 2025

Decentralized Gaussian Process Classification and an Application in Subsea Robotics

arXiv:2511.15529v1 Announce Type: cross Abstract: Teams of cooperating autonomous underwater vehicles (AUVs) rely on acoustic communication for coordination, yet this communication medium is constrained by limited range, multi-path effects, and low bandwidth. One way to address the uncertainty associated with…

November 20, 2025

$pi^{*}_{0.6}$: a VLA That Learns From Experience

arXiv:2511.14759v2 Announce Type: replace Abstract: We study how vision-language-action (VLA) models can improve through real-world deployments via reinforcement learning (RL). We present a general-purpose method, RL with Experience and Corrections via Advantage-conditioned Policies (RECAP), that provides for RL training of…

November 20, 2025

Operator learning for energy-efficient building ventilation control with computational fluid dynamics simulation of a real-world classroom

arXiv:2504.21243v2 Announce Type: replace-cross Abstract: Energy-efficient ventilation control plays a vital role in reducing building energy consumption while ensuring occupant health and comfort. While Computational Fluid Dynamics (CFD) simulations provide detailed and physically accurate representation of indoor airflow, their high…

November 20, 2025

Energy-based generator matching: A neural sampler for general state space

arXiv:2505.19646v3 Announce Type: replace Abstract: We propose Energy-based generator matching (EGM), a modality-agnostic approach to train generative models from energy functions in the absence of data. Extending the recently proposed generator matching, EGM enables training of arbitrary continuous-time Markov processes,…

November 20, 2025

SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention

arXiv:2509.24006v2 Announce Type: replace Abstract: In Diffusion Transformer (DiT) models, particularly for video generation, attention latency is a major bottleneck due to the long sequence length and the quadratic complexity. We find that attention weights can be separated into two…

November 20, 2025

It’s LIT! Reliability-Optimized LLMs with Inspectable Tools

arXiv:2511.14903v1 Announce Type: new Abstract: Large language models (LLMs) have exhibited remarkable capabilities across various domains. The ability to call external tools further expands their capability to handle real-world tasks. However, LLMs often follow an opaque reasoning process, which limits…

November 20, 2025

Structured Contrastive Learning for Interpretable Latent Representations

arXiv:2511.14920v1 Announce Type: new Abstract: Neural networks exhibit severe brittleness to semantically irrelevant transformations. A mere 75ms electrocardiogram (ECG) phase shift degrades latent cosine similarity from 1.0 to 0.2, while sensor rotations collapse activity recognition performance with inertial measurement units…

November 20, 2025

Transformer-Guided Deep Reinforcement Learning for Optimal Takeoff Trajectory Design of an eVTOL Drone

arXiv:2511.14887v1 Announce Type: new Abstract: The rapid advancement of electric vertical take-off and landing (eVTOL) aircraft offers a promising opportunity to alleviate urban traffic congestion. Thus, developing optimal takeoff trajectories for minimum energy consumption becomes essential for broader eVTOL aircraft…

November 20, 2025

Bringing Federated Learning to Space

arXiv:2511.14889v1 Announce Type: new Abstract: As Low Earth Orbit (LEO) satellite constellations rapidly expand to hundreds and thousands of spacecraft, the need for distributed on-board machine learning becomes critical to address downlink bandwidth limitations. Federated learning (FL) offers a promising…

November 20, 2025