Archives AI News

On Evolution-Based Models for Experimentation Under Interference

arXiv:2511.21675v1 Announce Type: cross Abstract: Causal effect estimation in networked systems is central to data-driven decision making. In such settings, interventions on one unit can spill over to others, and in complex physical or social systems, the interaction pathways driving…

Exploring Time-Step Size in Reinforcement Learning for Sepsis Treatment

arXiv:2511.20913v1 Announce Type: new Abstract: Existing studies on reinforcement learning (RL) for sepsis management have mostly followed an established problem setup, in which patient data are aggregated into 4-hour time steps. Although concerns have been raised regarding the coarseness of…

Single- vs. Dual-Policy Reinforcement Learning for Dynamic Bike Rebalancing

arXiv:2402.03589v2 Announce Type: replace Abstract: Bike-sharing systems (BSS) provide a sustainable urban mobility solution, but ensuring their reliability requires effective rebalancing strategies to address stochastic demand and prevent station imbalances. This paper proposes reinforcement learning (RL) algorithms for dynamic rebalancing…

Operationalizing Quantized Disentanglement

arXiv:2511.20927v1 Announce Type: new Abstract: Recent theoretical work established the unsupervised identifiability of quantized factors under any diffeomorphism. The theory assumes that quantization thresholds correspond to axis-aligned discontinuities in the probability density of the latent factors. By constraining a learned…

No Request Left Behind: Tackling Heterogeneity in Long-Context LLM Inference with Medha

arXiv:2409.17264v5 Announce Type: replace Abstract: Deploying million-token Large Language Models (LLMs) is challenging because production workloads are highly heterogeneous, mixing short queries and long documents. This heterogeneity, combined with the quadratic complexity of attention, creates severe convoy effects where long-running…

F-INR: Functional Tensor Decomposition for Implicit Neural Representations

arXiv:2503.21507v2 Announce Type: replace Abstract: Implicit Neural Representations (INRs) model signals as continuous, differentiable functions. However, monolithic INRs scale poorly with data dimensionality, leading to excessive training costs. We propose F-INR, a framework that addresses this limitation by factorizing a…

Dataset Poisoning Attacks on Behavioral Cloning Policies

arXiv:2511.20992v1 Announce Type: new Abstract: Behavior Cloning (BC) is a popular framework for training sequential decision policies from expert demonstrations via supervised learning. As these policies are increasingly being deployed in the real world, their robustness and potential vulnerabilities are…

A Gray-box Attack against Latent Diffusion Model-based Image Editing by Posterior Collapse

arXiv:2408.10901v4 Announce Type: replace-cross Abstract: Recent advancements in Latent Diffusion Models (LDMs) have revolutionized image synthesis and manipulation, raising significant concerns about data misappropriation and intellectual property infringement. While adversarial attacks have been extensively explored as a protective measure against…