Archives AI News

Large Language Model-Empowered Decision Transformer for UAV-Enabled Data Collection

arXiv:2509.13934v2 Announce Type: replace-cross Abstract: The deployment of unmanned aerial vehicles (UAVs) for reliable and energy-efficient data collection from spatially distributed devices holds great promise in supporting diverse Internet of Things (IoT) applications. Nevertheless, the limited endurance and communication range…

Local Mechanisms of Compositional Generalization in Conditional Diffusion

arXiv:2509.16447v1 Announce Type: new Abstract: Conditional diffusion models appear capable of compositional generalization, i.e., generating convincing samples for out-of-distribution combinations of conditioners, but the mechanisms underlying this ability remain unclear. To make this concrete, we study length generalization, the ability…

Entropic Causal Inference: Graph Identifiability

arXiv:2509.16463v1 Announce Type: new Abstract: Entropic causal inference is a recent framework for learning the causal graph between two variables from observational data by finding the information-theoretically simplest structural explanation of the data, i.e., the model with smallest entropy. In…

Fr’echet Geodesic Boosting

arXiv:2509.18013v1 Announce Type: cross Abstract: Gradient boosting has become a cornerstone of machine learning, enabling base learners such as decision trees to achieve exceptional predictive performance. While existing algorithms primarily handle scalar or Euclidean outputs, increasingly prevalent complex-structured data, such…

Towards Universal Debiasing for Language Models-based Tabular Data Generation

arXiv:2509.16475v1 Announce Type: new Abstract: Large language models (LLMs) have achieved promising results in tabular data generation. However, inherent historical biases in tabular datasets often cause LLMs to exacerbate fairness issues, particularly when multiple advantaged and protected features are involved.…

Expressive Power of Graph Neural Networks for (Mixed-Integer) Quadratic Programs

arXiv:2406.05938v2 Announce Type: replace Abstract: Quadratic programming (QP) is the most widely applied category of problems in nonlinear programming. Many applications require real-time/fast solutions, though not necessarily with high precision. Existing methods either involve matrix decomposition or use the preconditioned…

Revisiting Broken Windows Theory

arXiv:2509.16490v1 Announce Type: new Abstract: We revisit the longstanding question of how physical structures in urban landscapes influence crime. Leveraging machine learning-based matching techniques to control for demographic composition, we estimate the effects of several types of urban structures on…

Test-Time Training Scaling Laws for Chemical Exploration in Drug Design

arXiv:2501.19153v3 Announce Type: replace Abstract: Chemical Language Models (CLMs) leveraging reinforcement learning (RL) have shown promise in de novo molecular design, yet often suffer from mode collapse, limiting their exploration capabilities. Inspired by Test-Time Training (TTT) in large language models,…