Archives AI News

Forecasting Supply Chain Disruptions with Foresight Learning

arXiv:2604.01298v1 Announce Type: new Abstract: Anticipating supply chain disruptions before they materialize is a core challenge for firms and policymakers alike. A key difficulty is learning to reason reliably about infrequent, high-impact events from noisy and unstructured inputs – a…

Coarsening Causal DAG Models

arXiv:2601.10531v2 Announce Type: replace-cross Abstract: Directed acyclic graphical (DAG) models are a powerful tool for representing causal relationships among jointly distributed random variables, especially concerning data from across different experimental settings. However, it is not always practical or desirable to…

Model Merging via Data-Free Covariance Estimation

arXiv:2604.01329v1 Announce Type: new Abstract: Model merging provides a way of cheaply combining individual models to produce a model that inherits each individual’s capabilities. While some merging methods can approach the performance of multitask training, they are often heuristically motivated…

Causal K-Means Clustering

arXiv:2405.03083v5 Announce Type: replace-cross Abstract: Causal effects are often characterized with population summaries. These might provide an incomplete picture when there are heterogeneous treatment effects across subgroups. Since the subgroup structure is typically unknown, it is more challenging to identify…

WFR-FM: Simulation-Free Dynamic Unbalanced Optimal Transport

arXiv:2601.06810v2 Announce Type: replace Abstract: The Wasserstein-Fisher-Rao (WFR) metric extends dynamic optimal transport (OT) by coupling displacement with change of mass, providing a principled geometry for modeling unbalanced snapshot dynamics. Existing WFR solvers, however, are often unstable, computationally expensive, and…

Regret Bounds for Reinforcement Learning from Multi-Source Imperfect Preferences

arXiv:2603.20453v2 Announce Type: replace Abstract: Reinforcement learning from human feedback (RLHF) replaces hard-to-specify rewards with pairwise trajectory preferences, yet regret-oriented theory often assumes that preference labels are generated consistently from a single ground-truth objective. In practical RLHF systems, however, feedback…

DiffGradCAM: A Universal Class Activation Map Resistant to Adversarial Training

arXiv:2506.08514v3 Announce Type: replace Abstract: Class Activation Mapping (CAM) and its gradient-based variants (e.g., GradCAM) have become standard tools for explaining Convolutional Neural Network (CNN) predictions. However, these approaches typically focus on individual logits, while for neural networks using softmax,…

Efficient and Principled Scientific Discovery through Bayesian Optimization: A Tutorial

arXiv:2604.01328v1 Announce Type: new Abstract: Traditional scientific discovery relies on an iterative hypothesise-experiment-refine cycle that has driven progress for centuries, but its intuitive, ad-hoc implementation often wastes resources, yields inefficient designs, and misses critical insights. This tutorial presents Bayesian Optimisation…

Risk-Aware Linear Bandits: Theory and Applications in Smart Order Routing

arXiv:2208.02389v3 Announce Type: replace Abstract: Motivated by practical considerations in machine learning for financial decision-making, such as risk aversion and large action space, we consider risk-aware bandits optimization with applications in smart order routing (SOR). Specifically, based on preliminary observations…