Archives AI News

MoCA: Multi-modal Cross-masked Autoencoder for Digital Health Measurements

arXiv:2506.02260v3 Announce Type: replace Abstract: Wearable devices enable continuous multi-modal physiological and behavioral monitoring, yet analysis of these data streams faces fundamental challenges including the lack of gold-standard labels and incomplete sensor data. While self-supervised learning approaches have shown promise…

Beyond the Average: Distributional Causal Inference under Imperfect Compliance

arXiv:2509.15594v1 Announce Type: cross Abstract: We study the estimation of distributional treatment effects in randomized experiments with imperfect compliance. When participants do not adhere to their assigned treatments, we leverage treatment assignment as an instrumental variable to identify the local…

Information Geometry of Variational Bayes

arXiv:2509.15641v1 Announce Type: cross Abstract: We highlight a fundamental connection between information geometry and variational Bayes (VB) and discuss its consequences for machine learning. Under certain conditions, a VB solution always requires estimation or computation of natural gradients. We show…

A noise-corrected Langevin algorithm and sampling by half-denoising

arXiv:2410.05837v3 Announce Type: replace-cross Abstract: The Langevin algorithm is a classic method for sampling from a given pdf in a real space. In its basic version, it only requires knowledge of the gradient of the log-density, also called the score…

Generalization and Optimization of SGD with Lookahead

arXiv:2509.15776v1 Announce Type: cross Abstract: The Lookahead optimizer enhances deep learning models by employing a dual-weight update mechanism, which has been shown to improve the performance of underlying optimizers such as SGD. However, most theoretical studies focus on its convergence…

Transfer learning under latent space model

arXiv:2509.15797v1 Announce Type: cross Abstract: Latent space model plays a crucial role in network analysis, and accurate estimation of latent variables is essential for downstream tasks such as link prediction. However, the large number of parameters to be estimated presents…