Archives AI News

DeepBlip: Estimating Conditional Average Treatment Effects Over Time

arXiv:2511.14545v1 Announce Type: cross Abstract: Structural nested mean models (SNMMs) are a principled approach to estimate the treatment effects over time. A particular strength of SNMMs is to break the joint effect of treatment sequences over time into localized, time-specific…

Derivative of the truncated singular value and eigen decomposition

arXiv:2511.14651v1 Announce Type: cross Abstract: Recently developed applications in the field of machine learning and computational physics rely on automatic differentiation techniques, that require stable and efficient linear algebra gradient computations. This technical note provides a comprehensive and detailed discussion…

Beat the long tail: Distribution-Aware Speculative Decoding for RL Training

arXiv:2511.13841v1 Announce Type: new Abstract: Reinforcement learning(RL) post-training has become essential for aligning large language models (LLMs), yet its efficiency is increasingly constrained by the rollout phase, where long trajectories are generated token by token. We identify a major bottleneck:the…

Predicting the Performance of Black-box LLMs through Self-Queries

arXiv:2501.01558v3 Announce Type: replace Abstract: As large language models (LLMs) are increasingly relied on in AI systems, predicting when they make mistakes is crucial. While a great deal of work in the field uses internal representations to interpret model behavior,…

Tractable Probabilistic Models for Investment Planning

arXiv:2511.13888v1 Announce Type: new Abstract: Investment planning in power utilities, such as generation and transmission expansion, requires decade-long forecasts under profound uncertainty. Forecasting of energy mix and energy use decades ahead is nontrivial. Classical approaches focus on generating a finite…

A Survey of Cross-domain Graph Learning: Progress and Future Directions

arXiv:2503.11086v2 Announce Type: replace Abstract: Graph learning plays a vital role in mining and analyzing complex relationships within graph data and has been widely applied to real-world scenarios such as social, citation, and e-commerce networks. Foundation models in computer vision…