Archives AI News

Momentum Guidance: Plug-and-Play Guidance for Flow Models

arXiv:2602.20360v1 Announce Type: new Abstract: Flow-based generative models have become a strong framework for high-quality generative modeling, yet pretrained models are rarely used in their vanilla conditional form: conditional samples without guidance often appear diffuse and lack fine-grained detail due…

Foundation Models for Causal Inference via Prior-Data Fitted Networks

arXiv:2506.10914v3 Announce Type: replace Abstract: Prior-data fitted networks (PFNs) have recently been proposed as a promising way to train tabular foundation models. PFNs are transformers that are pre-trained on synthetic data generated from a prespecified prior distribution and that enable…

Quantitative Approximation Rates for Group Equivariant Learning

arXiv:2602.20370v1 Announce Type: new Abstract: The universal approximation theorem establishes that neural networks can approximate any continuous function on a compact set. Later works in approximation theory provide quantitative approximation rates for ReLU networks on the class of $alpha$-H”older functions…

Monte Carlo Tree Diffusion with Multiple Experts for Protein Design

arXiv:2509.15796v2 Announce Type: replace Abstract: The goal of protein design is to generate amino acid sequences that fold into functional structures with desired properties. Prior methods combining autoregressive language models with Monte Carlo Tree Search (MCTS) struggle with long-range dependencies…

cc-Shapley: Measuring Multivariate Feature Importance Needs Causal Context

arXiv:2602.20396v1 Announce Type: new Abstract: Explainable artificial intelligence promises to yield insights into relevant features, thereby enabling humans to examine and scrutinize machine learning models or even facilitating scientific discovery. Considering the widespread technique of Shapley values, we find that…

TeamFormer: Shallow Parallel Transformers with Progressive Approximation

arXiv:2510.15425v2 Announce Type: replace Abstract: The widespread ‘deeper is better’ philosophy has driven the creation of architectures like ResNet and Transformer, which achieve high performance by stacking numerous layers. However, increasing model depth comes with challenges such as longer training…

GeoPT: Scaling Physics Simulation via Lifted Geometric Pre-Training

arXiv:2602.20399v1 Announce Type: new Abstract: Neural simulators promise efficient surrogates for physics simulation, but scaling them is bottlenecked by the prohibitive cost of generating high-fidelity training data. Pre-training on abundant off-the-shelf geometries offers a natural alternative, yet faces a fundamental…