Archives AI News

Hierarchical Molecular Representation Learning via Fragment-Based Self-Supervised Embedding Prediction

arXiv:2602.20344v1 Announce Type: new Abstract: Graph self-supervised learning (GSSL) has demonstrated strong potential for generating expressive graph embeddings without the need for human annotations, making it particularly valuable in domains with high labeling costs such as molecular graph analysis. However,…

February 25, 2026

CONTINA: Confidence Interval for Traffic Demand Prediction with Coverage Guarantee

arXiv:2504.13961v2 Announce Type: replace Abstract: Accurate short-term traffic demand prediction is critical for the operation of traffic systems. Besides point estimation, the confidence interval of the prediction is also of great importance. Many models for traffic operations, such as shared…

February 25, 2026

Momentum Guidance: Plug-and-Play Guidance for Flow Models

arXiv:2602.20360v1 Announce Type: new Abstract: Flow-based generative models have become a strong framework for high-quality generative modeling, yet pretrained models are rarely used in their vanilla conditional form: conditional samples without guidance often appear diffuse and lack fine-grained detail due…

February 25, 2026

Foundation Models for Causal Inference via Prior-Data Fitted Networks

arXiv:2506.10914v3 Announce Type: replace Abstract: Prior-data fitted networks (PFNs) have recently been proposed as a promising way to train tabular foundation models. PFNs are transformers that are pre-trained on synthetic data generated from a prespecified prior distribution and that enable…

February 25, 2026

Quantitative Approximation Rates for Group Equivariant Learning

arXiv:2602.20370v1 Announce Type: new Abstract: The universal approximation theorem establishes that neural networks can approximate any continuous function on a compact set. Later works in approximation theory provide quantitative approximation rates for ReLU networks on the class of $alpha$-H”older functions…

February 25, 2026

Monte Carlo Tree Diffusion with Multiple Experts for Protein Design

arXiv:2509.15796v2 Announce Type: replace Abstract: The goal of protein design is to generate amino acid sequences that fold into functional structures with desired properties. Prior methods combining autoregressive language models with Monte Carlo Tree Search (MCTS) struggle with long-range dependencies…

February 25, 2026

cc-Shapley: Measuring Multivariate Feature Importance Needs Causal Context

arXiv:2602.20396v1 Announce Type: new Abstract: Explainable artificial intelligence promises to yield insights into relevant features, thereby enabling humans to examine and scrutinize machine learning models or even facilitating scientific discovery. Considering the widespread technique of Shapley values, we find that…

February 25, 2026

TeamFormer: Shallow Parallel Transformers with Progressive Approximation

arXiv:2510.15425v2 Announce Type: replace Abstract: The widespread ‘deeper is better’ philosophy has driven the creation of architectures like ResNet and Transformer, which achieve high performance by stacking numerous layers. However, increasing model depth comes with challenges such as longer training…

February 25, 2026

GeoPT: Scaling Physics Simulation via Lifted Geometric Pre-Training

arXiv:2602.20399v1 Announce Type: new Abstract: Neural simulators promise efficient surrogates for physics simulation, but scaling them is bottlenecked by the prohibitive cost of generating high-fidelity training data. Pre-training on abundant off-the-shelf geometries offers a natural alternative, yet faces a fundamental…

February 25, 2026

LORE: Jointly Learning the Intrinsic Dimensionality and Relative Similarity Structure From Ordinal Data

arXiv:2602.04192v2 Announce Type: replace Abstract: Learning the intrinsic dimensionality of subjective perceptual spaces such as taste, smell, or aesthetics from ordinal data is a challenging problem. We introduce LORE (Low Rank Ordinal Embedding), a scalable framework that jointly learns both…

February 25, 2026