Archives AI News

Unified Cross-Scale 3D Generation and Understanding via Autoregressive Modeling

arXiv:2503.16278v3 Announce Type: replace Abstract: 3D structure modeling is essential across scales, enabling applications from fluid simulation and 3D reconstruction to protein folding and molecular docking. Yet, despite shared 3D spatial patterns, current approaches remain fragmented, with models narrowly specialized…

Matryoshka Pilot: Learning to Drive Black-Box LLMs with LLMs

arXiv:2410.20749v2 Announce Type: replace Abstract: Despite the impressive generative abilities of black-box large language models (LLMs), their inherent opacity hinders further advancements in capabilities such as reasoning, planning, and personalization. Existing works aim to enhance LLM capabilities via domain-specific adaptation,…

Wavefunction Flows: Efficient Quantum Simulation of Continuous Flow Models

arXiv:2510.08462v1 Announce Type: cross Abstract: Flow models are a cornerstone of modern machine learning. They are generative models that progressively transform probability distributions according to learned dynamics. Specifically, they learn a continuous-time Markov process that efficiently maps samples from a…

From Moments to Models: Graphon Mixture-Aware Mixup and Contrastive Learning

arXiv:2510.03690v2 Announce Type: replace Abstract: Real-world graph datasets often consist of mixtures of populations, where graphs are generated from multiple distinct underlying distributions. However, modern representation learning approaches, such as graph contrastive learning (GCL) and augmentation methods like Mixup, typically…

Markets for Models

arXiv:2503.02946v3 Announce Type: replace-cross Abstract: Motivated by the prevalence of prediction problems in the economy, we study markets in which firms sell models to a consumer to help improve their prediction. Firms decide whether to enter, choose models to train…

Learning to Route LLMs from Bandit Feedback: One Policy, Many Trade-offs

arXiv:2510.07429v1 Announce Type: new Abstract: Efficient use of large language models (LLMs) is critical for deployment at scale: without adaptive routing, systems either overpay for strong models or risk poor performance from weaker ones. Selecting the right LLM for each…