Archives AI News

Will updating your AI agents help or hamper their performance? Raindrop’s new tool Experiments tells you

It seems like almost every week for the last two years since ChatGPT launched, new large language models (LLMs) from rival labs or from OpenAI itself have been released. Enterprises are hard pressed to keep up with the massive pace…

October 10, 2025

Microsoft Research Releases Skala: a Deep-Learning Exchange–Correlation Functional Targeting Hybrid-Level Accuracy at Semi-Local Cost

TL;DR: Skala is a deep-learning exchange–correlation functional for Kohn–Sham Density Functional Theory (DFT) that targets hybrid-level accuracy at semi-local cost, reporting MAE ≈ 1.06 kcal/mol on W4-17 (0.85 on the single-reference subset) and WTMAD-2 ≈ 3.89 kcal/mol on GMTKN55; evaluations…

October 10, 2025

Unified Cross-Scale 3D Generation and Understanding via Autoregressive Modeling

arXiv:2503.16278v3 Announce Type: replace Abstract: 3D structure modeling is essential across scales, enabling applications from fluid simulation and 3D reconstruction to protein folding and molecular docking. Yet, despite shared 3D spatial patterns, current approaches remain fragmented, with models narrowly specialized…

October 10, 2025

BLAZER: Bootstrapping LLM-based Manipulation Agents with Zero-Shot Data Generation

arXiv:2510.08572v1 Announce Type: cross Abstract: Scaling data and models has played a pivotal role in the remarkable progress of computer vision and language. Inspired by these domains, recent efforts in robotics have similarly focused on scaling both data and model…

October 10, 2025

Matryoshka Pilot: Learning to Drive Black-Box LLMs with LLMs

arXiv:2410.20749v2 Announce Type: replace Abstract: Despite the impressive generative abilities of black-box large language models (LLMs), their inherent opacity hinders further advancements in capabilities such as reasoning, planning, and personalization. Existing works aim to enhance LLM capabilities via domain-specific adaptation,…

October 10, 2025

FireGNN: Neuro-Symbolic Graph Neural Networks with Trainable Fuzzy Rules for Interpretable Medical Image Classification

arXiv:2509.10510v2 Announce Type: replace-cross Abstract: Medical image classification requires not only high predictive performance but also interpretability to ensure clinical trust and adoption. Graph Neural Networks (GNNs) offer a powerful framework for modeling relational structures within datasets; however, standard GNNs…

October 10, 2025

Wavefunction Flows: Efficient Quantum Simulation of Continuous Flow Models

arXiv:2510.08462v1 Announce Type: cross Abstract: Flow models are a cornerstone of modern machine learning. They are generative models that progressively transform probability distributions according to learned dynamics. Specifically, they learn a continuous-time Markov process that efficiently maps samples from a…

October 10, 2025

From Moments to Models: Graphon Mixture-Aware Mixup and Contrastive Learning

arXiv:2510.03690v2 Announce Type: replace Abstract: Real-world graph datasets often consist of mixtures of populations, where graphs are generated from multiple distinct underlying distributions. However, modern representation learning approaches, such as graph contrastive learning (GCL) and augmentation methods like Mixup, typically…

October 10, 2025

Markets for Models

arXiv:2503.02946v3 Announce Type: replace-cross Abstract: Motivated by the prevalence of prediction problems in the economy, we study markets in which firms sell models to a consumer to help improve their prediction. Firms decide whether to enter, choose models to train…

October 10, 2025

Learning to Route LLMs from Bandit Feedback: One Policy, Many Trade-offs

arXiv:2510.07429v1 Announce Type: new Abstract: Efficient use of large language models (LLMs) is critical for deployment at scale: without adaptive routing, systems either overpay for strong models or risk poor performance from weaker ones. Selecting the right LLM for each…

October 10, 2025