Archives AI News

Demystifying Domain-adaptive Post-training for Financial LLMs

arXiv:2501.04961v3 Announce Type: replace-cross Abstract: Domain-adaptive post-training of large language models (LLMs) has emerged as a promising approach for specialized domains such as medicine and finance. However, significant challenges remain in identifying optimal adaptation criteria and training strategies across varying…

GraphPFN: A Prior-Data Fitted Graph Foundation Model

arXiv:2509.21489v1 Announce Type: new Abstract: Foundation models pretrained on large-scale datasets have transformed such fields as natural language processing and computer vision, but their application to graph data remains limited. Recently emerged graph foundation models, such as G2T-FM, utilize tabular…

SlimDiff: Training-Free, Activation-Guided Hands-free Slimming of Diffusion Models

arXiv:2509.21498v1 Announce Type: new Abstract: Diffusion models (DMs), lauded for their generative performance, are computationally prohibitive due to their billion-scale parameters and iterative denoising dynamics. Existing efficiency techniques, such as quantization, timestep reduction, or pruning, offer savings in compute, memory,…

Smoothing-Based Conformal Prediction for Balancing Efficiency and Interpretability

arXiv:2509.22529v1 Announce Type: cross Abstract: Conformal Prediction (CP) is a distribution-free framework for constructing statistically rigorous prediction sets. While popular variants such as CD-split improve CP’s efficiency, they often yield prediction sets composed of multiple disconnected subintervals, which are difficult…

DistillKac: Few-Step Image Generation via Damped Wave Equations

arXiv:2509.21513v1 Announce Type: new Abstract: We present DistillKac, a fast image generator that uses the damped wave equation and its stochastic Kac representation to move probability mass at finite speed. In contrast to diffusion models whose reverse time velocities can…