Archives AI News

dLLM: Simple Diffusion Language Modeling

arXiv:2602.22661v1 Announce Type: cross Abstract: Although diffusion language models (DLMs) are evolving quickly, many recent models converge on a set of shared components. These components, however, are distributed across ad-hoc research codebases or lack transparent implementations, making them difficult to…

Unsupervised Continual Learning for Amortized Bayesian Inference

arXiv:2602.22884v1 Announce Type: cross Abstract: Amortized Bayesian Inference (ABI) enables efficient posterior estimation using generative neural networks trained on simulated data, but often suffers from performance degradation under model misspecification. While self-consistency (SC) training on unlabeled empirical data can enhance…

Manifold of Failure: Behavioral Attraction Basins in Language Models

arXiv:2602.22291v1 Announce Type: new Abstract: While prior work has focused on projecting adversarial examples back onto the manifold of natural data to restore safety, we argue that a comprehensive understanding of AI safety requires characterizing the unsafe regions themselves. This…

Regular Fourier Features for Nonstationary Gaussian Processes

arXiv:2602.23006v1 Announce Type: cross Abstract: Simulating a Gaussian process requires sampling from a high-dimensional Gaussian distribution, which scales cubically with the number of sample locations. Spectral methods address this challenge by exploiting the Fourier representation, treating the spectral density as…

The Spacetime of Diffusion Models: An Information Geometry Perspective

arXiv:2505.17517v4 Announce Type: replace Abstract: We present a novel geometric perspective on the latent space of diffusion models. We first show that the standard pullback approach, utilizing the deterministic probability flow ODE decoder, is fundamentally flawed. It provably forces geodesics…

Simplex-to-Euclidean Bijections for Categorical Flow Matching

arXiv:2510.27480v2 Announce Type: replace Abstract: We propose a method for learning and sampling from probability distributions supported on the simplex. Our approach maps the open simplex to Euclidean space via smooth bijections, leveraging the Aitchison geometry to define the mappings,…

MoDora: Tree-Based Semi-Structured Document Analysis System

arXiv:2602.23061v1 Announce Type: cross Abstract: Semi-structured documents integrate diverse interleaved data elements (e.g., tables, charts, hierarchical paragraphs) arranged in various and often irregular layouts. These documents are widely observed across domains and account for a large portion of real-world data.…

Efficient Graph Coloring with Neural Networks: A Physics-Inspired Approach for Large Graphs

arXiv:2408.01503v2 Announce Type: replace Abstract: Combinatorial optimization problems near algorithmic phase transitions represent a fundamental challenge for both classical algorithms and machine learning approaches. Among them, graph coloring stands as a prototypical constraint satisfaction problem exhibiting sharp dynamical and satisfiability…