Archives AI News

Simplex-to-Euclidean Bijections for Categorical Flow Matching

arXiv:2510.27480v2 Announce Type: replace Abstract: We propose a method for learning and sampling from probability distributions supported on the simplex. Our approach maps the open simplex to Euclidean space via smooth bijections, leveraging the Aitchison geometry to define the mappings,…

MoDora: Tree-Based Semi-Structured Document Analysis System

arXiv:2602.23061v1 Announce Type: cross Abstract: Semi-structured documents integrate diverse interleaved data elements (e.g., tables, charts, hierarchical paragraphs) arranged in various and often irregular layouts. These documents are widely observed across domains and account for a large portion of real-world data.…

Efficient Graph Coloring with Neural Networks: A Physics-Inspired Approach for Large Graphs

arXiv:2408.01503v2 Announce Type: replace Abstract: Combinatorial optimization problems near algorithmic phase transitions represent a fundamental challenge for both classical algorithms and machine learning approaches. Among them, graph coloring stands as a prototypical constraint satisfaction problem exhibiting sharp dynamical and satisfiability…

Code World Models for Parameter Control in Evolutionary Algorithms

arXiv:2602.22260v1 Announce Type: new Abstract: Can an LLM learn how an optimizer behaves — and use that knowledge to control it? We extend Code World Models (CWMs), LLM-synthesized Python programs that predict environment dynamics, from deterministic games to stochastic combinatorial…

Intelligence per Watt: Measuring Intelligence Efficiency of Local AI

arXiv:2511.07885v3 Announce Type: replace-cross Abstract: Large language model (LLM) queries are predominantly processed by frontier models in centralized cloud infrastructure. Rapidly growing demand strains this paradigm, and cloud providers struggle to scale infrastructure at pace. Two advances enable us to…

Deep Sequence Modeling with Quantum Dynamics: Language as a Wave Function

arXiv:2602.22255v1 Announce Type: new Abstract: We introduce a sequence modeling framework in which the latent state is a complex-valued wave function evolving on a finite-dimensional Hilbert space under a learned, time-dependent Hamiltonian. Unlike standard recurrent architectures that rely on gating…