Archives AI News

Inference-Time Scaling of Diffusion Language Models via Trajectory Refinement

arXiv:2507.08390v3 Announce Type: replace Abstract: Discrete diffusion models have recently emerged as strong alternatives to autoregressive language models, matching their performance through large-scale training. However, inference-time control remains relatively underexplored. In this work, we study how to steer generation toward…

On the Geometry of Positional Encodings in Transformers

arXiv:2604.05217v1 Announce Type: new Abstract: Neural language models process sequences of words, but the mathematical operations inside them are insensitive to the order in which words appear. Positional encodings are the component added to remedy this. Despite their importance, positional…

Curvature-Aware Optimization for High-Accuracy Physics-Informed Neural Networks

arXiv:2604.05230v1 Announce Type: new Abstract: Efficient and robust optimization is essential for neural networks, enabling scientific machine learning models to converge rapidly to very high accuracy — faithfully capturing complex physical behavior governed by differential equations. In this work, we…