Archives AI News

da4ml: Distributed Arithmetic for Real-time Neural Networks on FPGAs

arXiv:2507.04535v2 Announce Type: replace-cross Abstract: Neural networks with a latency requirement on the order of microseconds, like the ones used at the CERN Large Hadron Collider, are typically deployed on FPGAs fully unrolled and pipelined. A bottleneck for the deployment…

Consequentialist Objectives and Catastrophe

arXiv:2603.15017v3 Announce Type: replace-cross Abstract: Because human preferences are too complex to codify, AIs operate with misspecified objectives. Optimizing such objectives often produces undesirable outcomes; this phenomenon is known as reward hacking. Such outcomes are not necessarily catastrophic. Indeed, most…

The Exact Replica Threshold for Nonlinear Moments of Quantum States

arXiv:2604.22627v1 Announce Type: cross Abstract: Joint measurements on multiple copies of a quantum state provide access to nonlinear observables such as $operatorname{tr}(rho^t)$, but whether replica number marks a sharp information-theoretic resource boundary has remained unclear. For every fixed order $tge…

Insect-inspired modular architectures as inductive biases for reinforcement learning

arXiv:2604.22081v1 Announce Type: new Abstract: Most reinforcement-learning (RL) controllers used in continuous control are architecturally centralized: observations are compressed into a single latent state from which both value estimates and actions are produced. Biological control systems are often organized differently.…

Relaxation-Informed Training of Neural Network Surrogate Models

arXiv:2604.22746v1 Announce Type: cross Abstract: ReLU neural networks trained as surrogate models can be embedded exactly in mixed-integer linear programs (MILPs), enabling global optimization over the learned function. The tractability of the resulting MILP depends on structural properties of the…

Removing Sandbagging in LLMs by Training with Weak Supervision

arXiv:2604.22082v1 Announce Type: new Abstract: As AI systems begin to automate complex tasks, supervision increasingly relies on weaker models or limited human oversight that cannot fully verify output quality. A model more capable than its supervisors could exploit this gap…