Archives AI News

MMAI Gym for Science: Training Liquid Foundation Models for Drug Discovery

arXiv:2603.03517v1 Announce Type: new Abstract: General-purpose large language models (LLMs) that rely on in-context learning do not reliably deliver the scientific understanding and performance required for drug discovery tasks. Simply increasing model size or introducing reasoning tokens does not yield…

Context Biasing for Pronunciation-Orthography Mismatch in Automatic Speech Recognition

arXiv:2506.18703v3 Announce Type: replace-cross Abstract: Neural sequence-to-sequence systems deliver state-of-the-art performance for automatic speech recognition. When using appropriate modeling units, e.g., byte-pair encoding, these systems are in principle open vocabulary systems. In practice, however, they often fail to recognize words…

Q-Measure-Learning for Continuous State RL: Efficient Implementation and Convergence

arXiv:2603.03523v1 Announce Type: new Abstract: We study reinforcement learning in infinite-horizon discounted Markov decision processes with continuous state spaces, where data are generated online from a single trajectory under a Markovian behavior policy. To avoid maintaining an infinite-dimensional, function-valued estimate,…

The Geometry of Reasoning: Flowing Logics in Representation Space

arXiv:2510.09782v2 Announce Type: replace-cross Abstract: We study how large language models (LLMs) “think” through their representation space. We propose a novel geometric framework that models an LLM’s reasoning as flows — embedding trajectories evolving where logic goes. We disentangle logical…

Test-Time Meta-Adaptation with Self-Synthesis

arXiv:2603.03524v1 Announce Type: new Abstract: As strong general reasoners, large language models (LLMs) encounter diverse domains and tasks, where the ability to adapt and self-improve at test time is valuable. We introduce MASS, a meta-learning framework that enables LLMs to…

Chimera: Neuro-Symbolic Attention Primitives for Trustworthy Dataplane Intelligence

arXiv:2602.12851v2 Announce Type: replace-cross Abstract: Deploying expressive learning models directly on programmable dataplanes promises line-rate, low-latency traffic analysis but remains hindered by strict hardware constraints and the need for predictable, auditable behavior. Chimera introduces a principled framework that maps attention-oriented…

CodeTaste: Can LLMs Generate Human-Level Code Refactorings?

arXiv:2603.04177v1 Announce Type: cross Abstract: Large language model (LLM) coding agents can generate working code, but their solutions often accumulate complexity, duplication, and architectural debt. Human developers address such issues through refactoring: behavior-preserving program transformations that improve structure and maintainability.…

mlx-snn: Spiking Neural Networks on Apple Silicon via MLX

arXiv:2603.03529v1 Announce Type: new Abstract: We introduce mlx-snn, the first spiking neural network (SNN) library built natively on Apple’s MLX framework. As SNN research grows rapidly, all major libraries — snnTorch, Norse, SpikingJelly, Lava — target PyTorch or custom backends,…