Archives AI News

Agent Memory Below the Prompt: Persistent Q4 KV Cache for Multi-Agent LLM Inference on Edge Devices

arXiv:2603.04428v1 Announce Type: new Abstract: Multi-agent LLM systems on edge devices face a memory management problem: device RAM is too small to hold every agent’s KV cache simultaneously. On Apple M4 Pro with 10.2 GB of cache budget, only 3…

March 6, 2026

Flowers: A Warp Drive for Neural PDE Solvers

arXiv:2603.04430v1 Announce Type: new Abstract: We introduce Flowers, a neural architecture for learning PDE solution operators built entirely from multihead warps. Aside from pointwise channel mixing and a multiscale scaffold, Flowers use no Fourier multipliers, no dot-product attention, and no…

March 6, 2026

X-raying rocks reveals their carbon-storing capacity

New research by MIT geophysicists could assist efforts to remove carbon from the atmosphere and store it underground.

March 6, 2026

Seeds of something different

Kate Brown’s book, “Tiny Gardens Everywhere,” examines the hidden history of urban farming, its extensive use, and the politics of growing food.

March 6, 2026

Revolutionizing Mixed Precision Quantization: Towards Training-free Automatic Proxy Discovery via Large Language Models

arXiv:2512.07419v2 Announce Type: replace Abstract: Mixed-Precision Quantization (MPQ) liberates Deep Neural Networks (DNNs) from the Out-Of-Memory (OOM) bottleneck and has garnered increasing research attention. However, conventional methods either rely on costly differentiable optimization search, which is neither efficient nor flexible,…

March 6, 2026

Supervised Metric Regularization Through Alternating Optimization for Multi-Regime Physics-Informed Neural Networks

arXiv:2602.09980v2 Announce Type: replace Abstract: Standard Physics-Informed Neural Networks (PINNs) often face challenges when modeling parameterized dynamical systems with sharp regime transitions, such as bifurcations. In these scenarios, the continuous mapping from parameters to solutions can result in spectral bias…

March 6, 2026

Continuous Chain of Thought Enables Parallel Exploration and Reasoning

arXiv:2505.23648v3 Announce Type: replace Abstract: Modern language models generate chain-of-thought traces by autoregressively sampling tokens from a finite vocabulary. While this discrete sampling has achieved remarkable success, conducting chain-of-thought with continuously-valued tokens (CoT2) offers a richer and more expressive alternative.…

March 6, 2026

Topology Structure Optimization of Reservoirs Using GLMY Homology

arXiv:2509.11612v2 Announce Type: replace Abstract: Reservoir is an efficient network for time series processing. It is well known that network structure is one of the determinants of its performance. However, the topology structure of reservoirs, as well as their performance,…

March 6, 2026

Inference-time optimization for experiment-grounded protein ensemble generation

arXiv:2602.24007v2 Announce Type: replace-cross Abstract: Protein function relies on dynamic conformational ensembles, yet current generative models like AlphaFold3 often fail to produce ensembles that match experimental data. Recent experiment-guided generators attempt to address this by steering the reverse diffusion process.…

March 6, 2026

RoboPocket: Improve Robot Policies Instantly with Your Phone

arXiv:2603.05504v1 Announce Type: cross Abstract: Scaling imitation learning is fundamentally constrained by the efficiency of data collection. While handheld interfaces have emerged as a scalable solution for in-the-wild data acquisition, they predominantly operate in an open-loop manner: operators blindly collect…

March 6, 2026