Archives AI News

Steer Like the LLM: Activation Steering that Mimics Prompting

arXiv:2605.03907v1 Announce Type: cross Abstract: Large language models can be steered at inference time through prompting or activation interventions, but activation steering methods often underperform compared to prompt-based approaches. We propose a framework that formulates prompt steering as a form…

May 6, 2026

Calibration of the underlying surface parameters for urban flood using latent variables and adjoint equation

arXiv:2605.02959v1 Announce Type: new Abstract: Calibrating the urban underlying surface parameters is crucial for urban flood simulation. We formulate the parameter calibration problem into an optimization problem within the Bayesian framework using the maximum likelihood principle. We adopt the urban…

May 6, 2026

Safety and accuracy follow different scaling laws in clinical large language models

arXiv:2605.04039v1 Announce Type: cross Abstract: Clinical LLMs are often scaled by increasing model size, context length, retrieval complexity, or inference-time compute, with the implicit expectation that higher accuracy implies safer behavior. This assumption is incomplete in medicine, where a few…

May 6, 2026

ZeRO-Prefill: Zero Redundancy Overheads in MoE Prefill Serving

arXiv:2605.02960v1 Announce Type: new Abstract: Production LLM workloads increasingly serve discriminative tasks, such as classification, recommendation, and verification, whose answers are read from the logits of a single prefill pass with no autoregressive decoding. Serving these prefill-only workloads on mixture-of-experts…

May 6, 2026

Method for stress-testing cloud computing algorithms helps avoid network failures

The “MetaEase” technique provides a heads-up to potential scenarios that could cause long wait-times or outages.

May 6, 2026

Fisher Decorator: Refining Flow Policy via a Local Transport Map

arXiv:2604.17919v2 Announce Type: replace Abstract: Recent advances in flow-based offline reinforcement learning (RL) have achieved strong performance by parameterizing policies via flow matching. However, they still face critical trade-offs among expressiveness, optimality, and efficiency. In particular, existing flow policies interpret…

May 6, 2026

InvisibleInk: High-Utility and Low-Cost Text Generation with Differential Privacy

arXiv:2507.02974v3 Announce Type: replace Abstract: As major progress in LLM-based long-form text generation enables paradigms such as retrieval-augmented generation (RAG) and inference-time scaling, safely incorporating private information into the generation remains a critical open question. We present InvisibleInk, a highly…

May 6, 2026

Learning a Stochastic Differential Equation Model of Tropical Cyclone Intensification from Reanalysis and Observational Data

arXiv:2601.08116v3 Announce Type: replace Abstract: Tropical cyclones are among the most consequential weather hazards, yet estimates of their risk are limited by the relatively short historical record. To extend these records, researchers often generate large ensembles of synthetic storms using…

May 6, 2026

Amortized Variational Inference for Joint Posterior and Predictive Distributions in Bayesian Uncertainty Quantification

arXiv:2605.03710v1 Announce Type: cross Abstract: Bayesian predictive inference propagates parameter uncertainty to quantities of interest through the posterior-predictive distribution. In practice, this is typically performed using a two-stage procedure: first approximating the posterior distribution of model parameters, and then propagating…

May 6, 2026

Label-Efficient School Detection from Aerial Imagery via Weakly Supervised Pretraining and Fine-Tuning

arXiv:2605.03968v1 Announce Type: cross Abstract: Accurate school detection is essential for supporting education initiatives, including infrastructure planning and expanding internet connectivity to underserved areas. However, many regions around the world face challenges due to outdated, incomplete, or unavailable official records.…

May 6, 2026