Archives AI News

Cut Your Losses! Learning to Prune Paths Early for Efficient Parallel Reasoning

arXiv:2604.16029v1 Announce Type: cross Abstract: Parallel reasoning enhances Large Reasoning Models (LRMs) but incurs prohibitive costs due to futile paths caused by early errors. To mitigate this, path pruning at the prefix level is essential, yet existing research remains fragmented…

April 20, 2026

Jailbreak Scaling Laws for Large Language Models: Polynomial-Exponential Crossover

arXiv:2603.11331v2 Announce Type: replace Abstract: Adversarial attacks can reliably steer safety-aligned large language models toward unsafe behavior. Empirically, we find that strong adversarial prompt-injection attacks can amplify attack success rate from the slow polynomial growth observed without injection to exponential…

April 20, 2026

${pi}_{0.7}$: a Steerable Generalist Robotic Foundation Model with Emergent Capabilities

arXiv:2604.15483v1 Announce Type: new Abstract: We present a new robotic foundation model, called ${pi}_{0.7}$, that can enable strong out-of-the-box performance in a wide range of scenarios. ${pi}_{0.7}$ can follow diverse language instructions in unseen environments, including multi-stage tasks with various…

April 20, 2026

Sentiment Analysis of German Sign Language Fairy Tales

arXiv:2604.16138v1 Announce Type: cross Abstract: We present a dataset and a model for sentiment analysis of German sign language (DGS) fairy tales. First, we perform sentiment analysis for three levels of valence (negative, neutral, positive) on German fairy tales text…

April 20, 2026

FineSteer: A Unified Framework for Fine-Grained Inference-Time Steering in Large Language Models

arXiv:2604.15488v1 Announce Type: new Abstract: Large language models (LLMs) often exhibit undesirable behaviors, such as safety violations and hallucinations. Although inference-time steering offers a cost-effective way to adjust model behavior without updating its parameters, existing methods often fail to be…

April 20, 2026

FSPO: Few-Shot Optimization of Synthetic Preferences Personalizes to Real Users

arXiv:2502.19312v2 Announce Type: replace Abstract: Effective personalization of LLMs is critical for a broad range of user-interfacing applications such as virtual assistants and content curation. Inspired by the strong in-context capabilities of LLMs, we propose few-shot preference optimization (FSPO), an…

April 20, 2026

ProtoTTA: Prototype-Guided Test-Time Adaptation

arXiv:2604.15494v1 Announce Type: new Abstract: Deep networks that rely on prototypes-interpretable representations that can be related to the model input-have gained significant attention for balancing high accuracy with inherent interpretability, which makes them suitable for critical domains such as healthcare.…

April 20, 2026

ChemAmp: Amplified Chemistry Tools via Composable Agents

arXiv:2505.21569v3 Announce Type: replace Abstract: Although LLM-based agents are proven to master tool orchestration in scientific fields, particularly chemistry, their single-task performance remains limited by underlying tool constraints. To this end, we propose tool amplification, a novel paradigm that enhances…

April 20, 2026

Optimizing Stochastic Gradient Push under Broadcast Communications

arXiv:2604.15549v1 Announce Type: new Abstract: We consider the problem of minimizing the convergence time for decentralized federated learning (DFL) in wireless networks under broadcast communications, with focus on mixing matrix design. The mixing matrix is a critical hyperparameter for DFL…

April 20, 2026

Bridging the phenotype-target gap for molecular generation via multi-objective reinforcement learning

arXiv:2509.21010v2 Announce Type: replace Abstract: The de novo generation of drug-like molecules capable of inducing desirable phenotypic changes is receiving increasing attention. However, previous methods predominantly rely on expression profiles to guide molecule generation, but overlook the perturbative effect of…

April 20, 2026