Archives AI News

Bridging the phenotype-target gap for molecular generation via multi-objective reinforcement learning

arXiv:2509.21010v2 Announce Type: replace Abstract: The de novo generation of drug-like molecules capable of inducing desirable phenotypic changes is receiving increasing attention. However, previous methods predominantly rely on expression profiles to guide molecule generation, but overlook the perturbative effect of…

April 20, 2026

${pi}_{0.7}$: a Steerable Generalist Robotic Foundation Model with Emergent Capabilities

arXiv:2604.15483v1 Announce Type: new Abstract: We present a new robotic foundation model, called ${pi}_{0.7}$, that can enable strong out-of-the-box performance in a wide range of scenarios. ${pi}_{0.7}$ can follow diverse language instructions in unseen environments, including multi-stage tasks with various…

April 20, 2026

Beyond Single-Model Optimization: Preserving Plasticity in Continual Reinforcement Learning

arXiv:2604.15414v1 Announce Type: new Abstract: Continual reinforcement learning must balance retention with adaptation, yet many methods still rely on emph{single-model preservation}, committing to one evolving policy as the main reusable solution across tasks. Even when a previously successful policy is…

April 20, 2026

Natural gradient descent with momentum

arXiv:2604.15554v1 Announce Type: new Abstract: We consider the problem of approximating a function by an element of a nonlinear manifold which admits a differentiable parametrization, typical examples being neural networks with differentiable activation functions or tensor networks. Natural gradient descent…

April 20, 2026

Sentiment Analysis of German Sign Language Fairy Tales

arXiv:2604.16138v1 Announce Type: cross Abstract: We present a dataset and a model for sentiment analysis of German sign language (DGS) fairy tales. First, we perform sentiment analysis for three levels of valence (negative, neutral, positive) on German fairy tales text…

April 20, 2026

Power to the Clients: Federated Learning in a Dictatorship Setting

arXiv:2510.22149v3 Announce Type: replace Abstract: Federated learning (FL) has emerged as a promising paradigm for decentralized model training, enabling multiple clients to collaboratively learn a shared model without exchanging their local data. However, the decentralized nature of FL also introduces…

April 20, 2026

FineSteer: A Unified Framework for Fine-Grained Inference-Time Steering in Large Language Models

arXiv:2604.15488v1 Announce Type: new Abstract: Large language models (LLMs) often exhibit undesirable behaviors, such as safety violations and hallucinations. Although inference-time steering offers a cost-effective way to adjust model behavior without updating its parameters, existing methods often fail to be…

April 20, 2026

Learning Affine-Equivariant Proximal Operators

arXiv:2604.15556v1 Announce Type: new Abstract: Proximal operators are fundamental across many applications in signal processing and machine learning, including solving ill-posed inverse problems. Recent work has introduced Learned Proximal Networks (LPNs), providing parametric functions that compute exact proximals for data-driven…

April 20, 2026

FSPO: Few-Shot Optimization of Synthetic Preferences Personalizes to Real Users

arXiv:2502.19312v2 Announce Type: replace Abstract: Effective personalization of LLMs is critical for a broad range of user-interfacing applications such as virtual assistants and content curation. Inspired by the strong in-context capabilities of LLMs, we propose few-shot preference optimization (FSPO), an…

April 20, 2026

Dynamic Tool Dependency Retrieval for Lightweight Function Calling

arXiv:2512.17052v4 Announce Type: replace Abstract: Function calling agents powered by Large Language Models (LLMs) select external tools to automate complex tasks. On-device agents typically use a retrieval module to select relevant tools, improving performance and reducing context length. However, existing…

April 20, 2026