Archives AI News

Armijo Line-search Can Make (Stochastic) Gradient Descent Provably Faster

arXiv:2503.00229v4 Announce Type: replace Abstract: Armijo line-search (Armijo-LS) is a standard method to set the step-size for gradient descent (GD). For smooth functions, Armijo-LS alleviates the need to know the global smoothness constant L and adapts to the “local” smoothness,…

Multimodal Crystal Flow: Any-to-Any Modality Generation for Unified Crystal Modeling

arXiv:2602.20210v1 Announce Type: new Abstract: Crystal modeling spans a family of conditional and unconditional generation tasks across different modalities, including crystal structure prediction (CSP) and de novo generation (DNG). While recent deep generative models have shown promising performance, they remain…

Model Merging in the Essential Subspace

arXiv:2602.20208v1 Announce Type: new Abstract: Model merging aims to integrate multiple task-specific fine-tuned models derived from a shared pre-trained checkpoint into a single multi-task model without additional training. Despite extensive research, task interference remains a major obstacle that often undermines…

Controllable Exploration in Hybrid-Policy RLVR for Multi-Modal Reasoning

arXiv:2602.20197v1 Announce Type: new Abstract: Reinforcement Learning with verifiable rewards (RLVR) has emerged as a primary learning paradigm for enhancing the reasoning capabilities of multi-modal large language models (MLLMs). However, during RL training, the enormous state space of MLLM and…

FedAvg-Based CTMC Hazard Model for Federated Bridge Deterioration Assessment

arXiv:2602.20194v1 Announce Type: new Abstract: Bridge periodic inspection records contain sensitive information about public infrastructure, making cross-organizational data sharing impractical under existing data governance constraints. We propose a federated framework for estimating a Continuous-Time Markov Chain (CTMC) hazard model of…

CryoLVM: Self-supervised Learning from Cryo-EM Density Maps with Large Vision Models

arXiv:2602.02620v2 Announce Type: replace-cross Abstract: Cryo-electron microscopy (cryo-EM) has revolutionized structural biology by enabling near-atomic-level visualization of biomolecular assemblies. However, the exponential growth in cryo-EM data throughput and complexity, coupled with diverse downstream analytical tasks, necessitates unified computational frameworks that…