Archives AI News

Adaptive Redundancy Regulation for Balanced Multimodal Information Refinement

arXiv:2511.13755v1 Announce Type: new Abstract: Multimodal learning aims to improve performance by leveraging data from multiple sources. During joint multimodal training, due to modality bias, the advantaged modality often dominates backpropagation, leading to imbalanced optimization. Existing methods still face two…

SCALEX: Scalable Concept and Latent Exploration for Diffusion Models

arXiv:2511.13750v1 Announce Type: new Abstract: Image generation models frequently encode social biases, including stereotypes tied to gender, race, and profession. Existing methods for analyzing these biases in diffusion models either focus narrowly on predefined categories or depend on manual interpretation…

EvoLM: In Search of Lost Language Model Training Dynamics

arXiv:2506.16029v2 Announce Type: replace-cross Abstract: Modern language model (LM) training has been divided into multiple stages, making it difficult for downstream developers to evaluate the impact of design choices made at each stage. We present EvoLM, a model suite that…

ChemFixer: Correcting Invalid Molecules to Unlock Previously Unseen Chemical Space

arXiv:2511.13758v1 Announce Type: new Abstract: Deep learning-based molecular generation models have shown great potential in efficiently exploring vast chemical spaces by generating potential drug candidates with desired properties. However, these models often produce chemically invalid molecules, which limits the usable…

MoETTA: Test-Time Adaptation Under Mixed Distribution Shifts with MoE-LayerNorm

arXiv:2511.13760v1 Announce Type: new Abstract: Test-Time adaptation (TTA) has proven effective in mitigating performance drops under single-domain distribution shifts by updating model parameters during inference. However, real-world deployments often involve mixed distribution shifts, where test samples are affected by diverse…