Archives AI News

How Memory in Optimization Algorithms Implicitly Modifies the Loss

arXiv:2502.02132v2 Announce Type: replace Abstract: In modern optimization methods used in deep learning, each update depends on the history of previous iterations, often referred to as memory, and this dependence decays fast as the iterates go further into the past.…

Higher-Order Causal Structure Learning with Additive Models

arXiv:2511.03831v1 Announce Type: new Abstract: Causal structure learning has long been the central task of inferring causal insights from data. Despite the abundance of real-world processes exhibiting higher-order mechanisms, however, an explicit treatment of interactions in causal discovery has received…

Enhancing Q-Value Updates in Deep Q-Learning via Successor-State Prediction

arXiv:2511.03836v1 Announce Type: new Abstract: Deep Q-Networks (DQNs) estimate future returns by learning from transitions sampled from a replay buffer. However, the target updates in DQN often rely on next states generated by actions from past, potentially suboptimal, policy. As…

Fraud-Proof Revenue Division on Subscription Platforms

arXiv:2511.04465v1 Announce Type: cross Abstract: We study a model of subscription-based platforms where users pay a fixed fee for unlimited access to content, and creators receive a share of the revenue. Existing approaches to detecting fraud predominantly rely on machine…

Efficient Model Development through Fine-tuning Transfer

arXiv:2503.20110v2 Announce Type: replace-cross Abstract: Modern LLMs struggle with efficient updates, as each new pretrained model version requires repeating expensive alignment processes. This challenge also applies to domain- or languagespecific models, where fine-tuning on specialized data must be redone for…

FLOWR.root: A flow matching based foundation model for joint multi-purpose structure-aware 3D ligand generation and affinity prediction

arXiv:2510.02578v3 Announce Type: replace-cross Abstract: We present FLOWR:root, an equivariant flow-matching model for pocket-aware 3D ligand generation with joint binding affinity prediction and confidence estimation. The model supports de novo generation, pharmacophore-conditional sampling, fragment elaboration, and multi-endpoint affinity prediction (pIC50,…

Test-Time Warmup for Multimodal Large Language Models

arXiv:2509.10641v2 Announce Type: replace Abstract: Multimodal Large Language Models (MLLMs) hold great promise for advanced reasoning at the intersection of text and images, yet they have not fully realized this potential. MLLMs typically integrate an LLM, a vision encoder, and…