Archives AI News

Empowering Decision Trees via Shape Function Branching

arXiv:2510.19040v1 Announce Type: new Abstract: Decision trees are prized for their interpretability and strong performance on tabular data. Yet, their reliance on simple axis-aligned linear splits often forces deep, complex structures to capture non-linear feature effects, undermining human comprehension of…

Unlearned but Not Forgotten: Data Extraction after Exact Unlearning in LLM

arXiv:2505.24379v3 Announce Type: replace Abstract: Large Language Models are typically trained on datasets collected from the web, which may inadvertently contain harmful or sensitive personal information. To address growing privacy concerns, unlearning methods have been proposed to remove the influence…

Semi-off-Policy Reinforcement Learning for Vision-Language Slow-Thinking Reasoning

arXiv:2507.16814v2 Announce Type: replace Abstract: Enhancing large vision-language models (LVLMs) with visual slow-thinking reasoning is crucial for solving complex multimodal tasks. However, since LVLMs are mainly trained with vision-language alignment, it is difficult to adopt on-policy reinforcement learning (RL) to…

Weight Decay may matter more than muP for Learning Rate Transfer in Practice

arXiv:2510.19093v1 Announce Type: new Abstract: Transferring the optimal learning rate from small to large neural networks can enable efficient training at scales where hyperparameter tuning is otherwise prohibitively expensive. To this end, the Maximal Update Parameterization (muP) proposes a learning…

Fast MRI for All: Bridging Access Gaps by Training without Raw Data

arXiv:2411.13022v3 Announce Type: replace-cross Abstract: Physics-driven deep learning (PD-DL) approaches have become popular for improved reconstruction of fast magnetic resonance imaging (MRI) scans. Though PD-DL offers higher acceleration rates than existing clinical fast MRI techniques, their use has been limited…