Archives AI News

Category learning in deep neural networks: Information content and geometry of internal representations

arXiv:2510.19021v1 Announce Type: new Abstract: In animals, category learning enhances discrimination between stimuli close to the category boundary. This phenomenon, called categorical perception, was also empirically observed in artificial neural networks trained on classification tasks. In previous modeling works based…

October 23, 2025

Customizing Spider Silk: Generative Models with Mechanical Property Conditioning for Protein Engineering

arXiv:2504.08437v2 Announce Type: replace Abstract: The remarkable mechanical properties of spider silk, including its tensile strength and extensibility, are primarily governed by the repetitive regions of the proteins that constitute the fiber, the major ampullate spidroins (MaSps). However, establishing correlations…

October 23, 2025

Empowering Decision Trees via Shape Function Branching

arXiv:2510.19040v1 Announce Type: new Abstract: Decision trees are prized for their interpretability and strong performance on tabular data. Yet, their reliance on simple axis-aligned linear splits often forces deep, complex structures to capture non-linear feature effects, undermining human comprehension of…

October 23, 2025

Unlearned but Not Forgotten: Data Extraction after Exact Unlearning in LLM

arXiv:2505.24379v3 Announce Type: replace Abstract: Large Language Models are typically trained on datasets collected from the web, which may inadvertently contain harmful or sensitive personal information. To address growing privacy concerns, unlearning methods have been proposed to remove the influence…

October 23, 2025

POLAR: Policy-based Layerwise Reinforcement Learning Method for Stealthy Backdoor Attacks in Federated Learning

arXiv:2510.19056v1 Announce Type: new Abstract: Federated Learning (FL) enables decentralized model training across multiple clients without exposing local data, but its distributed feature makes it vulnerable to backdoor attacks. Despite early FL backdoor attacks modifying entire models, recent studies have…

October 23, 2025

Semi-off-Policy Reinforcement Learning for Vision-Language Slow-Thinking Reasoning

arXiv:2507.16814v2 Announce Type: replace Abstract: Enhancing large vision-language models (LVLMs) with visual slow-thinking reasoning is crucial for solving complex multimodal tasks. However, since LVLMs are mainly trained with vision-language alignment, it is difficult to adopt on-policy reinforcement learning (RL) to…

October 23, 2025

Weight Decay may matter more than muP for Learning Rate Transfer in Practice

arXiv:2510.19093v1 Announce Type: new Abstract: Transferring the optimal learning rate from small to large neural networks can enable efficient training at scales where hyperparameter tuning is otherwise prohibitively expensive. To this end, the Maximal Update Parameterization (muP) proposes a learning…

October 23, 2025

Rebalancing with Calibrated Sub-classes (RCS): A Statistical Fusion-based Framework for Robust Imbalanced Classification across Modalities

arXiv:2510.13656v2 Announce Type: replace Abstract: Class imbalance, where certain classes have insufficient data, poses a critical challenge for robust classification, often biasing models toward majority classes. Distribution calibration offers a promising avenue to address this by estimating more accurate class…

October 23, 2025

What Makes a Good Curriculum? Disentangling the Effects of Data Ordering on LLM Mathematical Reasoning

arXiv:2510.19099v1 Announce Type: new Abstract: Curriculum learning (CL) – ordering training data from easy to hard – has become a popular strategy for improving reasoning in large language models (LLMs). Yet prior work employs disparate difficulty metrics and training setups,…

October 23, 2025

Fast MRI for All: Bridging Access Gaps by Training without Raw Data

arXiv:2411.13022v3 Announce Type: replace-cross Abstract: Physics-driven deep learning (PD-DL) approaches have become popular for improved reconstruction of fast magnetic resonance imaging (MRI) scans. Though PD-DL offers higher acceleration rates than existing clinical fast MRI techniques, their use has been limited…

October 23, 2025