Archives AI News

Position: Many generalization measures for deep learning are fragile

arXiv:2510.18934v1 Announce Type: new Abstract: A wide variety of generalization measures have been applied to deep neural networks (DNNs). Although obtaining tight bounds remains challenging, such measures are often assumed to reproduce qualitative generalization trends. In this position paper, we…

Noise-corrected GRPO: From Noisy Rewards to Unbiased Gradients

arXiv:2510.18924v1 Announce Type: new Abstract: Reinforcement learning from human feedback (RLHF) or verifiable rewards (RLVR), the standard paradigm for aligning LLMs or building recent SOTA reasoning models, is highly sensitive to noise from inconsistent or erroneous rewards. Yet, the interaction…

Benchmarking On-Device Machine Learning on Apple Silicon with MLX

arXiv:2510.18921v1 Announce Type: new Abstract: The recent widespread adoption of Large Language Models (LLMs) and machine learning in general has sparked research interest in exploring the possibilities of deploying these models on smaller devices such as laptops and mobile phones.…

ADPO: Anchored Direct Preference Optimization

arXiv:2510.18913v1 Announce Type: new Abstract: Anchored Direct Preference Optimization (ADPO) is a unified framework that generalizes Direct Preference Optimization (DPO) with soft preferences, reference-policy anchoring, and groupwise extensions. While standard DPO assumes hard binary labels and pairwise comparisons, ADPO introduces:…

NeuroAda: Activating Each Neuron’s Potential for Parameter-Efficient Fine-Tuning

arXiv:2510.18940v1 Announce Type: new Abstract: Existing parameter-efficient fine-tuning (PEFT) methods primarily fall into two categories: addition-based and selective in-situ adaptation. The former, such as LoRA, introduce additional modules to adapt the model to downstream tasks, offering strong memory efficiency. However,…

A Comprehensive Benchmark for RNA 3D Structure-Function Modeling

arXiv:2503.21681v3 Announce Type: replace-cross Abstract: The relationship between RNA structure and function has recently attracted interest within the deep learning community, a trend expected to intensify as nucleic acid structure models advance. Despite this momentum, the lack of standardized, accessible…