Archives AI News

Advances in GRPO for Generation Models: A Survey

arXiv:2603.06623v1 Announce Type: new Abstract: Large-scale flow matching models have achieved strong performance across generative tasks such as text-to-image, video, 3D, and speech synthesis. However, aligning their outputs with human preferences and task-specific objectives remains challenging. Flow-GRPO extends Group Relative…

Viewpoint-Agnostic Grasp Pipeline using VLM and Partial Observations

arXiv:2603.07866v1 Announce Type: cross Abstract: Robust grasping in cluttered, unstructured environments remains challenging for mobile legged manipulators due to occlusions that lead to partial observations, unreliable depth estimates, and the need for collision-free, execution-feasible approaches. In this paper we present…

Grouter: Decoupling Routing from Representation for Accelerated MoE Training

arXiv:2603.06626v1 Announce Type: new Abstract: Traditional Mixture-of-Experts (MoE) training typically proceeds without any structural priors, effectively requiring the model to simultaneously train expert weights while searching for an optimal routing policy within a vast combinatorial space. This entanglement often leads…

Electrocardiogram Classification with Transformers Using Koopman and Wavelet Features

arXiv:2603.08339v1 Announce Type: cross Abstract: Electrocardiogram (ECG) analysis is vital for detecting cardiac abnormalities, yet robust automated classification is challenging due to the complexity and variability of physiological signals. In this work, we investigate transformer-based ECG classification using features derived…

PostTrainBench: Can LLM Agents Automate LLM Post-Training?

arXiv:2603.08640v1 Announce Type: cross Abstract: AI agents have become surprisingly proficient at software engineering over the past year, largely due to improvements in reasoning capabilities. This raises a deeper question: can these systems extend their capabilities to automate AI research…

A new Uncertainty Principle in Machine Learning

arXiv:2603.06634v1 Announce Type: new Abstract: Many scientific problems in the context of machine learning can be reduced to the search of polynomial answers in appropriate variables. The Hevisidization of arbitrary polynomial is actually provided by one-and-the same two-layer expression. What…

OTAD: An Optimal Transport-Induced Robust Model for Agnostic Adversarial Attack

arXiv:2408.00329v2 Announce Type: replace Abstract: Deep neural networks (DNNs) are vulnerable to small adversarial perturbations of the inputs, posing a significant challenge to their reliability and robustness. Empirical methods such as adversarial training can defend against particular attacks but remain…