Archives AI News

From ARIMA to Attention: Power Load Forecasting Using Temporal Deep Learning

arXiv:2603.06622v1 Announce Type: new Abstract: Accurate short-term power load forecasting is important to effectively manage, optimize, and ensure the robustness of modern power systems. This paper performs an empirical evaluation of a traditional statistical model and deep learning approaches for…

MEM: Multi-Scale Embodied Memory for Vision Language Action Models

arXiv:2603.03596v2 Announce Type: replace-cross Abstract: Conventionally, memory in end-to-end robotic learning involves inputting a sequence of past observations into the learned policy. However, in complex multi-stage real-world tasks, the robot’s memory must represent past events at multiple levels of granularity:…

Advances in GRPO for Generation Models: A Survey

arXiv:2603.06623v1 Announce Type: new Abstract: Large-scale flow matching models have achieved strong performance across generative tasks such as text-to-image, video, 3D, and speech synthesis. However, aligning their outputs with human preferences and task-specific objectives remains challenging. Flow-GRPO extends Group Relative…

Viewpoint-Agnostic Grasp Pipeline using VLM and Partial Observations

arXiv:2603.07866v1 Announce Type: cross Abstract: Robust grasping in cluttered, unstructured environments remains challenging for mobile legged manipulators due to occlusions that lead to partial observations, unreliable depth estimates, and the need for collision-free, execution-feasible approaches. In this paper we present…

Grouter: Decoupling Routing from Representation for Accelerated MoE Training

arXiv:2603.06626v1 Announce Type: new Abstract: Traditional Mixture-of-Experts (MoE) training typically proceeds without any structural priors, effectively requiring the model to simultaneously train expert weights while searching for an optimal routing policy within a vast combinatorial space. This entanglement often leads…

Electrocardiogram Classification with Transformers Using Koopman and Wavelet Features

arXiv:2603.08339v1 Announce Type: cross Abstract: Electrocardiogram (ECG) analysis is vital for detecting cardiac abnormalities, yet robust automated classification is challenging due to the complexity and variability of physiological signals. In this work, we investigate transformer-based ECG classification using features derived…

PostTrainBench: Can LLM Agents Automate LLM Post-Training?

arXiv:2603.08640v1 Announce Type: cross Abstract: AI agents have become surprisingly proficient at software engineering over the past year, largely due to improvements in reasoning capabilities. This raises a deeper question: can these systems extend their capabilities to automate AI research…