Archives AI News

AceGRPO: Adaptive Curriculum Enhanced Group Relative Policy Optimization for Autonomous Machine Learning Engineering

arXiv:2602.07906v3 Announce Type: replace Abstract: Autonomous Machine Learning Engineering (MLE) requires agents to perform sustained, iterative optimization over long horizons. While recent LLM-based agents show promise, current prompt-based agents for MLE suffer from behavioral stagnation due to frozen parameters. Although…

March 3, 2026

LIDS: LLM Summary Inference Under the Layered Lens

arXiv:2603.00105v1 Announce Type: new Abstract: Large language models (LLMs) have gained significant attention by many researchers and practitioners in natural language processing (NLP) since the introduction of ChatGPT in 2022. One notable feature of ChatGPT is its ability to generate…

March 3, 2026

VICatMix: variational Bayesian clustering and variable selection for discrete biomedical data

arXiv:2406.16227v2 Announce Type: replace-cross Abstract: Effective clustering of biomedical data is crucial in precision medicine, enabling accurate stratifiction of patients or samples. However, the growth in availability of high-dimensional categorical data, including `omics data, necessitates computationally efficient clustering algorithms. We…

March 3, 2026

MAML-KT: Addressing Cold Start Problem in Knowledge Tracing for New Students via Few-Shot Model-Agnostic Meta Learning

arXiv:2603.00137v1 Announce Type: new Abstract: Knowledge tracing (KT) models are commonly evaluated by training on early interactions from all students and testing on later responses. While effective for measuring average predictive performance, this evaluation design obscures a cold start scenario…

March 3, 2026

Rapid training of Hamiltonian graph networks using random features

arXiv:2506.06558v3 Announce Type: replace Abstract: Learning dynamical systems that respect physical symmetries and constraints remains a fundamental challenge in data-driven modeling. Integrating physical laws with graph neural networks facilitates principled modeling of complex N-body dynamics and yields accurate and permutation-invariant…

March 3, 2026

Gaming and Cooperation in Federated Learning: What Can Happen and How to Monitor It

arXiv:2509.02391v3 Announce Type: replace Abstract: The success of federated learning (FL) ultimately depends on how strategic participants behave under partial observability, yet most formulations still treat FL as a static optimization problem. We instead view FL deployments as governed strategic…

March 3, 2026

Unsupervised Discovery of Intermediate Phase Order in the Frustrated $J_1$-$J_2$ Heisenberg Model via Prometheus Framework

arXiv:2602.21468v2 Announce Type: replace-cross Abstract: The spin-$1/2$ $J_1$-$J_2$ Heisenberg model on the square lattice exhibits a debated intermediate phase between N’eel antiferromagnetic and stripe ordered regimes, with competing theories proposing plaquette valence bond, nematic, and quantum spin liquid ground states.…

March 3, 2026

A Randomized Linearly Convergent Frank-Wolfe-type Method for Smooth Convex Minimization over the Spectrahedron

arXiv:2503.01441v2 Announce Type: replace-cross Abstract: We consider the problem of minimizing a smooth and convex function over the $n$-dimensional spectrahedron — the set of real symmetric $ntimes n$ positive semidefinite matrices with unit trace, which underlies numerous applications in statistics,…

March 3, 2026

Property-Driven Evaluation of GNN Expressiveness at Scale: Datasets, Framework, and Study

arXiv:2603.00044v1 Announce Type: new Abstract: Advancing trustworthy AI requires principled software engineering approaches to model evaluation. Graph Neural Networks (GNNs) have achieved remarkable success in processing graph-structured data, however, their expressiveness in capturing fundamental graph properties remains an open challenge.…

March 3, 2026

HalluGuard: Demystifying Data-Driven and Reasoning-Driven Hallucinations in LLMs

arXiv:2601.18753v2 Announce Type: replace Abstract: The reliability of Large Language Models (LLMs) in high-stakes domains such as healthcare, law, and scientific discovery is often compromised by hallucinations. These failures typically stem from two sources: data-driven hallucinations and reasoning-driven hallucinations. However,…

March 3, 2026