Archives AI News

AOI: Turning Failed Trajectories into Training Signals for Autonomous Cloud Diagnosis

arXiv:2603.03378v2 Announce Type: replace Abstract: Large language model (LLM) agents offer a promising data-driven approach to automating Site Reliability Engineering (SRE), yet their enterprise deployment is constrained by three challenges: restricted access to proprietary data, unsafe action execution under permission-governed…

SealQA: Raising the Bar for Reasoning in Search-Augmented Language Models

arXiv:2506.01062v3 Announce Type: replace-cross Abstract: We introduce SealQA, a new challenge benchmark for evaluating SEarch-Augmented Language models on fact-seeking questions where web search yields conflicting, noisy, or unhelpful results. SealQA comes in three flavors: (1) Seal-0 (main) and (2) Seal-Hard,…

On Emergences of Non-Classical Statistical Characteristics in Classical Neural Networks

arXiv:2603.04451v1 Announce Type: new Abstract: Inspired by measurement incompatibility and Bell-family inequalities in quantum mechanics, we propose the Non-Classical Network (NCnet), a simple classical neural architecture that stably exhibits non-classical statistical behaviors under typical and interpretable experimental setups. We find…