Archives AI News

TRACE: Learning to Compute on Graphs

arXiv:2509.21886v1 Announce Type: new Abstract: Learning to compute, the ability to model the functional behavior of a computational graph, is a fundamental challenge for graph representation learning. Yet, the dominant paradigm is architecturally mismatched for this task. This flawed assumption,…

GenesisGeo: Technical Report

arXiv:2509.21896v1 Announce Type: new Abstract: We present GenesisGeo, an automated theorem prover in Euclidean geometry. We have open-sourced a large-scale geometry dataset of 21.8 million geometric problems, over 3 million of which contain auxiliary constructions. Specially, we significantly accelerate the…

Stuffed Mamba: Oversized States Lead to the Inability to Forget

arXiv:2410.07145v3 Announce Type: replace-cross Abstract: Recent advancements in recurrent architectures, such as Mamba and RWKV, have showcased strong language capabilities. Unlike transformer-based models, these architectures encode all contextual information into a fixed-size state, leading to great inference efficiency. However, this…

DyRo-MCTS: A Robust Monte Carlo Tree Search Approach to Dynamic Job Shop Scheduling

arXiv:2509.21902v1 Announce Type: new Abstract: Dynamic job shop scheduling, a fundamental combinatorial optimisation problem in various industrial sectors, poses substantial challenges for effective scheduling due to frequent disruptions caused by the arrival of new jobs. State-of-the-art methods employ machine learning…

RuCCoD: Towards Automated ICD Coding in Russian

arXiv:2502.21263v2 Announce Type: replace-cross Abstract: This study investigates the feasibility of automating clinical coding in Russian, a language with limited biomedical resources. We present a new dataset for ICD coding, which includes diagnosis fields from electronic health records (EHRs) annotated…

UniErase: Towards Balanced and Precise Unlearning in Language Models

arXiv:2505.15674v2 Announce Type: replace-cross Abstract: Large language models (LLMs) require iterative updates to address the outdated information problem, where LLM unlearning offers an approach for selective removal. However, mainstream unlearning methods primarily rely on fine-tuning techniques, which often lack precision…

RISK: A Framework for GUI Agents in E-commerce Risk Management

arXiv:2509.21982v1 Announce Type: new Abstract: E-commerce risk management requires aggregating diverse, deeply embedded web data through multi-step, stateful interactions, which traditional scraping methods and most existing Graphical User Interface (GUI) agents cannot handle. These agents are typically limited to single-step…