ToMA: Token Merge with Attention for Diffusion Models
arXiv:2509.10918v2 Announce Type: replace
Abstract: Diffusion models excel in high-fidelity image generation but face scalability limits due to transformers' quadratic attention complexity. Plug-and-play token reduction methods like ToMeSD and ToFu reduce FLOPs by merging redundant tokens in generated images but…
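To make the idea of plug-and-play token merging concrete, here is a minimal sketch of bipartite soft matching in the style of ToMe/ToMeSD: tokens are split into alternating source and destination sets, each source token is paired with its most similar destination token, and the `r` most similar pairs are averaged together. This is an illustrative sketch under assumed conventions (NumPy arrays, simple pairwise averaging), not ToMA's actual algorithm; the function name `merge_tokens` is hypothetical.

```python
import numpy as np

def merge_tokens(tokens: np.ndarray, r: int) -> np.ndarray:
    """Reduce an (n, d) token matrix by merging r redundant tokens.

    Sketch of ToMe-style bipartite soft matching: partition tokens into
    alternating src/dst sets, match each src token to its most similar
    dst token by cosine similarity, and average the r closest pairs.
    """
    # Normalize rows so that dot products are cosine similarities.
    norm = tokens / np.linalg.norm(tokens, axis=1, keepdims=True)
    src_n, dst_n = norm[::2], norm[1::2]        # alternating partition
    sim = src_n @ dst_n.T                        # (n_src, n_dst) similarities
    best_dst = sim.argmax(axis=1)                # best dst match per src token
    best_sim = sim.max(axis=1)

    # Merge the r src tokens with the highest match scores into their
    # dst partners; keep the rest unchanged.
    merge_idx = np.argsort(-best_sim)[:r]
    keep_idx = np.setdiff1d(np.arange(len(src_n)), merge_idx)
    src_tok, dst_tok = tokens[::2].copy(), tokens[1::2].copy()
    for i in merge_idx:
        j = best_dst[i]
        # Simple pairwise average; a full implementation would track
        # merge counts and use a weighted mean.
        dst_tok[j] = (dst_tok[j] + src_tok[i]) / 2
    return np.concatenate([src_tok[keep_idx], dst_tok], axis=0)
```

Because attention cost is quadratic in the number of tokens, dropping even a modest fraction of tokens per layer compounds into a substantial FLOP reduction across the network.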
