Archives AI News

SOH-KLSTM: A Hybrid Kolmogorov-Arnold Network and LSTM Model for Enhanced Lithium-Ion Battery Health Monitoring

arXiv:2509.10496v1 Announce Type: new Abstract: Accurate and reliable State Of Health (SOH) estimation for Lithium (Li) batteries is critical to ensure the longevity, safety, and optimal performance of applications like electric vehicles, unmanned aerial vehicles, consumer electronics, and renewable energy…

September 16, 2025

Identifiable Autoregressive Variational Autoencoders for Nonlinear and Nonstationary Spatio-Temporal Blind Source Separation

arXiv:2509.11962v1 Announce Type: cross Abstract: The modeling and prediction of multivariate spatio-temporal data involve numerous challenges. Dimension reduction methods can significantly simplify this process, provided that they account for the complex dependencies between variables and across time and space. Nonlinear…

September 16, 2025

LogGuardQ: A Cognitive-Enhanced Reinforcement Learning Framework for Cybersecurity Anomaly Detection in Security Logs

arXiv:2509.10511v1 Announce Type: new Abstract: Reinforcement learning (RL) has transformed sequential decision-making, but traditional algorithms like Deep Q-Networks (DQNs) and Proximal Policy Optimization (PPO) often struggle with efficient exploration, stability, and adaptability in dynamic environments. This study presents LogGuardQ (Adaptive…

September 16, 2025

TED: Accelerate Model Training by Internal Generalization

arXiv:2405.03228v3 Announce Type: replace Abstract: Large language models have demonstrated strong performance in recent years, but the high cost of training drives the need for efficient methods to compress dataset sizes. We propose TED pruning, a method that addresses the…

September 16, 2025

A Service-Oriented Adaptive Hierarchical Incentive Mechanism for Federated Learning

arXiv:2509.10512v1 Announce Type: new Abstract: Recently, federated learning (FL) has emerged as a novel framework for distributed model training. In FL, the task publisher (TP) releases tasks, and local model owners (LMOs) use their local data to train models. Sometimes,…

September 16, 2025

Mixture-of-Clustered-Experts: Advancing Expert Specialization and Generalization in Instruction Tuning

arXiv:2509.10513v1 Announce Type: new Abstract: A sparse Mixture-of-Experts (MoE) architecture has emerged as a highly scalable solution by conditionally activating sub-modules without a proportional increase in computational costs. However, improving expert specialization to enhance performance and generalization remains a challenge…

September 16, 2025

Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models

arXiv:2309.01219v3 Announce Type: replace-cross Abstract: While large language models (LLMs) have demonstrated remarkable capabilities across a range of downstream tasks, a significant concern revolves around their propensity to exhibit hallucinations: LLMs occasionally generate content that diverges from the user input,…

September 16, 2025

A Differential Manifold Perspective and Universality Analysis of Continuous Attractors in Artificial Neural Networks

arXiv:2509.10514v1 Announce Type: new Abstract: Continuous attractors are critical for information processing in both biological and artificial neural systems, with implications for spatial navigation, memory, and deep learning optimization. However, existing research lacks a unified framework to analyze their properties…

September 16, 2025

Hallucinated Span Detection with Multi-View Attention Features

arXiv:2504.04335v2 Announce Type: replace-cross Abstract: This study addresses the problem of hallucinated span detection in the outputs of large language models. It has received less attention than output-level hallucination detection despite its practical importance. Prior work has shown that attentions…

September 16, 2025

Adaptive Preference Optimization with Uncertainty-aware Utility Anchor

arXiv:2509.10515v1 Announce Type: new Abstract: Offline preference optimization methods are efficient for large language models (LLMs) alignment. Direct Preference optimization (DPO)-like learning, one of the most popular approaches, stands out for its efficiency in reward modeling. However, these methods typically…

September 16, 2025