Archives AI News

SafeSwitch: Steering Unsafe LLM Behavior via Internal Activation Signals

arXiv:2502.01042v5 Announce Type: replace Abstract: Large language models (LLMs) exhibit exceptional capabilities across various tasks but also pose risks by generating harmful content. Existing safety mechanisms, while improving model safety, often lead to overly cautious behavior and fail to fully…

September 16, 2025

Variational Gaussian Mixture Manifold Models for Client-Specific Federated Personalization

arXiv:2509.10521v1 Announce Type: new Abstract: Personalized federated learning (PFL) often fails under label skew and non-stationarity because a single global parameterization ignores client-specific geometry. We introduce VGM$^2$ (Variational Gaussian Mixture Manifold), a geometry-centric PFL framework that (i) learns client-specific parametric…

September 16, 2025

Topology-Aware and Highly Generalizable Deep Reinforcement Learning for Efficient Retrieval in Multi-Deep Storage Systems

arXiv:2506.14787v2 Announce Type: replace Abstract: In modern industrial and logistics environments, the rapid expansion of fast delivery services has heightened the demand for storage systems that combine high efficiency with increased density. Multi-deep autonomous vehicle storage and retrieval systems (AVS/RS)…

September 16, 2025

Multimodal Deep Learning for ATCO Command Lifecycle Modeling and Workload Prediction

arXiv:2509.10522v1 Announce Type: new Abstract: Air traffic controllers (ATCOs) issue high-intensity voice commands in dense airspace, where accurate workload modeling is critical for safety and efficiency. This paper proposes a multimodal deep learning framework that integrates structured data, trajectory sequences,…

September 16, 2025

Principled Approximation Methods for Efficient and Scalable Deep Learning

arXiv:2509.00174v2 Announce Type: replace Abstract: Recent progress in deep learning has been driven by increasingly larger models. However, their computational and energy demands have grown proportionally, creating significant barriers to their deployment and to a wider adoption of deep learning…

September 16, 2025

From Predictions to Explanations: Explainable AI for Autism Diagnosis and Identification of Critical Brain Regions

arXiv:2509.10523v1 Announce Type: new Abstract: Autism spectrum disorder (ASD) is a neurodevelopmental condition characterized by atypical brain maturation. However, the adaptation of transfer learning paradigms in machine learning for ASD research remains notably limited. In this study, we propose a…

September 16, 2025

Generalized Dirichlet Energy and Graph Laplacians for Clustering Directed and Undirected Graphs

arXiv:2203.03221v3 Announce Type: replace-cross Abstract: Clustering in directed graphs remains a fundamental challenge due to the asymmetry in edge connectivity, which limits the applicability of classical spectral methods originally designed for undirected graphs. A common workaround is to symmetrize the…

September 16, 2025

Resource-Aware Neural Network Pruning Using Graph-based Reinforcement Learning

arXiv:2509.10526v1 Announce Type: new Abstract: This paper presents a novel approach to neural network pruning by integrating a graph-based observation space into an AutoML framework to address the limitations of existing methods. Traditional pruning approaches often depend on hand-crafted heuristics…

September 16, 2025

Social Perception of Faces in a Vision-Language Model

arXiv:2408.14435v2 Announce Type: replace-cross Abstract: We explore social perception of human faces in CLIP, a widely used open-source vision-language model. To this end, we compare the similarity in CLIP embeddings between different textual prompts and a set of face images.…

September 16, 2025

STM-Graph: A Python Framework for Spatio-Temporal Mapping and Graph Neural Network Predictions

arXiv:2509.10528v1 Announce Type: new Abstract: Urban spatio-temporal data present unique challenges for predictive analytics due to their dynamic and complex nature. We introduce STM-Graph, an open-source Python framework that transforms raw spatio-temporal urban event data into graph representations suitable for…

September 16, 2025