Archives AI News

P3-LLM: An Integrated NPU-PIM Accelerator for LLM Inference Using Hybrid Numerical Formats

arXiv:2511.06838v1 Announce Type: cross Abstract: The substantial memory bandwidth and computational demand of large language models (LLMs) present critical challenges for efficient inference. To tackle this, the literature has explored heterogeneous systems that combine neural processing units (NPUs) with DRAM-based…

November 11, 2025

Depth-induced NTK: Bridging Over-parameterized Neural Networks and Deep Neural Kernels

arXiv:2511.05585v1 Announce Type: new Abstract: While deep learning has achieved remarkable success across a wide range of applications, its theoretical understanding of representation learning remains limited. Deep neural kernels provide a principled framework to interpret over-parameterized neural networks by mapping…

November 11, 2025

Walsh-Hadamard Neural Operators for Solving PDEs with Discontinuous Coefficients

arXiv:2511.07347v1 Announce Type: cross Abstract: Neural operators have emerged as powerful tools for learning solution operators of partial differential equations (PDEs). However, standard spectral methods based on Fourier transforms struggle with problems involving discontinuous coefficients due to the Gibbs phenomenon…

November 11, 2025

Prompting Neural-Guided Equation Discovery Based on Residuals

arXiv:2511.05586v1 Announce Type: new Abstract: Neural-guided equation discovery systems use a data set as prompt and predict an equation that describes the data set without extensive search. However, if the equation does not meet the user’s expectations, there are few…

November 11, 2025

Adaptive Group Robust Ensemble Knowledge Distillation

arXiv:2411.14984v2 Announce Type: replace Abstract: Neural networks can learn spurious correlations in the data, often leading to performance degradation for underrepresented subgroups. Studies have demonstrated that the disparity is amplified when knowledge is distilled from a complex teacher model to…

November 11, 2025

CoPRIS: Efficient and Stable Reinforcement Learning via Concurrency-Controlled Partial Rollout with Importance Sampling

arXiv:2511.05589v1 Announce Type: new Abstract: Reinforcement learning (RL) post-training has become a trending paradigm for enhancing the capabilities of large language models (LLMs). Most existing RL systems for LLMs operate in a fully synchronous manner, where training must wait for…

November 11, 2025

From Invariant Representations to Invariant Data: Provable Robustness to Spurious Correlations via Noisy Counterfactual Matching

arXiv:2505.24843v2 Announce Type: replace Abstract: Models that learn spurious correlations from training data often fail when deployed in new environments. While many methods aim to learn invariant representations to address this, they often underperform standard empirical risk minimization (ERM). We…

November 11, 2025

FedSparQ: Adaptive Sparse Quantization with Error Feedback for Robust & Efficient Federated Learning

arXiv:2511.05591v1 Announce Type: new Abstract: Federated Learning (FL) enables collaborative model training across decentralized clients while preserving data privacy by keeping raw data local. However, FL suffers from significant communication overhead due to the frequent exchange of high-dimensional model updates…

November 11, 2025

An upper bound of the silhouette validation metric for clustering

arXiv:2509.08625v2 Announce Type: replace Abstract: The silhouette coefficient quantifies, for each observation, the balance between within-cluster cohesion and between-cluster separation, taking values in [-1, 1]. The average silhouette width (ASW) is a widely used internal measure of clustering quality, with…

November 11, 2025

GRAVER: Generative Graph Vocabularies for Robust Graph Foundation Models Fine-tuning

arXiv:2511.05592v1 Announce Type: new Abstract: Inspired by the remarkable success of foundation models in language and vision, Graph Foundation Models (GFMs) hold significant promise for broad applicability across diverse graph tasks and domains. However, existing GFMs struggle with unstable few-shot…

November 11, 2025