Archives AI News

Exploiting the Experts: Unauthorized Compression in MoE-LLMs

arXiv:2511.19480v1 Announce Type: new Abstract: Mixture-of-Experts (MoE) architectures are increasingly adopted in large language models (LLMs) for their scalability and efficiency. However, their modular structure introduces a unique vulnerability: adversaries can attempt to compress or repurpose models by pruning experts…

WavefrontDiffusion: Dynamic Decoding Schedule or Improved Reasoning

arXiv:2511.19473v1 Announce Type: new Abstract: Diffusion Language Models (DLMs) have shown strong potential for text generation and are becoming a competitive alternative to autoregressive models. The denoising strategy plays an important role in determining the quality of their outputs. Mainstream…

PrefixGPT: Prefix Adder Optimization by a Generative Pre-trained Transformer

arXiv:2511.19472v1 Announce Type: new Abstract: Prefix adders are widely used in compute-intensive applications for their high speed. However, designing optimized prefix adders is challenging due to strict design rules and an exponentially large design space. We introduce PrefixGPT, a generative…

The Generalized Proximity Forest

arXiv:2511.19487v1 Announce Type: new Abstract: Recent work has demonstrated the utility of Random Forest (RF) proximities for various supervised machine learning tasks, including outlier detection, missing data imputation, and visualization. However, the utility of the RF proximities depends upon the…

Segmentation-Aware Generative Reinforcement Network (GRN) for Tissue Layer Segmentation in 3-D Ultrasound Images for Chronic Low-back Pain (cLBP) Assessment

arXiv:2501.17690v4 Announce Type: replace-cross Abstract: We introduce a novel segmentation-aware joint training framework called generative reinforcement network (GRN) that integrates segmentation loss feedback to optimize both image generation and segmentation performance in a single stage. An image enhancement technique called…

Generative Model-Aided Continual Learning for CSI Feedback in FDD mMIMO-OFDM Systems

arXiv:2511.19490v1 Announce Type: new Abstract: Deep autoencoder (DAE) frameworks have demonstrated their effectiveness in reducing channel state information (CSI) feedback overhead in massive multiple-input multiple-output (mMIMO) orthogonal frequency division multiplexing (OFDM) systems. However, existing CSI feedback models struggle to adapt…

Steganographic Backdoor Attacks in NLP: Ultra-Low Poisoning and Defense Evasion

arXiv:2511.14301v2 Announce Type: replace-cross Abstract: Transformer models are foundational to natural language processing (NLP) applications, yet remain vulnerable to backdoor attacks introduced through poisoned data, which implant hidden behaviors during training. To strengthen the ability to prevent such compromises, recent…