Archives AI News

Library Liberation: Competitive Performance Matmul Through Compiler-composed Nanokernels

arXiv:2511.13764v1 Announce Type: new Abstract: The rapidly evolving landscape of AI and machine learning workloads has widened the gap between high-level domain operations and efficient hardware utilization. Achieving near-peak performance still demands deep hardware expertise: experts either handcraft target-specific kernels (e.g.,…

Clone Deterministic 3D Worlds

arXiv:2510.26782v2 Announce Type: replace Abstract: A world model is an internal model that simulates how the world evolves. Given past observations and actions, it predicts the future physical state of both the embodied agent and its environment. Accurate world models…
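The abstract defines a world model as a function that, given past observations and actions, predicts the future state of agent and environment. A minimal sketch of that interface, using a hypothetical toy linear dynamics (`A`, `B` and the class name are illustrative assumptions, not from the paper):

```python
import numpy as np

class DeterministicWorldModel:
    """Toy sketch of a deterministic world model: maps (state, action) to
    the next state. Here the dynamics are a fixed linear transition
    s' = A @ s + B @ a; in practice the transition function is learned."""

    def __init__(self, A, B):
        self.A = np.asarray(A, dtype=float)
        self.B = np.asarray(B, dtype=float)

    def step(self, state, action):
        # One-step prediction of the next physical state.
        return self.A @ np.asarray(state, float) + self.B @ np.asarray(action, float)

    def rollout(self, state, actions):
        # Autoregressive rollout: feed each prediction back as the next input.
        states = []
        for a in actions:
            state = self.step(state, a)
            states.append(state)
        return np.stack(states)

model = DeterministicWorldModel(A=[[1.0, 0.1], [0.0, 1.0]], B=[[0.0], [0.1]])
trajectory = model.rollout([0.0, 0.0], [[1.0], [1.0], [0.0]])
print(trajectory.shape)
```

Because the model is deterministic, identical observation/action histories always yield identical predicted trajectories, which is the property "Clone Deterministic 3D Worlds" appears to exploit.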

Credal Ensemble Distillation for Uncertainty Quantification

arXiv:2511.13766v1 Announce Type: new Abstract: Deep ensembles (DE) have emerged as a powerful approach for quantifying predictive uncertainty and distinguishing its aleatoric and epistemic components, thereby enhancing model robustness and reliability. However, their high computational and memory costs during inference…
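The abstract mentions splitting predictive uncertainty into aleatoric and epistemic components. For plain deep ensembles (before any distillation), the standard entropy-based decomposition looks like this sketch; function names are illustrative:

```python
import numpy as np

def entropy(p):
    # Shannon entropy along the last axis, with a small epsilon for stability.
    p = np.asarray(p, dtype=float)
    return -np.sum(p * np.log(p + 1e-12), axis=-1)

def decompose_uncertainty(member_probs):
    """member_probs: (M, C) class probabilities from M ensemble members.
    Returns (total, aleatoric, epistemic):
      total     = entropy of the averaged prediction,
      aleatoric = mean entropy of individual members,
      epistemic = their difference (mutual information / disagreement)."""
    member_probs = np.asarray(member_probs, dtype=float)
    mean_p = member_probs.mean(axis=0)
    total = entropy(mean_p)
    aleatoric = entropy(member_probs).mean()
    epistemic = total - aleatoric
    return total, aleatoric, epistemic
```

The inference cost the abstract flags comes from evaluating all M members per input; distilling the ensemble into a single (here, credal) model aims to keep this decomposition while paying for only one forward pass.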

Dynamic Temperature Scheduler for Knowledge Distillation

arXiv:2511.13767v1 Announce Type: new Abstract: Knowledge Distillation (KD) trains a smaller student model using a large, pre-trained teacher model, with temperature as a key hyperparameter controlling the softness of output probabilities. Traditional methods use a fixed temperature throughout training, which…
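The fixed-temperature baseline the abstract contrasts against is the standard Hinton-style distillation loss, where the temperature T softens both teacher and student distributions. A minimal sketch (the T² scaling and KL form follow the classic formulation; the paper's dynamic schedule is not reproduced here):

```python
import numpy as np

def softmax(z, T=1.0):
    # Temperature-scaled softmax: larger T produces softer probabilities.
    z = np.asarray(z, dtype=float) / T
    z -= z.max()  # numerical stability
    e = np.exp(z)
    return e / e.sum()

def kd_loss(teacher_logits, student_logits, T=4.0):
    # KL(teacher || student) on temperature-softened distributions,
    # scaled by T**2 so gradients stay comparable across temperatures.
    p = softmax(teacher_logits, T)
    q = softmax(student_logits, T)
    return (T ** 2) * np.sum(p * (np.log(p + 1e-12) - np.log(q + 1e-12)))

# Higher T flattens the distribution, exposing "dark knowledge" in the tail classes.
print(softmax([4.0, 1.0, 0.5], T=1.0))
print(softmax([4.0, 1.0, 0.5], T=8.0))
```

A dynamic scheduler, as proposed here, would vary T over training rather than holding it fixed for the whole run.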

MoM: Linear Sequence Modeling with Mixture-of-Memories

arXiv:2502.13685v4 Announce Type: replace-cross Abstract: Linear sequence modeling methods, such as linear attention, state space modeling, and linear RNNs, offer significant efficiency improvements by reducing the complexity of training and inference. However, these methods typically compress the entire input sequence…
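The efficiency claim rests on linear sequence models maintaining a fixed-size recurrent state instead of attending over the full prefix. A sketch of the single-memory causal linear-attention recurrence that such methods share (MoM's mixture of multiple memories is not reproduced; the feature map `phi` is an illustrative choice):

```python
import numpy as np

def linear_attention(Q, K, V, phi=lambda x: np.maximum(x, 0.0) + 1.0):
    """Causal linear attention in recurrent form. The running state S
    (d_k x d_v) and normalizer z compress the entire prefix, so each
    step costs O(d_k * d_v) regardless of sequence length."""
    T, d_k = Q.shape
    d_v = V.shape[1]
    S = np.zeros((d_k, d_v))
    z = np.zeros(d_k)
    out = np.zeros((T, d_v))
    for t in range(T):
        q, k, v = phi(Q[t]), phi(K[t]), V[t]
        S += np.outer(k, v)   # accumulate key-value memory
        z += k                # accumulate normalizer
        out[t] = (q @ S) / (q @ z + 1e-12)
    return out
```

This recurrence is exactly equivalent to softmax-free attention with kernel `phi`, but it is the single compressed state `S` that the abstract identifies as the bottleneck, motivating a mixture of such memories.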

Compiling to linear neurons

arXiv:2511.13769v1 Announce Type: new Abstract: We don’t program neural networks directly. Instead, we rely on an indirect style where learning algorithms, like gradient descent, determine a neural network’s function by learning from data. This indirect style is often a virtue;…