Archives AI News

CUDABench: Benchmarking LLMs for Text-to-CUDA Generation

arXiv:2603.02236v1 Announce Type: new Abstract: Recent studies have demonstrated the potential of Large Language Models (LLMs) in generating GPU Kernels. Current benchmarks focus on the translation of high-level languages into CUDA, overlooking the more general and challenging task of text-to-CUDA…

March 4, 2026

Scale-invariant Gaussian derivative residual networks

arXiv:2603.02843v1 Announce Type: cross Abstract: Generalisation across image scales remains a fundamental challenge for deep networks, which often fail to handle images at scales not seen during training (the out-of-distribution problem). In this paper, we present provably scale-invariant Gaussian derivative…

March 4, 2026

Concept Heterogeneity-aware Representation Steering

arXiv:2603.02237v1 Announce Type: new Abstract: Representation steering offers a lightweight mechanism for controlling the behavior of large language models (LLMs) by intervening on internal activations at inference time. Most existing methods rely on a single global steering direction, typically obtained…

March 4, 2026

Channel-Adaptive Edge AI: Maximizing Inference Throughput by Adapting Computational Complexity to Channel States

arXiv:2603.03146v1 Announce Type: cross Abstract: emph{Integrated communication and computation} (IC$^2$) has emerged as a new paradigm for enabling efficient edge inference in sixth-generation (6G) networks. However, the design of IC$^2$ technologies is hindered by the lack of a tractable theoretical…

March 4, 2026

Length Generalization Bounds for Transformers

arXiv:2603.02238v1 Announce Type: new Abstract: Length generalization is a key property of a learning algorithm that enables it to make correct predictions on inputs of any length, given finite training data. To provide such a guarantee, one needs to be…

March 4, 2026

LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory

arXiv:2603.03269v1 Announce Type: cross Abstract: Feedforward geometric foundation models achieve strong short-window reconstruction, yet scaling them to minutes-long videos is bottlenecked by quadratic attention complexity or limited effective memory in recurrent designs. We present LoGeR (Long-context Geometric Reconstruction), a novel…

March 4, 2026

High-order Knowledge Based Network Controllability Robustness Prediction: A Hypergraph Neural Network Approach

arXiv:2603.02265v1 Announce Type: new Abstract: In order to evaluate the invulnerability of networks against various types of attacks and provide guidance for potential performance enhancement as well as controllability maintenance, network controllability robustness (NCR) has attracted increasing attention in recent…

March 4, 2026

Making informed decisions in cutting tool maintenance in milling: A KNN-based model agnostic approach

arXiv:2310.14629v3 Announce Type: replace Abstract: Tool Condition Monitoring (TCM) is vital for maintaining productivity and product quality in machining. This study leverages machine learning to analyze real-time force signals collected from experiments under various tool wear conditions. Statistical analysis and…

March 4, 2026

Boosting Meta-Learning for Few-Shot Text Classification via Label-guided Distance Scaling

arXiv:2603.02267v1 Announce Type: new Abstract: Few-shot text classification aims to recognize unseen classes with limited labeled text samples. Existing approaches focus on boosting meta-learners by developing complex algorithms in the training stage. However, the labeled samples are randomly selected during…

March 4, 2026

A “ChatGPT for spreadsheets” helps solve difficult engineering challenges faster

The approach could help engineers tackle extremely complex design problems, from power grid optimization to vehicle design.

March 4, 2026