Archives AI News

The Anatomy of a Triton Attention Kernel

arXiv:2511.11581v1. Announce Type: new. Abstract: A long-standing goal in both industry and academia is to develop an LLM inference platform that is portable across hardware architectures, eliminates the need for low-level hand-tuning, and still delivers best-in-class efficiency. In this work,…

Decoupling Positional and Symbolic Attention Behavior in Transformers

arXiv:2511.11579v1. Announce Type: new. Abstract: An important aspect subtending language understanding and production is the ability to independently encode positional and symbolic information of the words within a sentence. In Transformers, positional information is typically encoded using Positional Encodings (PEs).…

DAOpt: Modeling and Evaluation of Data-Driven Optimization under Uncertainty with LLMs

arXiv:2511.11576v1. Announce Type: new. Abstract: Recent advances in large language models (LLMs) have accelerated research on automated optimization modeling. While real-world decision-making is inherently uncertain, most existing work has focused on deterministic optimization with known parameters, leaving the application of…

Self-Supervised Learning of Graph Representations for Network Intrusion Detection

arXiv:2509.16625v3. Announce Type: replace. Abstract: Detecting intrusions in network traffic is a challenging task, particularly under limited supervision and constantly evolving attack patterns. While recent works have leveraged graph neural networks for network intrusion detection, they often decouple representation learning…

Mind Your Entropy: From Maximum Entropy to Trajectory Entropy-Constrained RL

arXiv:2511.11592v1. Announce Type: new. Abstract: Maximum entropy has become a mainstream off-policy reinforcement learning (RL) framework for balancing exploitation and exploration. However, two bottlenecks still limit further performance improvement: (1) non-stationary Q-value estimation caused by jointly injecting entropy and updating…