Archives AI News

The Anatomy of a Triton Attention Kernel

arXiv:2511.11581v1 Announce Type: new Abstract: A long-standing goal in both industry and academia is to develop an LLM inference platform that is portable across hardware architectures, eliminates the need for low-level hand-tuning, and still delivers best-in-class efficiency. In this work,…

November 18, 2025

Parallel and Multi-Stage Knowledge Graph Retrieval for Behaviorally Aligned Financial Asset Recommendations

arXiv:2511.11583v1 Announce Type: new Abstract: Large language models (LLMs) show promise for personalized financial recommendations but are hampered by context limits, hallucinations, and a lack of behavioral grounding. Our prior work, FLARKO, embedded structured knowledge graphs (KGs) in LLM prompts…

November 18, 2025

Decoupling Positional and Symbolic Attention Behavior in Transformers

arXiv:2511.11579v1 Announce Type: new Abstract: An important aspect subtending language understanding and production is the ability to independently encode positional and symbolic information of the words within a sentence. In Transformers, positional information is typically encoded using Positional Encodings (PEs).…

November 18, 2025

DAOpt: Modeling and Evaluation of Data-Driven Optimization under Uncertainty with LLMs

arXiv:2511.11576v1 Announce Type: new Abstract: Recent advances in large language models (LLMs) have accelerated research on automated optimization modeling. While real-world decision-making is inherently uncertain, most existing work has focused on deterministic optimization with known parameters, leaving the application of…

November 18, 2025

Detecting Statistically Significant Fairness Violations in Recidivism Forecasting Algorithms

arXiv:2511.11575v1 Announce Type: new Abstract: Machine learning algorithms are increasingly deployed in critical domains such as finance, healthcare, and criminal justice [1]. The increasing popularity of algorithmic decision-making has stimulated interest in algorithmic fairness within the academic community. Researchers have…

November 18, 2025

Self-Supervised Learning of Graph Representations for Network Intrusion Detection

arXiv:2509.16625v3 Announce Type: replace Abstract: Detecting intrusions in network traffic is a challenging task, particularly under limited supervision and constantly evolving attack patterns. While recent works have leveraged graph neural networks for network intrusion detection, they often decouple representation learning…

November 18, 2025

WildfireGenome: Interpretable Machine Learning Reveals Local Drivers of Wildfire Risk and Their Cross-County Variation

arXiv:2511.11589v1 Announce Type: new Abstract: Current wildfire risk assessments rely on coarse hazard maps and opaque machine learning models that optimize regional accuracy while sacrificing interpretability at the decision scale. WildfireGenome addresses these gaps through three components: (1) fusion of…

November 18, 2025

Towards Non-Stationary Time Series Forecasting with Temporal Stabilization and Frequency Differencing

arXiv:2511.08229v5 Announce Type: replace Abstract: Time series forecasting is critical for decision-making across dynamic domains such as energy, finance, transportation, and cloud computing. However, real-world time series often exhibit non-stationarity, including temporal distribution shifts and spectral variability, which pose significant…

November 18, 2025

Mind Your Entropy: From Maximum Entropy to Trajectory Entropy-Constrained RL

arXiv:2511.11592v1 Announce Type: new Abstract: Maximum entropy has become a mainstream off-policy reinforcement learning (RL) framework for balancing exploitation and exploration. However, two bottlenecks still limit further performance improvement: (1) non-stationary Q-value estimation caused by jointly injecting entropy and updating…

November 18, 2025

Fast Equivariant Imaging: Acceleration for Unsupervised Learning via Augmented Lagrangian and Auxiliary PnP Denoisers

arXiv:2507.06764v2 Announce Type: replace-cross Abstract: In this work, we propose Fast Equivariant Imaging (FEI), a novel unsupervised learning framework to rapidly and efficiently train deep imaging networks without ground-truth data. From the perspective of reformulating the Equivariant Imaging based optimization…

November 18, 2025