Archives AI News

Output Supervision Can Obfuscate the Chain of Thought

arXiv:2511.11584v1 Announce Type: new Abstract: OpenAI (2025) showed that training against a chain of thought (CoT) monitor can cause obfuscated CoTs, which contain bad behavior the monitor cannot detect. They proposed to keep CoTs monitorable by training only against output…

November 18, 2025

Parameter-Efficient and Personalized Federated Training of Generative Models at the Edge

arXiv:2511.11585v1 Announce Type: new Abstract: Large generative models (for example, language and diffusion models) enable high-quality text and image synthesis but are hard to train or adapt in cross-device federated settings due to heavy computation and communication and statistical/system heterogeneity.…

November 18, 2025

The Anatomy of a Triton Attention Kernel

arXiv:2511.11581v1 Announce Type: new Abstract: A long-standing goal in both industry and academia is to develop an LLM inference platform that is portable across hardware architectures, eliminates the need for low-level hand-tuning, and still delivers best-in-class efficiency. In this work,…

November 18, 2025

Parallel and Multi-Stage Knowledge Graph Retrieval for Behaviorally Aligned Financial Asset Recommendations

arXiv:2511.11583v1 Announce Type: new Abstract: Large language models (LLMs) show promise for personalized financial recommendations but are hampered by context limits, hallucinations, and a lack of behavioral grounding. Our prior work, FLARKO, embedded structured knowledge graphs (KGs) in LLM prompts…

November 18, 2025

Decoupling Positional and Symbolic Attention Behavior in Transformers

arXiv:2511.11579v1 Announce Type: new Abstract: An important aspect subtending language understanding and production is the ability to independently encode positional and symbolic information of the words within a sentence. In Transformers, positional information is typically encoded using Positional Encodings (PEs).…

November 18, 2025

DAOpt: Modeling and Evaluation of Data-Driven Optimization under Uncertainty with LLMs

arXiv:2511.11576v1 Announce Type: new Abstract: Recent advances in large language models (LLMs) have accelerated research on automated optimization modeling. While real-world decision-making is inherently uncertain, most existing work has focused on deterministic optimization with known parameters, leaving the application of…

November 18, 2025

Detecting Statistically Significant Fairness Violations in Recidivism Forecasting Algorithms

arXiv:2511.11575v1 Announce Type: new Abstract: Machine learning algorithms are increasingly deployed in critical domains such as finance, healthcare, and criminal justice [1]. The increasing popularity of algorithmic decision-making has stimulated interest in algorithmic fairness within the academic community. Researchers have…

November 18, 2025

Self-Supervised Learning of Graph Representations for Network Intrusion Detection

arXiv:2509.16625v3 Announce Type: replace Abstract: Detecting intrusions in network traffic is a challenging task, particularly under limited supervision and constantly evolving attack patterns. While recent works have leveraged graph neural networks for network intrusion detection, they often decouple representation learning…

November 18, 2025

WildfireGenome: Interpretable Machine Learning Reveals Local Drivers of Wildfire Risk and Their Cross-County Variation

arXiv:2511.11589v1 Announce Type: new Abstract: Current wildfire risk assessments rely on coarse hazard maps and opaque machine learning models that optimize regional accuracy while sacrificing interpretability at the decision scale. WildfireGenome addresses these gaps through three components: (1) fusion of…

November 18, 2025

Bigger datasets aren’t always better

MIT researchers developed a way to identify the smallest dataset that guarantees optimal solutions to complex problems.

November 18, 2025