Archives AI News

MAESTRO: Multi-Agent Environment Shaping through Task and Reward Optimization

arXiv:2511.19253v2 Announce Type: replace Abstract: Cooperative Multi-Agent Reinforcement Learning (MARL) faces two major design bottlenecks: crafting dense reward functions and constructing curricula that avoid local optima in high-dimensional, non-stationary environments. Existing approaches rely on fixed heuristics or use Large Language…

December 11, 2025

Learning Unmasking Policies for Diffusion Language Models

arXiv:2512.09106v1 Announce Type: new Abstract: Diffusion (Large) Language Models (dLLMs) now match the downstream performance of their autoregressive counterparts on many tasks, while holding the promise of being more efficient during inference. One particularly successful variant is masked discrete diffusion,…

December 11, 2025

Local-Curvature-Aware Knowledge Graph Embedding: An Extended Ricci Flow Approach

arXiv:2512.07332v2 Announce Type: replace Abstract: Knowledge graph embedding (KGE) relies on the geometry of the embedding space to encode semantic and structural relations. Existing methods place all entities on one homogeneous manifold, Euclidean, spherical, hyperbolic, or their product/multi-curvature variants, to…

December 11, 2025

Spectral Embedding via Chebyshev Bases for Robust DeepONet Approximation

arXiv:2512.09165v1 Announce Type: new Abstract: Deep Operator Networks (DeepONets) have become a central tool in data-driven operator learning, providing flexible surrogates for nonlinear mappings arising in partial differential equations (PDEs). However, the standard trunk design based on fully connected layers…

December 11, 2025

A Multivariate Bernoulli-Based Sampling Method for Multi-Label Data with Application to Meta-Research

arXiv:2512.08371v2 Announce Type: replace Abstract: Datasets may contain observations with multiple labels. If the labels are not mutually exclusive, and if the labels vary greatly in frequency, obtaining a sample that includes sufficient observations with scarcer labels to make inferences…

December 11, 2025

Understanding the Failure Modes of Transformers through the Lens of Graph Neural Networks

arXiv:2512.09182v1 Announce Type: new Abstract: Transformers and more specifically decoder-only transformers dominate modern LLM architectures. While they have shown to work exceptionally well, they are not without issues, resulting in surprising failure modes and predictably asymmetric performance degradation. This article…

December 11, 2025

New materials could boost the energy efficiency of microelectronics

By stacking multiple active components based on new materials on the back end of a computer chip, this new approach reduces the amount of energy wasted during computation.

December 11, 2025

RIFT: A Scalable Methodology for LLM Accelerator Fault Assessment using Reinforcement Learning

arXiv:2512.09829v1 Announce Type: cross Abstract: The massive scale of modern AI accelerators presents critical challenges to traditional fault assessment methodologies, which face prohibitive computational costs and provide poor coverage of critical failure modes. This paper introduces RIFT (Reinforcement Learning-guided Intelligent…

December 11, 2025

Neural Diversity Regularizes Hallucinations in Language Models

arXiv:2510.20690v2 Announce Type: replace-cross Abstract: Language models continue to hallucinate despite increases in parameters, compute, and data. We propose neural diversity — decorrelated parallel representations — as a principled mechanism that reduces hallucination rates at fixed parameter and data budgets.…

December 11, 2025

The Ky Fan Norms and Beyond: Dual Norms and Combinations for Matrix Optimization

arXiv:2512.09678v1 Announce Type: cross Abstract: In this article, we explore the use of various matrix norms for optimizing functions of weight matrices, a crucial problem in training large language models. Moving beyond the spectral norm underlying the Muon update, we…

December 11, 2025