Archives AI News

PLEIADES: Building Temporal Kernels with Orthogonal Polynomials

arXiv:2405.12179v5 Announce Type: replace Abstract: We introduce a class of neural networks named PLEIADES (PoLynomial Expansion In Adaptive Distributed Event-based Systems), which contains temporal convolution kernels generated from orthogonal polynomial basis functions. We focus on interfacing these networks with event-based…

GDR-learners: Orthogonal Learning of Generative Models for Potential Outcomes

arXiv:2509.22953v1 Announce Type: new Abstract: Various deep generative models have been proposed to estimate potential outcomes distributions from observational data. However, none of them have the favorable theoretical property of general Neyman-orthogonality and, associated with it, quasi-oracle efficiency and double…

Variational Deep Learning via Implicit Regularization

arXiv:2505.20235v2 Announce Type: replace Abstract: Modern deep learning models generalize remarkably well in-distribution, despite being overparametrized and trained with little to no explicit regularization. Instead, current theory credits implicit regularization imposed by the choice of architecture, hyperparameters and optimization procedure.…

Functional Critic Modeling for Provably Convergent Off-Policy Actor-Critic

arXiv:2509.22964v1 Announce Type: new Abstract: Off-policy reinforcement learning (RL) with function approximation offers an effective way to improve sample efficiency by reusing past experience. Within this setting, the actor-critic (AC) framework has achieved strong empirical success. However, both the critic…

Adaptive Sample Scheduling for Direct Preference Optimization

arXiv:2506.17252v2 Announce Type: replace Abstract: Direct Preference Optimization (DPO) has emerged as an effective approach for aligning large language models (LLMs) with human preferences. However, its performance is highly dependent on the quality of the underlying human preference data. To…