Archives AI News

Diagnosing Neural Convergence with Topological Alignment Spectra

arXiv:2411.08687v2 Announce Type: replace Abstract: Representational similarity in neural networks is inherently scale-dependent, yet widely used metrics such as Centered Kernel Alignment (CKA) and Procrustes analysis provide only global scalar estimates. These scalars often fail to distinguish micro-scale geometric jitter…

Order Optimal Regret Bounds for Sharpe Ratio Optimization under Thompson Sampling

arXiv:2508.13749v3 Announce Type: replace Abstract: In this paper, we study sequential decision-making for maximizing the Sharpe ratio (SR) in a stochastic multi-armed bandit (MAB) setting. Unlike standard bandit formulations that maximize cumulative reward, SR optimization requires balancing expected return and…

L’evy-Flow Models: Heavy-Tail-Aware Normalizing Flows for Financial Risk Management

arXiv:2604.00195v1 Announce Type: new Abstract: We introduce L’evy-Flows, a class of normalizing flow models that replace the standard Gaussian base distribution with L’evy process-based distributions, specifically Variance Gamma (VG) and Normal-Inverse Gaussian (NIG). These distributions naturally capture heavy-tailed behavior while…

Lossy Common Information in a Learnable Gray-Wyner Network

arXiv:2601.21424v2 Announce Type: replace Abstract: Many computer vision tasks share substantial overlapping information, yet conventional codecs tend to ignore this, leading to redundant and inefficient representations. The Gray-Wyner network, a classical concept from information theory, offers a principled framework for…

Experiential Reflective Learning for Self-Improving LLM Agents

arXiv:2603.24639v2 Announce Type: replace Abstract: Recent advances in large language models (LLMs) have enabled the development of autonomous agents capable of complex reasoning and multi-step problem solving. However, these agents struggle to adapt to specialized environments and do not leverage…

Offline Constrained RLHF with Multiple Preference Oracles

arXiv:2604.00200v1 Announce Type: new Abstract: We study offline constrained reinforcement learning from human feedback with multiple preference oracles. Motivated by applications that trade off performance with safety or fairness, we aim to maximize target population utility subject to a minimum…