Archives AI News

Offline Constrained RLHF with Multiple Preference Oracles

arXiv:2604.00200v1. Announce type: new.
Abstract: We study offline constrained reinforcement learning from human feedback with multiple preference oracles. Motivated by applications that trade off performance with safety or fairness, we aim to maximize target population utility subject to a minimum…

Diagnosing Neural Convergence with Topological Alignment Spectra

arXiv:2411.08687v2. Announce type: replace.
Abstract: Representational similarity in neural networks is inherently scale-dependent, yet widely used metrics such as Centered Kernel Alignment (CKA) and Procrustes analysis provide only global scalar estimates. These scalars often fail to distinguish micro-scale geometric jitter…

TempoControl: Temporal Attention Guidance for Text-to-Video Models

arXiv:2510.02226v3. Announce type: replace-cross.
Abstract: Recent advances in generative video models have enabled the creation of high-quality videos based on natural language prompts. However, these models frequently lack fine-grained temporal control, meaning they do not allow users to specify when…

Exact Graph Learning via Integer Programming

arXiv:2601.20589v2. Announce type: replace-cross.
Abstract: Learning the dependence structure among variables in complex systems is a central problem across medical, natural, and social sciences. These structures can be naturally represented by graphs, and the task of inferring such graphs from…