Archives AI News

FedRef: Communication-Efficient Bayesian Fine-Tuning using a Reference Model

arXiv:2506.23210v3 Announce Type: replace Abstract: Federated learning (FL) collaboratively trains artificial intelligence (AI) models to ensure user data privacy. Sharing only model updates generated from local training on client data with the server enhances user data privacy. However, model performance…

Inference-Time Personalized Alignment with a Few User Preference Queries

arXiv:2511.02966v1 Announce Type: new Abstract: We study the problem of aligning a generative model’s response with a user’s preferences. Recent works have proposed several different formulations for personalized alignment; however, they either require a large amount of user preference queries…

Test-time Adaptation of Tiny Recursive Models

arXiv:2511.02886v1 Announce Type: new Abstract: Prior to the close of the 2025 ARC Prize competition, the leading open source approach – known as TRM, or Tiny Recursive Models – involved training a 7M parameter recursive neural network on augmented variants…

Value of Information-Enhanced Exploration in Bootstrapped DQN

arXiv:2511.02969v1 Announce Type: new Abstract: Efficient exploration in deep reinforcement learning remains a fundamental challenge, especially in environments characterized by high-dimensional states and sparse rewards. Traditional exploration strategies that rely on random local policy noise, such as $epsilon$-greedy and Boltzmann…