Archives AI News

Bi-Level Contextual Bandits for Individualized Resource Allocation under Delayed Feedback

arXiv:2511.10572v2 Announce Type: replace-cross Abstract: Equitably allocating limited resources in high-stakes domains-such as education, employment, and healthcare-requires balancing short-term utility with long-term impact, while accounting for delayed outcomes, hidden heterogeneity, and ethical constraints. However, most learning-based allocation frameworks either assume…

FedALT: Federated Fine-Tuning through Adaptive Local Training with Rest-of-World LoRA

arXiv:2503.11880v3 Announce Type: replace Abstract: Fine-tuning large language models (LLMs) in federated settings enables privacy-preserving adaptation but suffers from cross-client interference due to model aggregation. Existing federated LoRA fine-tuning methods, primarily based on FedAvg, struggle with data heterogeneity, leading to…

Convergence Bound and Critical Batch Size of Muon Optimizer

arXiv:2507.01598v3 Announce Type: replace Abstract: Muon, a recently proposed optimizer that leverages the inherent matrix structure of neural network parameters, has demonstrated strong empirical performance, indicating its potential as a successor to standard optimizers such as AdamW. This paper presents…

Transformers know more than they can tell — Learning the Collatz sequence

arXiv:2511.10811v1 Announce Type: new Abstract: We investigate transformer prediction of long Collatz steps, a complex arithmetic function that maps odd integers to their distant successors in the Collatz sequence ( $u_{n+1}=u_n/2$ if $u_n$ is even, $u_{n+1}=(3u_n+1)/2$ if $u_n$ is odd).…