Archives AI News

Transformer Copilot: Learning from The Mistake Log in LLM Fine-tuning

arXiv:2505.16270v2 Announce Type: replace-cross Abstract: Large language models are typically adapted to downstream tasks through supervised fine-tuning on domain-specific data. While standard fine-tuning focuses on minimizing generation loss to optimize model parameters, we take a deeper step by retaining and…

STAMP: Spatial-Temporal Adapter with Multi-Head Pooling

arXiv:2511.10848v1 Announce Type: new Abstract: Time series foundation models (TSFMs) pretrained on data from multiple domains have shown strong performance on diverse modeling tasks. Various efforts have been made to develop foundation models specific to electroencephalography (EEG) data, which records…

ExPairT-LLM: Exact Learning for LLM Code Selection by Pairwise Queries

arXiv:2511.10855v1 Announce Type: new Abstract: Despite recent advances in LLMs, the task of code generation is still challenging. To cope, code selection algorithms select the best program from multiple programs generated by an LLM. However, existing algorithms can fail to…

Private Zeroth-Order Optimization with Public Data

arXiv:2511.10859v1 Announce Type: new Abstract: One of the major bottlenecks for deploying popular first-order differentially private (DP) machine learning algorithms (e.g., DP-SGD) lies in their high computation and memory cost, despite the existence of optimized implementations. Zeroth-order methods have promise…

Bi-Level Contextual Bandits for Individualized Resource Allocation under Delayed Feedback

arXiv:2511.10572v2 Announce Type: replace-cross Abstract: Equitably allocating limited resources in high-stakes domains-such as education, employment, and healthcare-requires balancing short-term utility with long-term impact, while accounting for delayed outcomes, hidden heterogeneity, and ethical constraints. However, most learning-based allocation frameworks either assume…