Archives AI News

CAFEDistill: Learning Personalized and Dynamic Models through Federated Early-Exit Network Distillation

arXiv:2601.10015v1 Announce Type: new Abstract: Personalized Federated Learning (PFL) enables collaboratively model training on decentralized, heterogeneous data while tailoring them to each client’s unique distribution. However, existing PFL methods produce static models with a fixed tradeoff between accuracy and efficiency,…

January 16, 2026

Evaluating Large Language Models for Fair and Reliable Organ Allocation

arXiv:2504.03716v2 Announce Type: replace Abstract: Medical institutions are considering the use of LLMs in high-stakes clinical decision-making, such as organ allocation. In such sensitive use cases, evaluating fairness is imperative. However, existing evaluation methods often fall short; benchmarks are too…

January 16, 2026

Time Aggregation Features for XGBoost Models

arXiv:2601.10019v1 Announce Type: new Abstract: This paper studies time aggregation features for XGBoost models in click-through rate prediction. The setting is the Avazu click-through rate prediction dataset with strict out-of-time splits and a no-lookahead feature constraint. Features for hour H…

January 16, 2026

Learning normalized image densities via dual score matching

arXiv:2506.05310v3 Announce Type: replace Abstract: Learning probability models from data is at the heart of many machine learning endeavors, but is notoriously difficult due to the curse of dimensionality. We introduce a new framework for learning emph{normalized} energy (log probability)…

January 16, 2026

BPE: Behavioral Profiling Ensemble

arXiv:2601.10024v1 Announce Type: new Abstract: Ensemble learning is widely recognized as a pivotal strategy for pushing the boundaries of predictive performance. Traditional static ensemble methods, such as Stacking, typically assign weights by treating each base learner as a holistic entity,…

January 16, 2026

Learning Regularization Functionals for Inverse Problems: A Comparative Study

arXiv:2510.01755v2 Announce Type: replace Abstract: In recent years, a variety of learned regularization frameworks for solving inverse problems in imaging have emerged. These offer flexible modeling together with mathematical insights. The proposed methods differ in their architectural design and training…

January 16, 2026

Unlabeled Data Can Provably Enhance In-Context Learning of Transformers

arXiv:2601.10058v1 Announce Type: new Abstract: Large language models (LLMs) exhibit impressive in-context learning (ICL) capabilities, yet the quality of their predictions is fundamentally limited by the few costly labeled demonstrations that can fit into a prompt. Meanwhile, there exist vast…

January 16, 2026

How collective memory of the Rwandan genocide was preserved

Delia Wendel’s new book illuminates a painful and painstaking effort by citizens to bear witness to atrocities.

January 16, 2026

Permissive Information-Flow Analysis for Large Language Models

arXiv:2410.03055v3 Announce Type: replace Abstract: Large Language Models (LLMs) are rapidly becoming commodity components of larger software systems. This poses natural security and privacy problems: poisoned data retrieved from one component can change the model’s behavior and compromise the entire…

January 16, 2026

Disco-RAG: Discourse-Aware Retrieval-Augmented Generation

arXiv:2601.04377v3 Announce Type: replace-cross Abstract: Retrieval-Augmented Generation (RAG) has emerged as an important means of enhancing the performance of large language models (LLMs) in knowledge-intensive tasks. However, most existing RAG strategies treat retrieved passages in a flat and unstructured way,…

January 16, 2026