How Does the Pretraining Distribution Shape In-Context Learning? Task Selection, Generalization, and Robustness
arXiv:2510.01163v1 Announce Type: cross Abstract: The emergence of in-context learning (ICL) in large language models (LLMs) remains poorly understood despite its consistent effectiveness, enabling models to adapt to new tasks from only a handful of examples. To clarify and improve…
