A circuit for predicting hierarchical structure in-context in Large Language Models
arXiv:2509.21534v1 Announce Type: new Abstract: Large Language Models (LLMs) excel at in-context learning, the ability to use information provided as context to improve prediction of future tokens. Induction heads have been argued to play a crucial role for in-context learning…
