Even Heads Fix Odd Errors: Mechanistic Discovery and Surgical Repair in Transformer Attention
arXiv:2508.19414v1 Announce Type: new
Abstract: We present a mechanistic case study of a format-dependent reasoning failure in Llama-3.1-8B-Instruct, where the model incorrectly judges “9.11” as larger than…
