EvoXplain: When Machine Learning Models Agree on Predictions but Disagree on Why — Measuring Mechanistic Multiplicity Across Training Runs
arXiv:2512.22240v1 Announce Type: new Abstract: Machine learning models are primarily judged by predictive performance, especially in applied settings. Once a model reaches high accuracy, its explanation is often assumed to be correct and trustworthy. However, this assumption raises an overlooked…
