Multicalibration for LLM-based Code Generation
arXiv:2512.08810v1 Announce Type: cross
Abstract: As AI-based code generation becomes widespread, researchers are investigating the calibration of code LLMs – ensuring their confidence scores faithfully represent the true likelihood of code correctness. To do so, we investigate multicalibration, which can…
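
To make the calibration notion in the abstract concrete, here is a minimal sketch, not the paper's method (the abstract is truncated before any method is described). It assumes the standard binned expected calibration error (ECE) as the measure, and it reads multicalibration in its usual sense of calibration holding on each of several, possibly overlapping, subgroups. The group labels (`"python"`, `"rust"`), the function names, and the toy data are all hypothetical.

```python
from collections import defaultdict


def expected_calibration_error(confidences, correct, n_bins=10):
    """Binned ECE: size-weighted mean of |accuracy - mean confidence| per bin."""
    if not confidences:
        return 0.0
    bins = defaultdict(list)
    for c, y in zip(confidences, correct):
        # Clamp so a confidence of exactly 1.0 lands in the last bin.
        idx = min(int(c * n_bins), n_bins - 1)
        bins[idx].append((c, y))
    total = len(confidences)
    ece = 0.0
    for members in bins.values():
        mean_conf = sum(c for c, _ in members) / len(members)
        accuracy = sum(y for _, y in members) / len(members)
        ece += (len(members) / total) * abs(accuracy - mean_conf)
    return ece


def groupwise_calibration_error(confidences, correct, groups, n_bins=10):
    """ECE per group; each sample carries a collection of group labels."""
    by_group = defaultdict(lambda: ([], []))
    for c, y, labels in zip(confidences, correct, groups):
        for g in labels:
            by_group[g][0].append(c)
            by_group[g][1].append(y)
    return {
        g: expected_calibration_error(cs, ys, n_bins)
        for g, (cs, ys) in by_group.items()
    }


# Hypothetical toy data: every generated program gets confidence 0.5.
# correct[i] is 1 if the i-th program passed its tests, else 0.
confidences = [0.5, 0.5, 0.5, 0.5]
correct = [1, 1, 0, 0]
groups = [{"python"}, {"python"}, {"rust"}, {"rust"}]

overall = expected_calibration_error(confidences, correct)
per_group = groupwise_calibration_error(confidences, correct, groups)
print(overall)    # 0.0: calibrated on average
print(per_group)  # each group has error 0.5: miscalibrated within groups
```

In this toy case the scores look perfectly calibrated over the whole dataset, yet they are off by 0.5 within each language group. That gap between aggregate and per-group calibration is the kind of failure a multicalibration criterion is meant to expose.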
