Restoring Calibration for Aligned Large Language Models: A Calibration-Aware Fine-Tuning Approach
arXiv:2505.01997v3 Announce Type: replace Abstract: One of the key technologies for the success of Large Language Models (LLMs) is preference alignment. However, a notable side effect of preference alignment is poor calibration: while the pre-trained models are typically well-calibrated, LLMs…
