BEACON: Bayesian Optimal Stopping for Efficient LLM Sampling
arXiv:2510.15945v1 Announce Type: new Abstract: Sampling multiple responses is a common way to improve LLM output quality, but it comes at the cost of additional computation. The key challenge is deciding when to stop generating new samples to balance accuracy…
