GPO: Learning from Critical Steps to Improve LLM Reasoning
arXiv:2509.16456v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly used in various domains, showing impressive potential on different tasks. Recently, reasoning LLMs have been proposed to improve the textit{reasoning} or textit{thinking} capabilities of LLMs to solve complex problems.…
