Procedural Environment Generation for Tool-Use Agents
arXiv:2506.11045v2 Announce Type: replace Abstract: Although the power of LLM tool-use agents has ignited a flurry of recent research in this area, the curation of tool-use training data remains an open problem$-$especially for online RL training. Existing approaches to synthetic…
