Enabling Self-Improving Agents to Learn at Test Time With Human-In-The-Loop Guidance
arXiv:2507.17131v2 Announce Type: replace Abstract: Large language model (LLM) agents often struggle in environments where rules and required domain knowledge frequently change, such as regulatory compliance and user risk screening. Current approaches, like offline fine-tuning and standard prompting, are insufficient…
