DualTune: Decoupled Fine-Tuning for On-Device Agentic Systems
arXiv:2510.00229v1 Announce Type: new Abstract: The deployment of Large Language Models (LLMs) as agentic orchestrators has revolutionized task automation, but the need for privacy-preserving, cost-effective solutions demands on-device inference capabilities. However, local LLMs consistently underperform compared to frontier models in…
