When Does Supervised Training Pay Off? The Hidden Economics of Object Detection in the Era of Vision-Language Models
arXiv:2510.11302v1 (Announce Type: cross)

Abstract: Object detection systems have traditionally relied on supervised learning with manually annotated bounding boxes, achieving high accuracy at the cost of substantial annotation investment. The emergence of Vision-Language Models (VLMs) offers an alternative paradigm enabling…
