FORGE: Fine-grained Multimodal Evaluation for Manufacturing Scenarios
arXiv:2604.07413v2 Announce Type: replace-cross Abstract: The manufacturing sector is increasingly adopting Multimodal Large Language Models (MLLMs) to transition from simple perception to autonomous execution, yet current evaluations fail to reflect the rigorous demands of real-world manufacturing environments. Progress is hindered…
