MOON: Generative MLLM-based Multimodal Representation Learning for E-commerce Product Understanding
arXiv:2508.11999v2 Announce Type: replace-cross Abstract: With the rapid advancement of e-commerce, exploring general representations rather than task-specific ones has attracted increasing research attention. For product understanding, although existing discriminative dual-flow architectures drive progress in this field, they inherently struggle to…
