Modality-Balanced Collaborative Distillation for Multi-Modal Domain Generalization
arXiv:2511.20258v1 (announce type: cross)
Abstract: Weight Averaging (WA) has emerged as a powerful technique for improving generalization by promoting convergence to flat regions of the loss landscape, which correlate with stronger out-of-distribution performance. However, applying WA directly to multi-modal domain generalization (MMDG)…
