Generating Fine Details of Entity Interactions
arXiv:2504.08714v2
Abstract: Recent text-to-image models excel at generating high-quality object-centric images from instructions. However, images should also encapsulate rich interactions between objects, where existing models often fall short, likely due to limited training data and benchmarks for…
