LayerCraft: Enhancing Text-to-Image Generation with CoT Reasoning and Layered Object Integration
arXiv:2504.00010v3 Announce Type: replace Abstract: Text-to-image (T2I) generation has made remarkable progress, yet existing systems still lack intuitive control over spatial composition, object consistency, and multi-step editing. We present $textbf{LayerCraft}$, a modular framework that uses large language models (LLMs) as…
