OcclusionFormer tackles image generation occlusion with new framework

作者 PulseAugur 编辑部 · [2 个来源] · 2026-05-20 16:10

Researchers have developed OcclusionFormer, a new framework designed to improve layout-grounded image generation by explicitly handling inter-object occlusion. Existing models struggle when bounding boxes overlap, leading to ambiguous or inconsistent layering. OcclusionFormer addresses this by using a novel Diffusion Transformer that models Z-order priority and employs volume rendering for compositing. The approach is supported by a new dataset, SA-Z, which includes explicit occlusion ordering and pixel-level annotations, leading to enhanced semantic consistency and accuracy in generated images. AI

影响 Improves spatial controllability in image generation models by resolving complex occlusion relationships.

排序理由 The cluster describes a new research paper introducing a novel framework and dataset for image generation.

在 Hugging Face Daily Papers 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

Hugging Face Daily Papers TIER_1 English(EN) · 2026-05-20 16:10

OcclusionFormer: Arranging Z-Order for Layout-Grounded Image Generation

Recent layout-to-image models have achieved remarkable progress in spatial controllability. However, they still struggle with inter-object occlusion. When bounding boxes overlap, most existing methods lack explicit occlusion information, which makes the generation in intersection…
arXiv cs.CV TIER_1 English(EN) · Henghui Ding · 2026-05-20 16:10

OcclusionFormer: Arranging Z-Order for Layout-Grounded Image Generation

Recent layout-to-image models have achieved remarkable progress in spatial controllability. However, they still struggle with inter-object occlusion. When bounding boxes overlap, most existing methods lack explicit occlusion information, which makes the generation in intersection…

报道来源 [2]

OcclusionFormer: Arranging Z-Order for Layout-Grounded Image Generation

OcclusionFormer: Arranging Z-Order for Layout-Grounded Image Generation

相关实体

相关话题