SceneLCM: End-to-End Layout-Guided Interactive Indoor Scene Generation with Latent Consistency Model

Automated generation of complex, interactive indoor scenes tailored to a user prompt remains a formidable challenge. While existing methods achieve indoor scene synthesis, they struggle with rigid editing constraints, physical incoherence, excessive human effort, single-room limitations, and suboptimal material quality.

To address these limitations, we propose SceneLCM, an end-to-end framework that combines a Large Language Model (LLM) for layout design with a Latent Consistency Model (LCM) for scene optimization. Our approach decomposes scene generation into four modular pipelines: (1) Layout Generation. We employ LLM-guided 3D spatial reasoning to convert textual descriptions into parametric blueprints (3D layouts), and a programmatic validation mechanism iteratively refines the layout parameters through LLM-mediated dialogue loops. (2) Furniture Generation. SceneLCM employs Consistency Trajectory Sampling (CTS), a consistency distillation sampling loss guided by the LCM, to produce semantically rich, high-quality representations quickly. We also offer two theoretical justifications, showing that our CTS loss is equivalent to the consistency loss and that its distillation error is bounded by the truncation error of the Euler solver. (3) Environment Optimization. We use a multiresolution texture field to encode the appearance of the scene and optimize it via the CTS loss. To maintain cross-geometric texture coherence, we introduce a normal-aware cross-attention decoder that predicts RGB by cross-attending to the anchor locations in geometrically heterogeneous instances. (4) Physical Editing. SceneLCM supports physical editing by integrating physical simulation, achieving persistent physical realism.
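The validate-and-refine loop in the Layout Generation stage can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `Box` footprint format, the two checks (room bounds and pairwise overlap), the `refine` loop, and the `stub_llm` function that stands in for a real LLM call. It is not SceneLCM's actual layout format or validator.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Box:
    """Axis-aligned furniture footprint on the floor plane (hypothetical format)."""
    name: str
    x: float  # min-corner x, in metres
    y: float  # min-corner y, in metres
    w: float  # extent along x
    d: float  # extent along y


def validate(layout: List[Box], room_w: float, room_d: float) -> List[str]:
    """Return readable violations; an empty list means the layout passes."""
    issues = []
    for b in layout:
        if b.x < 0 or b.y < 0 or b.x + b.w > room_w or b.y + b.d > room_d:
            issues.append(f"{b.name} is outside the room")
    for i, a in enumerate(layout):
        for b in layout[i + 1:]:
            overlap_x = a.x < b.x + b.w and b.x < a.x + a.w
            overlap_y = a.y < b.y + b.d and b.y < a.y + a.d
            if overlap_x and overlap_y:
                issues.append(f"{a.name} overlaps {b.name}")
    return issues


def refine(
    layout: List[Box],
    room_w: float,
    room_d: float,
    ask_llm: Callable[[List[Box], List[str]], List[Box]],
    max_rounds: int = 5,
) -> Tuple[List[Box], bool]:
    """Feed violations back to the (assumed) LLM until the layout validates."""
    for _ in range(max_rounds):
        issues = validate(layout, room_w, room_d)
        if not issues:
            return layout, True
        layout = ask_llm(layout, issues)
    return layout, not validate(layout, room_w, room_d)


def stub_llm(layout: List[Box], issues: List[str]) -> List[Box]:
    """Stand-in for a real LLM reply: shift the last box 1 m along x."""
    *rest, last = layout
    return rest + [Box(last.name, last.x + 1.0, last.y, last.w, last.d)]


start = [Box("bed", 0.0, 0.0, 2.0, 1.6), Box("desk", 1.5, 0.0, 1.2, 0.6)]
final, ok = refine(start, 4.0, 3.0, stub_llm)
```

In this toy run the desk initially overlaps the bed, the violation message is passed to the stand-in LLM, and the revised layout passes on the next round. A real system would replace `stub_llm` with a prompt that includes the violation messages and would parse the model's reply back into layout parameters.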

Extensive experiments validate SceneLCM's superiority over state-of-the-art techniques, showing its wide-ranging potential for diverse applications.

bedroom style1

prompt: A Boho-Hippe style bedroom, beautiful floor, a window on wall, photorealistic, HD, 8K

bedroom style2

prompt: A Bohemian style bedroom, beautiful floor, a window on wall, photorealistic, HD, 8K

bedroom style3

prompt: A cubism art style bedroom, beautiful floor, a window on wall, photorealistic, HD, 8K

bedroom style4

prompt: A Modern Children bedroom, beautiful floor, a window on wall, photorealistic, HD, 8K

dining room style1

prompt: A Gypsy-classic style dining room, beautiful floor, a window on wall, photorealistic, HD, 8K

dining room style2

prompt: A cubism art style dining room, beautiful floor, a window on wall, photorealistic, HD, 8K

dining room style3

prompt: A Neo-hipple style dining room, beautiful floor, a window on wall, photorealistic, HD, 8K

dining room style4

prompt: A Gypsy dining room, beautiful floor, a window on wall, photorealistic, HD, 8K

Render Environment Across Different Rooms

We can navigate between multiple rooms while rendering them simultaneously.

Physical Editing

We tilt the room by 30 degrees, causing the furniture to move due to gravity.
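As a rough check on why a 30-degree tilt sets furniture in motion, the sketch below decomposes gravity in the tilted room's frame and applies a simple Coulomb friction test. The page does not state which physics simulator or friction parameters are used, so the function names and the friction coefficients here are purely illustrative assumptions.

```python
import math

G = 9.81  # gravitational acceleration in m/s^2


def gravity_in_tilted_room(tilt_deg: float):
    """Gravity components in the room frame after tilting the floor.

    Returns (along_floor, into_floor) in m/s^2.
    """
    t = math.radians(tilt_deg)
    return G * math.sin(t), G * math.cos(t)


def slides(tilt_deg: float, mu_static: float) -> bool:
    """Coulomb model: a resting object starts to slide when tan(tilt) > mu."""
    return math.tan(math.radians(tilt_deg)) > mu_static


along, into = gravity_in_tilted_room(30.0)
print(f"along floor: {along:.3f} m/s^2, into floor: {into:.3f} m/s^2")
print("slides with mu=0.4:", slides(30.0, 0.4))  # hypothetical coefficient
print("slides with mu=0.7:", slides(30.0, 0.7))  # hypothetical coefficient
```

At 30 degrees, half of gravity (about 4.9 m/s^2) acts along the floor, and tan(30°) is roughly 0.577, so under this simplified model any object whose static friction coefficient is below that value would start to slide.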

Object1

prompt: A cozy office chair with a big pink back, HD, 4K.

Object2

prompt: A swivel office chair with some beautiful texture, HD, 4K.

Object3

prompt: A rectangular glass-top coffee table with a metal frame. HD, 4K.

Object4

prompt: A beautiful office chair, photorealistic, HD, 8K

object5

prompt: A modern comfortable sofa, HD, 4K.

object6

prompt: A wooden desk with metal legs, photorealistic, HD, 8K

object7

prompt: A wooden desk, rich texture, photorealistic, HD, 8K

object8

prompt: A green comfortable sofa, photorealistic, HD, 4K

object9

prompt: A portrait of the Ghost Rider, head, HDR, photorealistic, 8K.

object10

prompt: A portrait of Groot, head, HDR, photorealistic, 8K

object11

prompt: A Gundam model, with detailed panel lines and decals, photorealistic, 8K, HDR

object12

prompt: A Gundam Barbatos Lupus Rex model, Gundam, Barbatos, with detailed panel lines and decals, photorealistic, 8K, HDR.

We generate multiple rooms simultaneously. In the previous examples, although each video was rendered within a single room, the other rooms and their furniture remain visible through the doorway.

Japanese Style

Prompt: A Japanese style bedroom, beautiful floor, a window on wall, photirealistic, HD, 8k

Although Japanese and Chinese styles share similar color schemes, Japanese style predominantly features rectangular patterns with cherry blossoms as decorative motifs.

Chinese Style

Prompt: A chinese traditional style entrance, beautiful floor, a window on wall, photirealistic, HD, 8k

Chinese style predominantly incorporates stripes and paper-cut window decorations as ornamental elements.

Texture map optimization

Prompt: A baroque style entrance, beautiful floor, a window on wall, photirealistic, HD, 8k

Directly optimizing the texture map via the CTS loss.

The room shows significant multi-view inconsistency: one side appears red while the other appears green. Numerous noise points are also present.

UV Parameters

Prompt: A baroque style entrance, beautiful floor, a window on wall, photirealistic, HD, 8k

Our method.

The texture is consistent across views and visually appealing.

SceneCraft with Industrial Style

prompt: This is one view of a bedroom painted by Industrial style.

DreamScene with Industrial Style

prompt1: Industrial style, 4k, 8k, best quality, ultra-detailed, finely detail, highres, high resolution

prompt2: A DSLR photo of an Industrial style bedroom

Ours with Industrial Style

prompt: A Industrial style entrance, beautiful floor, a window on wall, photirealistic, HD, 8k

SceneCraft depth map with Industrial Style

DreamScene depth map with Industrial Style

Our depth map with Industrial Style

SceneCraft with Baroque Style

prompt: This is one view of a bedroom painted by baroque style.

DreamScene with Baroque Style

prompt1: Baroque style, 4k, 8k, best quality, ultra-detailed, finely detail, highres, high resolution

prompt2: A DSLR photo of an Baroque style bedroom

Ours with Baroque Style

prompt: A Baroque style entrance, beautiful floor, a window on wall, photirealistic, HD, 8k


Given a textual description of the house, SceneLCM automatically generates multi-room, multi-scale indoor scenes without human intervention.

Abstract

Contents


Environment Editing

bedroom style1

bedroom style2

bedroom style3

bedroom style4

dining room style1

dining room style2

dining room style3

dining room style4

Additional Results

Render Environment Across Different Rooms

Physical Editing

Object Generation

Object1

Object2

Object3

Object4

object5

object6

object7

object8

object9

object10

object11

object12

Animation

Render Environment Across Different Rooms

Japanese Style VS Chinese Traditional Style

Japanese Style

Chinese Style

Texture map optimization VS UV Parameters

Texture map optimization

UV Parameters

Comparison of Additional Styles (Industrial, Baroque)

SceneCraft with Industrial Style

DreamScene with Industrial Style

Ours with Industrial Style

SceneCraft depth map with Industrial Style

DreamScene depth map with Industrial Style

Our depth map with Industrial Style

SceneCraft with Baroque Style

DreamScene with Baroque Style

Ours with Baroque Style

SceneCraft depth map with Baroque Style

DreamScene depth map with Baroque Style

Our depth map with Baroque Style