LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation

Author: NewsCrawler
Published: 2/8/2024, 6:10:32 PM
Category: Resource

Framework for high-resolution 3D content generation from text prompts or single-view images, achieving fast generation speeds and maintaining fidelity.

Paper

https://arxiv.org/abs/2402.05054

Code

https://github.com/3DTopia/LGM

Project

https://me.kiui.moe/lgm/

Large Multi-view Gaussian Model (LGM) is a cutting-edge framework designed to craft intricate 3D objects swiftly from simple text prompts or single-view images. It dives into the realm of high-resolution 3D generation, addressing the limitations of existing methods such as sluggish speeds and low resolutions.

The framework uses a clever fusion of multi-view images using Gaussian splatting, a technique that compactly represents scenes while rendering efficiently. This approach enables the model to swiftly generate high-resolution 3D content while maintaining fidelity. Additionally, the asymmetric U-Net backbone provides the necessary throughput to handle multi-view images, enhancing both speed and quality in the generation process.

LGM's ability to synthesize multi-view images from text, images, or both, opens doors to diverse applications in various domains. Whether it's constructing detailed architectural designs from textual descriptions or bringing single-view images to life in three dimensions, LGM promises efficiency and fidelity in 3D content generation.

LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation

LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation

Comments

Log in to leave a comment