March 15, 2025
A revolutionary 3D generation model with unified SLAT representation for versatile and high-quality 3D asset creation.
We are excited to introduce ManiFold, our groundbreaking 3D generation model that represents a significant leap forward in AI-powered 3D asset creation. ManiFold features cutting-edge AI models designed for high performance across diverse 3D generation domains, delivering scalability, efficiency, and state-of-the-art results for versatile and high-quality 3D asset creation.
ManiFold is a large 3D asset generation model that takes in text or image prompts and generates high-quality 3D assets in various formats, such as Radiance Fields, 3D Gaussians, and meshes. The cornerstone of ManiFold is a unified Structured LATent (SLAT) representation that allows decoding to different output formats and Rectified Flow Transformers tailored for SLAT as the powerful backbones.
We provide large-scale pre-trained models with up to 2 billion parameters trained on a comprehensive 3D asset dataset of 500K diverse objects. ManiFold significantly surpasses existing methods, including recent ones at similar scales, and showcases flexible output format selection and local 3D editing capabilities which were not offered by previous models.
ManiFold introduces a novel 3D generation method built on several breakthrough innovations:
The SLAT representation integrates a sparsely-populated 3D grid with dense multiview visual features extracted from a powerful vision foundation model, comprehensively capturing both structural (geometry) and textural (appearance) information while maintaining flexibility during decoding.
Our rectified flow transformers are specifically designed for SLAT processing, enabling efficient and high-quality 3D generation that significantly outperforms existing methods across multiple evaluation metrics. The model's architecture allows for seamless integration into existing AI workflows while providing unprecedented flexibility in output format selection.