Video-to-Video
Diffusers
Safetensors
Diffuman4DPipeline
3d-generation
4d-generation
human
avatar
multi-view video
Instructions to use krahets/Diffuman4D with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusers
How to use krahets/Diffuman4D with Diffusers:
pip install -U diffusers transformers accelerate
import torch from diffusers import DiffusionPipeline # switch to "mps" for apple devices pipe = DiffusionPipeline.from_pretrained("krahets/Diffuman4D", dtype=torch.bfloat16, device_map="cuda") prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" image = pipe(prompt).images[0] - Notebooks
- Google Colab
- Kaggle
| license: openrail++ | |
| datasets: | |
| - krahets/dna_rendering_processed | |
| base_model: | |
| - stabilityai/stable-diffusion-2-1-base | |
| pipeline_tag: video-to-video | |
| tags: | |
| - 3d-generation | |
| - 4d-generation | |
| - human | |
| - avatar | |
| - multi-view video | |
| # Diffuman4D Model | |
| [**Project Page**](https://diffuman4d.github.io/) | [**Paper**](https://arxiv.org/abs/2507.13344) | [**Code**](https://github.com/zju3dv/Diffuman4D) | [**Dataset**](https://huggingface.co/datasets/krahets/dna_rendering_processed) | |
| > The official model repo for Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models. | |
| <img src="assets/images/teaser_dna.gif" width="100%" alt="teaser"> | |
| Diffuman4D enables high-fidelity free-viewpoint rendering of human performances from sparse-view videos. | |
| ## Usage | |
| See the [GitHub repo](https://github.com/zju3dv/Diffuman4D) for detailed usage. | |
| ## Cite | |
| ``` | |
| @inproceedings{jin2025diffuman4d, | |
| title={Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models}, | |
| author={Jin, Yudong and Peng, Sida and Wang, Xuan and Xie, Tao and Xu, Zhen and Yang, Yifan and Shen, Yujun and Bao, Hujun and Zhou, Xiaowei}, | |
| booktitle={International Conference on Computer Vision (ICCV)}, | |
| year={2025} | |
| } | |
| ``` | |