| --- |
| license: apache-2.0 |
| base_model: |
| - Wan-AI/Wan2.1-T2V-1.3B |
| - Wan-AI/Wan2.1-T2V-14B |
| pipeline_tag: text-to-video |
| --- |
| # rCM: Score-Regularized Continuous-Time Consistency Model |
| [**Paper**](https://arxiv.org/abs/2510.08431) | [**Website**](https://research.nvidia.com/labs/dir/rcm) | [**Code**](https://github.com/NVlabs/rcm) |
|
|
| This repo holds converted Wan official checkpoints in rCM/TurboDiffusion style. |
|
|
| Specifically, rCM equivalently replaces the Conv3d layer in the original Wan with a Linear layer for patch embedding, facilitating further optimization. The layer weight is directly reshaped without value change, e.g., from shape [5120, 16, 1, 2, 2] (Conv3d) to shape [5120, 64] (Linear). |
|
|
| ## Citation |
|
|
| ``` |
| @article{zheng2025rcm, |
| title={Large Scale Diffusion Distillation via Score-Regularized Continuous-Time Consistency}, |
| author={Zheng, Kaiwen and Wang, Yuji and Ma, Qianli and Chen, Huayu and Zhang, Jintao and Balaji, Yogesh and Chen, Jianfei and Liu, Ming-Yu and Zhu, Jun and Zhang, Qinsheng}, |
| journal={arXiv preprint arXiv:2510.08431}, |
| year={2025} |
| } |
| ``` |