| | --- |
| | license: mit |
| | --- |
| | |
| | ### (NeurIPS 2023) Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models |
| | ### Official Model Repo |
| |
|
| | #### Model Include: |
| | - Stage1-CAVP Pretrained Model. |
| | - Stage2-LDM Pretrained Model. |
| | - Double Guidance Classifier. |
| |
|
| | <p align="center"> |
| | <img src="teaser.png"> |
| | </p> |
| | |
| | ## BibTeX |
| |
|
| | ```bibtex |
| | @misc{luo2023difffoley, |
| | title={Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models}, |
| | author={Simian Luo and Chuanhao Yan and Chenxu Hu and Hang Zhao}, |
| | year={2023}, |
| | eprint={2306.17203}, |
| | archivePrefix={arXiv}, |
| | primaryClass={cs.SD} |
| | } |
| | ``` |
| |
|
| |
|
| |
|