| --- |
| license: mit |
| --- |
| |
| ### (NeurIPS 2023) Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models |
| ### Official Model Repo |
|
|
| #### Model Include: |
| - Stage1-CAVP Pretrained Model. |
| - Stage2-LDM Pretrained Model. |
| - Double Guidance Classifier. |
|
|
| <p align="center"> |
| <img src="teaser.png"> |
| </p> |
| |
| ## BibTeX |
|
|
| ```bibtex |
| @misc{luo2023difffoley, |
| title={Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models}, |
| author={Simian Luo and Chuanhao Yan and Chenxu Hu and Hang Zhao}, |
| year={2023}, |
| eprint={2306.17203}, |
| archivePrefix={arXiv}, |
| primaryClass={cs.SD} |
| } |
| ``` |
|
|
|
|
|
|