Project Overview
Last updated
Was this helpful?
Last updated
Was this helpful?
Diff-SVC is an open-source Singing Voice Conversion project using the Diffusion model.
GitHub repository: Author:
It is your responsibility to follow any rules and local laws that may apply when creating and sharing models.
In this guide, we will go through the 4 stages of the Diff-SVC pipeline: Dataset Preparation, Preprocessing, Training, and Inference.
Some commands and configs will be slightly different depending on the vocoder you choose. The 44.1kHz vocoder will output audio at a higher sampling rate than the 24kHz one, which usually leads to higher audio quality. So it's recommended to use 44.1kHz to train new models.
Singing Voice Conversion aims to convert one singer's voice to another's while Singing Voice Synthesis aims to generate singing voices from music scores and lyrics.
Example:
SVS:
VOCALOID
UTAU
Synthesizer V
DiffSinger
...
SVC
Vocaloid 6 (VocaloChanger feature)
Diff-SVC
...
Diff-SVC is an SVC project, hence it takes vocal audio instead of midi and lyric as input. Check the partner project if you are looking for an open-source SVS project.
(paper & code) -> (OpenVPI's Fork) -> (by@prophesier)
Diff-SVC is based on DiffSinger, DiffSinger (openvpi maintenance version), and .
This project has NO connection with the of the same name DiffSVC, please do not confuse them!