Revisiting Voice Cloning with GPT-SoVITS and so on

Foreword: My last article on voice cloning was published more than a year ago, and here we are again to adopt some of the latest advancements. Referring to Chinese sources such as this blog and this video, I tried out new tools for my audiobook service, including CosyVoice, F5-TTS, GPT-SoVITS, and fish-speech. Before we start, I recommend that you:

Install miniconda for dependency sanity: wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh && sudo chmod +x Miniconda3-latest-Linux-x86_64.sh && bash Miniconda3-latest-Linux-x86_64.sh

Set up a PyTorch environment as needed and confirm it with: python -m torch.utils.collect_env ...

June 11, 2025 · 8 min · Jun

A Deep Dive into Voice Cloning with SoftVC VITS and Bert-VITS2

In the previous post, I tried out TTS Generation WebUI and found it interesting, so I decided to train a usable model with my own voice. This voice cloning project explores both SVC for voice changing and VITS for text-to-speech. No single tool does every job, so I tested several for this project. Many of the good guides, like this, this and this, are in Chinese, so I thought it would be useful to post my notes in English. ...

January 27, 2024 · 11 min · Jun