You will get better voice quality and fewer prosody errors. If this is not your case: proceed with this repository, but you might end up being disappointed by the results. If you're planning to work on a serious project, my strong advice: find another TTS repo. Go here for more info.

20/08/19: I'm working on resemblyzer, an independent package for the voice encoder (inference only). You can use your trained encoder models from this repo with it.

A GPU is recommended for training and for inference speed, but is not mandatory. Python 3.5 or greater should work, but you'll probably have to tweak the dependencies' versions. I recommend setting up a virtual environment using venv, but this is optional. One of the prerequisites is necessary for reading audio files. When choosing the install options, pick the latest stable version, your operating system, your package manager (pip by default), and finally any of the proposed CUDA versions if you have a GPU; otherwise pick CPU. Install the remaining requirements with pip install -r requirements.txt.

Pretrained models are now downloaded automatically. If this doesn't work for you, you can manually download them here.

(Optional) Test Configuration

Before you download any dataset, you can begin by testing your configuration.
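The basic prerequisites mentioned in the text (Python 3.5 or greater, an optional venv, a requirements.txt to install from) can be checked with a few lines of Python before installing anything. This is only a minimal sketch and not part of the repository: the function name check_environment and the result keys are hypothetical, and it does not check for a GPU or for the audio prerequisite.

```python
import sys
from pathlib import Path


def check_environment(version_info=None, requirements="requirements.txt"):
    """Report whether the basic prerequisites from the text are met.

    Hypothetical helper for illustration; not provided by the repository.
    """
    if version_info is None:
        version_info = sys.version_info
    # Inside a venv, sys.prefix points at the environment while
    # sys.base_prefix points at the interpreter it was created from.
    base_prefix = getattr(sys, "base_prefix", sys.prefix)
    return {
        # The text says Python 3.5 or greater should work.
        "python_ok": tuple(version_info[:2]) >= (3, 5),
        # A virtual environment is recommended but optional.
        "in_venv": sys.prefix != base_prefix,
        # pip install -r requirements.txt needs this file to be present.
        "requirements_found": Path(requirements).is_file(),
    }


if __name__ == "__main__":
    for name, ok in check_environment().items():
        print("{}: {}".format(name, "yes" if ok else "no"))
```

Run it from the repository root so that the relative path to requirements.txt resolves. A "no" for in_venv is not an error, since the virtual environment is optional.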