XTTS = Garbage Results (Help!)

aallsbury@alien.top · 2 years ago

XTTS = Garbage Results (Help!)

LeanderGem@alien.top · 2 years ago

Check out PIPER TTS, pretty good results and it’s super fast:

https://www.youtube.com/watch?v=GGvdq3giiTQ&ab_channel=Thorsten-Voice

Herr_Drosselmeyer@alien.top · 2 years ago

Use 10 second clips of clean audio, no music, no background noise. I like to record samples from audiobooks. Free samples on Amazon recorded with audacity work well for me.

One thing to note, my install (an implementation for SillyTavern) somehow got corrupted, no idea how. It still worked but sounded way worse. Reinstall fixed that so maybe that’s happening to you too.

Murky-Ladder8684@alien.top · 2 years ago

In the instructions on github it said to use mono 24000 wav. Double check the info though.

hjill@alien.top · 2 years ago

I had to go back to the previous version that’s on huggingface to get good audio. Somehow the latest version sounds much worse.

Edit: see https://github.com/coqui-ai/TTS/issues/3309#issue-2010577124

----Val----@alien.top · 2 years ago

Check which model you are using. The latest 2.0.3 XTTSv2 is really wonky. Manually revert it to 2.0.2.

aallsbury@alien.top · 2 years ago

Do you know an easy way to revert using the Oobabooga extension? Thanks!