I just found this.
This is huge!
As a German, I use Thorsten medium, since he simply made the best dataset.
Mixing English with German, speaking numbers and single letters, pausing on just a line break without a “.”: all of those can be essential.
And… it is nearly perfect! And all local!
This is crazy!
eSpeak can finally go to rest!
I downloaded Thorsten medium and wondered what the difference to Thorsten high would be.
Thorsten high is silly, haha. Emotional is also not meant for TTS, more for research, I think.
I think Thorsten made the only good model in German. I really want to make my own! Or get some famous people on board?
You can listen to them here