Pocket-TTS seems to be a TTS server that returns audio much faster locally than Piper, so I built a container that enables it via Wyoming protocol and zeroconf to be available in Home Assistant.
There is the ability to use an audio snippet to clone a voice that would be used by pocket-TTS, I haven’t enabled anything like that yet. PRs welcome.


I can’t pull the image:
docker run -d \ --name pocket-tts-wyoming \ -p 10201:10201 \ -e DEFAULT_VOICE=alba \ -v pocket-tts-hf-cache:/root/.cache/huggingface \ -v pocket-tts-cache:/root/.cache/pocket_tts \ pocket-tts-wyoming Unable to find image 'pocket-tts-wyoming:latest' locally docker: Error response from daemon: pull access denied for pocket-tts-wyoming, repository does not exist or may require 'docker login': denied: requested access to the resource is deniedI am logged into Docker as well via docker login.
edit I cloned the repo via Git and was able to get it to build and run.