Pocket-TTS seems to be a TTS server that returns audio much faster locally than Piper, so I built a container that enables it via Wyoming protocol and zeroconf to be available in Home Assistant.

There is the ability to use an audio snippet to clone a voice that would be used by pocket-TTS, I haven’t enabled anything like that yet. PRs welcome.

  • node815@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    edit-2
    23 days ago

    I can’t pull the image:

    docker run -d \
      --name pocket-tts-wyoming \
      -p 10201:10201 \
      -e DEFAULT_VOICE=alba \
      -v pocket-tts-hf-cache:/root/.cache/huggingface \
      -v pocket-tts-cache:/root/.cache/pocket_tts \
      pocket-tts-wyoming
    Unable to find image 'pocket-tts-wyoming:latest' locally
    docker: Error response from daemon: pull access denied for pocket-tts-wyoming, repository does not exist or may require 'docker login': denied: requested access to the resource is denied
    
    

    I am logged into Docker as well via docker login.
    edit I cloned the repo via Git and was able to get it to build and run.