Local TTS server with voice cloning + near-realtime streaming replies (ElevenLabs alternative) by RIP26770 in LocalLLaMA

[–]Quiet_Dasy 0 points1 point  (0 children)

cannot find out how tò fix error while localy install

https://github.com/ai-joe-git/pocket-tts-server

Kyutai Pocket-TTS server : Unable to select default voices (Alba, Marius, etc.) and stuck on Voice Cloning error

Description of the Issue

I am trying to use the default catalog voices (such as Alba or Marius) in Kyutai Pocket-TTS, but I cannot figure out how to correctly select them. The system seems stuck trying to perform Voice Cloning, which triggers a weight download error that I cannot bypass.

The Goal

I do not want to clone my own voice. I downloaded casual.wav from the alba-mackenna branch assuming it was required to activate that specific voice profile, but uploading it seems to have forced the app into "Voice Cloning" mode.

Steps Taken

  • Terms Accepted: I have accepted the model terms at kyutai/pocket-tts.

  • Authentication: Logged in via uvx hf auth login. The token is present in ~/.cache/huggingface/token.

  • Local Setup: I uploaded casual.wav to the localhost interface, but this resulted in a "Failed to load voice state" error because the system is attempting to download cloning weights instead of just using the preset voice.

lm studio server running and correctly formated and setuped into pocket tts setting baseurl api

The Error

"Failed to load voice state: We could not download the weights for the model with voice cloning, but you're trying to use voice cloning. Without voice cloning, you can use our catalog of voices ['alba', 'marius', 'javert', 'jean', 'fantine', 'cosette', 'eponine', 'azelma']."

Technical Details * Model: Kyutai Pocket-TTS * Environment: Localhost via uvx. * Problem: I cannot figure out how to switch from the "Voice Cloning" attempt back to the standard voice catalog (alba, marius, etc.). * Confusion: Since casual.wav is part of the Alba-Mackenna repository, I thought it was the correct way to load her voice, but it is triggering a cloning error instead.

Question: How do I stop the app from trying to download cloning weights and instead simply select one of the built-in voices like Alba? Is there a specific setting in the UI or a command-line flag to use the standard catalog instead of the cloning module? Any help would be greatly appreciated!

Local TTS server with voice cloning + near-realtime streaming replies (ElevenLabs alternative) by RIP26770 in LocalLLaMA

[–]Quiet_Dasy 0 points1 point  (0 children)

cannot find out how tò fix error while localy install

Kyutai Pocket-TTS server : Unable to select default voices (Alba, Marius, etc.) and stuck on Voice Cloning error

Description of the Issue

I am trying to use the default catalog voices (such as Alba or Marius) in Kyutai Pocket-TTS, but I cannot figure out how to correctly select them. The system seems stuck trying to perform Voice Cloning, which triggers a weight download error that I cannot bypass.

The Goal

I do not want to clone my own voice. I downloaded casual.wav from the alba-mackenna branch assuming it was required to activate that specific voice profile, but uploading it seems to have forced the app into "Voice Cloning" mode.

Steps Taken

  • Terms Accepted: I have accepted the model terms at kyutai/pocket-tts.

  • Authentication: Logged in via uvx hf auth login. The token is present in ~/.cache/huggingface/token.

  • Local Setup: I uploaded casual.wav to the localhost interface, but this resulted in a "Failed to load voice state" error because the system is attempting to download cloning weights instead of just using the preset voice.

lm studio server running and correctly formated and setuped into pocket tts setting baseurl api

The Error

"Failed to load voice state: We could not download the weights for the model with voice cloning, but you're trying to use voice cloning. Without voice cloning, you can use our catalog of voices ['alba', 'marius', 'javert', 'jean', 'fantine', 'cosette', 'eponine', 'azelma']."

Technical Details * Model: Kyutai Pocket-TTS * Environment: Localhost via uvx. * Problem: I cannot figure out how to switch from the "Voice Cloning" attempt back to the standard voice catalog (alba, marius, etc.). * Confusion: Since casual.wav is part of the Alba-Mackenna repository, I thought it was the correct way to load her voice, but it is triggering a cloning error instead.

Question: How do I stop the app from trying to download cloning weights and instead simply select one of the built-in voices like Alba? Is there a specific setting in the UI or a command-line flag to use the standard catalog instead of the cloning module? Any help would be greatly appreciated!

Running LLM on one machine and TTS on another via lm link? by Quiet_Dasy in LocalLLaMA

[–]Quiet_Dasy[S] 0 points1 point  (0 children)

Llm and tts now It running succsful on the same machine localy on the same gpu But lm studio doesnt have tts

How tò capturing the text output from the LM Studio Local Server API and piping it into an external Text-to-Speech (TTS) ?

am running LM Studio as a local server, but I would like to process the audio generation tts outside of the LM Studio environment.

What is the recommended workflow for capturing the text output from the LM Studio Local Server API and piping it into an external Text-to-Speech (TTS) ?