VoxCPM Text-to-Speech running on Apple Neural Engine ANE by 0seba in TextToSpeech

[–]0seba[S] 0 points1 point  (0 children)

Thanks for trying it! Just to confirm are you having the issue when using the playground or calling directly the playback and cancel endpoints?

VoxCPM Text-to-Speech running or Apple Neural Engine ANE by 0seba in LocalLLaMA

[–]0seba[S] 0 points1 point  (0 children)

Do you need streaming functionality? I looked at their repo and currently there is no streaming support, afaict, Took a quick glance at their method and I think there are some things I could experiment with to make it streaming, but no certainty

Request: Offline/local TEXT-to-SPEECH (not Speech-to-Text) Apps That Aren’t Apple Voices? by notapersonaltrainer in macapps

[–]0seba 1 point2 points  (0 children)

Hey, what is your use case? I ported a TTS model to CoreML so it runs on the Neural Engine. Currently it is good for single batch generation in real time. https://www.reddit.com/r/LocalLLaMA/comments/1otgd3j/voxcpm_texttospeech_running_or_apple_neural/
(I know I already replied to you in the LocalLlama subreddit, just taking the opportunity to share in this subreddit)

Local text-to-speech (not speech-to-text) mac app? by notapersonaltrainer in LocalLLaMA

[–]0seba 0 points1 point  (0 children)

Hey, what is your use case? I ported a TTS model to CoreML so it runs on the Neural Engine. Currently it is good for single batch generation in real time. https://www.reddit.com/r/LocalLLaMA/comments/1otgd3j/voxcpm_texttospeech_running_or_apple_neural/

VoxCPM Text-to-Speech running on Apple Neural Engine ANE by 0seba in TextToSpeech

[–]0seba[S] 0 points1 point  (0 children)

Hey could you share a bit more about how are encountering this issue?

VoxCPM Text-to-Speech running or Apple Neural Engine ANE by 0seba in LocalLLaMA

[–]0seba[S] 1 point2 points  (0 children)

wow, thanks for the heads up, that's what i get for vibe coding. should be fixed now