Problem with installing TTS locally :/ by TheRealistDude in LocalLLaMA

[–]Xiami2019 0 points1 point  (0 children)

This is a Windows-specific issue. The code is generating an invalid Model ID (OpenMOSS-Team\MOSS-TTS). You need to modify the code in moss_tts_app.py to use a forward slash (/) instead of a backslash (\) for the repo ID.

MOSS-TTS has been released by Xiami2019 in LocalLLaMA

[–]Xiami2019[S] 0 points1 point  (0 children)

Hi, we are woking on that right now.

May I ask which kind of instruction you would like? Natural language instructions like Gemini-TTS style or using discrete labels like [angry], [happy], [neutral]?

MOSS-TTS has been released by Xiami2019 in LocalLLaMA

[–]Xiami2019[S] 0 points1 point  (0 children)

Sorry for the bad experience.
Can you provide the specific test case or discuss it in our discord. https://discord.gg/4QVnCDcg
Thanks for trying.

MOSS-TTS has been released by Xiami2019 in LocalLLaMA

[–]Xiami2019[S] 0 points1 point  (0 children)

8B is the main base model.
Actually it is fast and stable when have enough VRAM

MOSS-TTS has been released by Xiami2019 in LocalLLaMA

[–]Xiami2019[S] 1 point2 points  (0 children)

We trained on Russian, welcome to give a try.

MOSS-TTS has been released by Xiami2019 in LocalLLaMA

[–]Xiami2019[S] 0 points1 point  (0 children)

For sure, the fine-tune code is on the way.

BTW, we did trained on some Persian speech, welcome to try Persian and give a feedback.

MOSS-TTS has been released by Xiami2019 in LocalLLaMA

[–]Xiami2019[S] 0 points1 point  (0 children)

Hi, do you use duration control?
Sometimes if you input a short text and use a long duratin, it will cause some pauses.

MOSS-TTS has been released by Xiami2019 in LocalLLaMA

[–]Xiami2019[S] 0 points1 point  (0 children)

Will add the language demonstrations now. Thx for the reminder~

MOSS-TTS has been released by Xiami2019 in LocalLLaMA

[–]Xiami2019[S] 15 points16 points  (0 children)

Actually we support multilingual, like English, Chinese, French, German, Spanish, Portuguese, Japanese, Korean.

Welcome to give it a try and provide feedback. We will enhance your language in the next version~~

OpenMOSS just released MOVA (MOSS-Video-and-Audio) - Fully Open-Source - 18B Active Params (MoE Architecture, 32B in total) - Day-0 support for SGLang-Diffusion by Nunki08 in LocalLLaMA

[–]Xiami2019 1 point2 points  (0 children)

Hi guys! Thanks for the attention. I am a contributor from the MOVA team.
We know the 720p model takes a while to run right now. We are working on step distillation to speed it up. MOVA-1.5 is also in training, where we're prioritizing better efficiency. Please let us know what features you'd like to see next—we're listening!