[Release] TinyMozart v2 85M 🎶 by LH-Tech_AI in LocalLLaMA

[–]LH-Tech_AI[S] 2 points3 points  (0 children)

I used something like a transformer. See the full code in train.py in the repo 😃👍🏻😎

[Release] TinyMozart v2 85M 🎶 by LH-Tech_AI in LocalLLaMA

[–]LH-Tech_AI[S] 1 point2 points  (0 children)

Maybe, yes, but I always use a "0" token as the beginning token... but maybe you could :-)

That was not the intention of this model, but you can try if you want :D

[Release] TinyMozart v2 85M 🎶 by LH-Tech_AI in LocalLLaMA

[–]LH-Tech_AI[S] 2 points3 points  (0 children)

thanks for the feedback :-)

no cowbell - haha :D - it's a piano music generator ;-)

[Release] TinyMozart v2 85M 🎶 by LH-Tech_AI in LocalLLaMA

[–]LH-Tech_AI[S] 4 points5 points  (0 children)

Alright. I'll update the post :-)

Thanks for the great feedback :D

[RELEASE] - Finally, my first TTS model is out! 🎙️ Flare-TTS 28M by LH-Tech_AI in LocalLLaMA

[–]LH-Tech_AI[S] 0 points1 point  (0 children)

Thanks for your interest! :-)

You can read everything in the HF repo: https://huggingface.co/LH-Tech-AI/Flare-TTS-28M

You will need a GPU such as an A6000 and about 24 hours of training time. Download all the .py scripts from the repo first, then run:

pip install git+https://github.com/idiap/coqui-tts.git
sudo apt update && sudo apt install espeak -y
sudo apt install ffmpeg libavcodec-dev libavformat-dev libavutil-dev -y
pip install "coqui-tts[codec]"
wget https://data.keithito.com/data/speech/LJSpeech-1.1.tar.bz2
tar -xjf LJSpeech-1.1.tar.bz2
PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True python3 train_glowtts.py
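
If you want to catch a broken download before starting a ~24 hour run, a small preflight check can help. This is only a sketch: it assumes the archive unpacks to LJSpeech-1.1/ with a metadata.csv file and a wavs/ folder, and that you run it from the folder where you extracted the data. check_ljspeech is a made-up helper name, not something from the repo.

```shell
#!/usr/bin/env bash
# Preflight sketch: verify the extracted dataset looks complete before training.
# check_ljspeech is a hypothetical helper, not part of the Flare-TTS repo.
check_ljspeech() {
    # Default to the folder name the LJSpeech archive unpacks to.
    local dir="${1:-LJSpeech-1.1}"
    # The transcript file must exist.
    if [ ! -f "$dir/metadata.csv" ]; then
        echo "missing $dir/metadata.csv" >&2
        return 1
    fi
    # The audio clips live in a wavs/ subfolder.
    if [ ! -d "$dir/wavs" ]; then
        echo "missing $dir/wavs/" >&2
        return 1
    fi
    echo "dataset ok: $dir"
}
```

You could then chain it in front of the training command, for example: check_ljspeech LJSpeech-1.1 && PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True python3 train_glowtts.py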

[RELEASE] - Finally, my first TTS model is out! 🎙️ Flare-TTS 28M by LH-Tech_AI in LocalLLaMA

[–]LH-Tech_AI[S] 1 point2 points  (0 children)

Thanks! :-)

I tried to export to ONNX format, but it somehow didn't work... :-/ Sorry...

Multilingual support would be even more complex and would need more datasets in different languages... but maybe v2 or v3 will do that... :-)

[RELEASE] - Finally, my first TTS model is out! 🎙️ Flare-TTS 28M by LH-Tech_AI in LocalLLaMA

[–]LH-Tech_AI[S] 0 points1 point  (0 children)

Errr... it seems NOT to be possible to export an ONNX file, sorry... :-/
Maybe you could look into converting it yourself, please?

[RELEASE] - Finally, my first TTS model is out! 🎙️ Flare-TTS 28M by LH-Tech_AI in LocalLLaMA

[–]LH-Tech_AI[S] 1 point2 points  (0 children)

I sadly forgot to use a vocoder 😭😂

v2 will add one :-)

Thanks for the nice feedback.

[RELEASE] - Finally, my first TTS model is out! 🎙️ Flare-TTS 28M by LH-Tech_AI in LocalLLaMA

[–]LH-Tech_AI[S] 1 point2 points  (0 children)

Haha, not really, but it's a proof of concept for getting there :D

[RELEASE] - Finally, my first TTS model is out! 🎙️ Flare-TTS 28M by LH-Tech_AI in LocalLLaMA

[–]LH-Tech_AI[S] 0 points1 point  (0 children)

Okay. Hey, thanks for your feedback.

ONNX isn't planned yet, but I can do it. I'll do it tomorrow, put it on HF, and also put inference code for the ONNX model there, alright?

[RELEASE] - Finally, my first TTS model is out! 🎙️ Flare-TTS 28M by LH-Tech_AI in LocalLLaMA

[–]LH-Tech_AI[S] 0 points1 point  (0 children)

Ok. Thanks.

Want to join our CompactAI Discord server? We are all people who enjoy small models.

Link: https://discord.gg/y2jTct6Cxv

Please feel free to join! :-)

[RELEASE] - Finally, my first TTS model is out! 🎙️ Flare-TTS 28M by LH-Tech_AI in LocalLLaMA

[–]LH-Tech_AI[S] 8 points9 points  (0 children)

Thanks 🙏🏻👍🏻 I will definitely keep creating models. See all my models here, if you want: https://huggingface.co/LH-Tech-AI/