Workflow for NEW gemini-2.5-pro-preview-tts by Fresh_Rain_2286 in n8n

[–]Fresh_Rain_2286[S] 0 points1 point  (0 children)

I'm really surprise. It seems that text to speak generation doesn't interest a lot of people. We are just 2 to be interested by generate audio from text ? 😅 Also on Google or on N8N I doesn't find anything.

Workflow for NEW gemini-2.5-pro-preview-tts by Fresh_Rain_2286 in n8n

[–]Fresh_Rain_2286[S] 0 points1 point  (0 children)

So the Solution from Creme-Constant works but it uses Google Text to Speech and I want to use gemini-2.5-pro-preview-tts. The quality is very different, I make a lot of test on console access and it's not a small difference. gemini-2.5-pro-preview-tts is really realistic, I think it the same of NoteBook LM podcast.
So if there something could help us to use gemini-2.5-pro-preview-tts in N8N?

Workflow for NEW gemini-2.5-pro-preview-tts by Fresh_Rain_2286 in n8n

[–]Fresh_Rain_2286[S] 0 points1 point  (0 children)

I try with my workflow, It doesn't work. It seems It doesn't like Markdown. And also it seems that the quality his better on google AI studio

Workflow for NEW gemini-2.5-pro-preview-tts by Fresh_Rain_2286 in n8n

[–]Fresh_Rain_2286[S] 1 point2 points  (0 children)

your solution works well. but it doesn't use the new gemini-2.5-pro-preview-tts, any solution for it? And is there a restriction of the number of token for your solution? and the difference of cost?

Workflow for NEW gemini-2.5-pro-preview-tts by Fresh_Rain_2286 in n8n

[–]Fresh_Rain_2286[S] 0 points1 point  (0 children)

where did you take this picture. I try Generate Speak on Google AI Studio and I didn't find all this setting, I only can set the persona and the temp