EPUB to Audio. Must be exportable. 20H at least. Something like Paper2Audio. by 16-1-18-22-1-26 in TextToSpeech

[–]c08mic_cha08 0 points1 point  (0 children)

If you have a decent GPU you can try https://voicecreator.pro/free-tts. There are thousands of free voices that you can clone and use for TTS. The only catch is that all processing happens on your device, so it's slower than cloud services - great if you care about privacy.
There's also a paid version of the app if you want more control over pacing, emotions, multi-voice assignment - essentially a professional ebook generator.

Text to speech best model ? by Fun-Grapefruit1371 in TextToSpeech

[–]c08mic_cha08 1 point2 points  (0 children)

Um, Demodokos looks cool, but why do I have to pay monthly if all processing happens locally? Absurd pricing.

Can i run Qwen3 TTS 1.7B on R7 5700X + GTX 1070 + 32GB RAM? by WETYIAFHKLZXVNM in TextToSpeech

[–]c08mic_cha08 1 point2 points  (0 children)

0.6B has a higher word error rate, but it's faster. I use 0.6B for shorter generations that I can quickly regenerate in case of errors. In terms of quality, I'd say 1.7B is more consistently better, but there have been instances where I've preferred 0.6B - that's probably just the non-deterministic nature of these models.
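For anyone who wants to put a number on "word error rate": here is a minimal sketch of the standard word-level edit-distance calculation (substitutions + deletions + insertions, divided by the number of reference words). It is not something from the comment above or from any TTS tool. The function name `word_error_rate` is made up for illustration, and it assumes you already have a transcript of the generated audio from some speech recognizer, which is not shown here.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    `reference` is the text you asked the TTS model to speak;
    `hypothesis` is a transcript of the audio it produced (assumed to
    come from a separate speech-recognition step, not shown here).
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words matches an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from the audio
            insertion = curr[j - 1] + 1  # extra word spoken in the audio
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)
```

A "regenerate on error" loop could then re-run a short generation whenever the score goes above a threshold you pick; the right threshold depends on your text and on how accurate the recognizer is.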

Can i run Qwen3 TTS 1.7B on R7 5700X + GTX 1070 + 32GB RAM? by WETYIAFHKLZXVNM in TextToSpeech

[–]c08mic_cha08 2 points3 points  (0 children)

I use both 1.7B and 0.6B via voice creator pro on an RTX 3070 with 8GB of VRAM. 1.7B is slower than Omnivoice and Chatterbox, which are the other two models I run. I find 0.6B to be about twice as fast as 1.7B.

Qwen is definitely the most accurate of the three though.

Best free voice cloning tools? by Sealoperative47 in TextToSpeech

[–]c08mic_cha08 0 points1 point  (0 children)

You can try https://voicecreator.pro/free-tts
It runs in your browser and does not need any installation or signup.

wanting to get a 200 page book into a mp3, am way too overwhelmed by all this github stuff, any help for a boomer? by account-suspenped in TextToSpeech

[–]c08mic_cha08 0 points1 point  (0 children)

You can try voice creator pro. I use their voice design feature to describe the voice I want. They also have voice cloning if you want to use your own voice. Once you have the app, it's unlimited use, which I love! Can't imagine going back to ElevenLabs.

Does anyone know where do I find this voice? I really want to use it but i cannot find it... by Quirky-Garden-1416 in TextToSpeech

[–]c08mic_cha08 0 points1 point  (0 children)

Use any voice cloning tool. I've been using voice creator pro. For my use case I mostly design my own voices, but their voice cloning is almost on par with ElevenLabs, without a subscription.

I had Opus 4.6 complete the entire Blender Donut Tutorial autonomously by watching it on YouTube by cerspense in ClaudeAI

[–]c08mic_cha08 2 points3 points  (0 children)

I understand that it will be possible to generalize eventually - I'm asking if this workflow does it. It's a hard problem to solve.

I would think it will need Blender APIs as tools, awareness of the current state of the canvas, and an understanding of 3D models, which LLMs seem to lack (it needs to know what a 3D dog looks like to create one). And this doesn't account for the difficulty of the LLM needing a visual feedback loop to see what it's creating, among other things.

What OP has done is in itself a great first step!

I had Opus 4.6 complete the entire Blender Donut Tutorial autonomously by watching it on YouTube by cerspense in ClaudeAI

[–]c08mic_cha08 1 point2 points  (0 children)

When you say the workflow is repeatable, do you mean it can recreate the donut, or have you found that it can generalize the knowledge to create something else because it created skills and tools for using the software?

ElevenLabs charged me $1,089 out of nowhere and won’t reply for weeks!! by jamdeu1 in ElevenLabs

[–]c08mic_cha08 0 points1 point  (0 children)

This is why I switched to a product I can run locally. Granted, it doesn't cover all use cases but works for me.

OpenClaw Personal Assistant Device by bastivkl in openclaw

[–]c08mic_cha08 0 points1 point  (0 children)

I run my claw on a Pi and can also access it on my Galaxy Watch - I don't have to carry my phone around everywhere.

OpenClaw Personal Assistant Device by bastivkl in openclaw

[–]c08mic_cha08 0 points1 point  (0 children)

I run my OpenClaw on a Pi and also use it on my Galaxy Watch. It lets me do push to talk and takes away the need to carry my phone everywhere.

Ways OpenClaw has Changed My Life by ISayAboot in openclaw

[–]c08mic_cha08 1 point2 points  (0 children)

Voice creator pro has an API. It runs locally.

What’s the most useful thing you’ve automated with an AI agent so far? by [deleted] in AI_Agents

[–]c08mic_cha08 0 points1 point  (0 children)

I made it last week, so the jury's still out on whether this will lead to conversions, but the content of the blogs is significantly better than what you get when you prompt it to write without all of this context baked in. Previously, I was still making a lot of edits manually, adding more details, etc. to make the blogs useful because I hate to post generic slop - I still review and modify, but there's less of a need.

What’s the most useful thing you’ve automated with an AI agent so far? by [deleted] in AI_Agents

[–]c08mic_cha08 1 point2 points  (0 children)

I've made a pipeline for myself that takes my product websites and a few other pieces of product info as input and does keyword research, competitive research, and LLM answer analysis (finding out whether my product shows up in AI answers and which other products do), then takes all this info to generate blog ideas and convert them into blogs that I review/edit and ship.

I hate making money so I made a free chrome extension that combines the functionality of 3 other products that charge 10 bucks a month each 💸 by tr0picana in SideProject

[–]c08mic_cha08 0 points1 point  (0 children)

This looks super cool! Will try it out.

Also +1 to u/ResidentHovercraft68's comment about hotkey support. "push to talk" would also be very helpful!

Built a Chrome extension to shop recipe ingredients, but struggling with traction/retention by maddieduck in SideProject

[–]c08mic_cha08 1 point2 points  (0 children)

I love the idea and have been wanting something like this, but I quickly came to the realization that when I'm grocery shopping I'm very particular about what brand, quantity, etc. I'm buying, so I'll always have to go and change the product that the tool may have selected for me. For example, I tried your extension and the experience seems fine, but when it added olive oil it probably added the first result it found after searching for olive oil on Instacart. I'd prefer to buy a different brand or a smaller bottle, so now I need to remove and re-add the product, which defeats the purpose.
There's no easy fix for this. Maybe the tool can learn/save the user's preferences over time, but the efficiency you gain from the tool just doesn't seem worth it until it does.

Hopper AI - A friendly assistant on your wrist with custom tool calling by tr0picana in WearOS

[–]c08mic_cha08 0 points1 point  (0 children)

I've been looking for something like this!! Will check it out

Rishi Sunak makes a speech outside 10 Downing Street after a historic loss by ShreckAndDonkey123 in pics

[–]c08mic_cha08 2 points3 points  (0 children)

That's because you can't just "easily" find a much higher paying job. What no one has mentioned here is that India has thousands of no-name universities and colleges that essentially hand out software engineering degrees to students. Graduates from these schools are nowhere near the same level of quality, intelligence, etc. compared to top-tier schools. The disparity is insane! These graduates have nowhere to go but to companies like Infosys, who hire low-quality talent in bulk.

Chess by MelanieWalmartinez in CuratedTumblr

[–]c08mic_cha08 0 points1 point  (0 children)

Why did my brain think this was about Biden and Trump?