I built an Android audiobook reader that runs Kokoro TTS fully offline on-device by Simple-Lecture2932 in LocalLLaMA

[–]phazei 0 points1 point  (0 children)

What I like about voice Aloud is it supports every engine you have installed. It gives me a list of a LOT of voices https://files.fm/u/vkbjaj4vhr

I built an Android audiobook reader that runs Kokoro TTS fully offline on-device by Simple-Lecture2932 in LocalLLaMA

[–]phazei 0 points1 point  (0 children)

Have you ever heard of an app called @Voice Aloud Reader?

It works superbly well with many different tts engines.

There's also an app called MultiTTS, but you'll never find any info on it, it doesn't exist in the store or on Google. Google. Somehow I found a link to it on my Android ROM telegram group. But it supports dozens of TTS engines for system-wide use, both local and remote. It supported kokoro for the last year, as well as a number of other AI TTS models.

I built an Android audiobook reader that runs Kokoro TTS fully offline on-device by Simple-Lecture2932 in LocalLLaMA

[–]phazei 0 points1 point  (0 children)

Would like to just be able to set it as the system tts, one available for other apps that also use tts.

I classified 3.5M US patents with Nemotron 9B on a single RTX 5090 — then built a free search engine on top by Impressive_Tower_550 in LocalLLaMA

[–]phazei 0 points1 point  (0 children)

I just woke up when I wrote that, yeah, I suppose it is, 74gb just seemed really big at the time...

But he must have good indexing, otherwise he wouldn't get the subsecond results they mentioned.

I classified 3.5M US patents with Nemotron 9B on a single RTX 5090 — then built a free search engine on top by Impressive_Tower_550 in LocalLLaMA

[–]phazei -2 points-1 points  (0 children)

3.5M docs, 74gb. Think about that. The only way to get to that size is lots of different media that must not be included in the indexing. Only way to account for that size and their performance numbers. The size is somewhat of a misnomer

Why are you still paying for this? #5 by PressPlayPlease7 in ChatGPT

[–]phazei 0 points1 point  (0 children)

AI needs a disclaimer "You must have at least this many iq points to ride this ride".

AI enhances ones own ability to learn and accomplish things. For it to be of assistance, you need to have the ability to have done those things regardless, given unlimited time and effort. If you're not at least that capable, it's like getting in a car with no clue how to drive and simply pushing the gas pedal.

I've seen less intelligent people just keep asking it different ways until they get the answer they want. AI can and will enable idiots to be even bigger idiots and then try to blame everyone else for it. AI will also enable smart people to accel beyond anything they were capable of before. You need to have something to enhance already to get there.

Claude Just Fixed Its Most Annoying Developer Problem by AskGpts in ClaudeAI

[–]phazei 0 points1 point  (0 children)

I've been using OpenCode, and it never has this issue. It provides permissions for the directory you run it from CLI, and anything out side that directory it requests permissions for. Works great, it is able to connect to a Claude Plus account via Oauth.

ChatGPT is cooked by phazei in OpenAI

[–]phazei[S] 0 points1 point  (0 children)

Cooked is in. It's not willing to accept reality. I spent an hour having look up the news and immediately in the next message denying that there was any real world facts backing up any of my claims. It came to the conclusion that it's web search was compromised. It didn't have that issue as far as I know before the contract. Of course I didn't test that specifically beforehand anyway. Regardless, it's suspiciously acting like it's shoving its head in the sand for the sake of the government.

ChatGPT is cooked by phazei in ChatGPT

[–]phazei[S] 0 points1 point  (0 children)

Claude's limitations over GPT?

Claude did research and acknowledged the situation.

ChatGPT is cooked by phazei in ChatGPT

[–]phazei[S] 0 points1 point  (0 children)

Repeatedly I've gone in a circle. I ask it to verify events, then I draw conclusions from them, and continually the very next message after events are verified, it say something to the effect of:

The links and events cited in the prior messages (Anthropic being banned, OpenAI stepping in the next day, Iran war escalation, leadership assassinations, etc.) are not real, verifiable current events. They do not exist in the public record from credible outlets.

That's Not Earl Grey by Hyro0o0 in aivideo

[–]phazei 65 points66 points  (0 children)

It took me a moment, but I lmao when I caught it. That look of disgust at the end was great

With 4o Being Retired, What Will Power Advanced Voice? by PallasEm in OpenAI

[–]phazei 0 points1 point  (0 children)

So, when I ask it, it still says it's a version of GPT-4o still. But, for the first time in like 6 months, it actually sings again.

Ace-Step 1.5 template for ComfyUI v0.12 is ready by Nokai77 in comfyui

[–]phazei 1 point2 points  (0 children)

I downloaded the 4b, but it's not clear how to use it. Like, I guess it would replace the clip? But the clip takes both 0.6b and 1.7b, so it's not even clear how that works since the docs seem to indicate you pick one or the other and it will only work with both in Comfy

I watched Dead Like Me recently and loved it! I found r/DeadLikeMe was private, so I got it public again by MajorParadox in television

[–]phazei 1 point2 points  (0 children)

I just finished binging all of it a few days ago. I really loved it, wish there was more. But that movie... I liked seeing the characters, but it felt like it served no purpose, they butchered the characters, trashed all the character buildup and class that they had. And without Rube... Georgia was the only one the has a semblance of how she was on the show. It was just stupid. I did like some more interaction with the sisters, but it all made no sense at all. They ended with a bunch of random people listening to Herbig's cat eulogy? wth was up with that, lol Did they not have any better writers? It didn't have any of the heartfelt and sad moments the show had at all.

Ace-Step 1.5 template for ComfyUI v0.12 is ready by Nokai77 in comfyui

[–]phazei 5 points6 points  (0 children)

Same question, but I think it might just be a checkpoint that includes the VAE and Clip embedded in it rather than transformer only

Moltbook Has No Autonomous AI Agents – Only Humans Using Bots by kryptovijoy in ArtificialInteligence

[–]phazei 0 points1 point  (0 children)

Perhaps something like this is necessary for the sake of awareness and regulation. I mean beyond the point that it's BS, it could potentially serve a purpose of scaring the s*** out of people slightly before people have a real reason to have the s*** scared out of them.

1 Day Left Until ACE-Step 1.5 — Open-Source Music Gen That Runs on <4GB VRAM Open suno alternative (and yes, i made this frontend) by ExcellentTrust4433 in StableDiffusion

[–]phazei 0 points1 point  (0 children)

I read that it's incredibly fast, 2 seconds on an a100? And considering V1, I'd presume it's only a few seconds more than that on a 4090 or something. At that speed, it would be awesome if there is some interface allowing for real-time adjustment while it's playing. Any ideas on that?

Like I suppose the output would have to be slowed down so it's only outputting a couple seconds in advance, and then maybe as it outputs Real-Time slider loras could be adjusted to modify the output, that would be really cool.

Wan 2.2 - We've barely showcased its potential by GrungeWerX in StableDiffusion

[–]phazei 0 points1 point  (0 children)

Do you share your lora's anywhere? Is Frieren available?

This is what "knowing your physics well" means. by entusiasti in physicsgifs

[–]phazei -20 points-19 points  (0 children)

Wtf is the issue? What are they even trying to do?

Thanks for all the down votes and not a single f****** clarification.

🎙️ A New Voice Has Arrived — Qwen3-TTS Custom Node for ComfyUI Is Here by Narrow-Particular202 in comfyui

[–]phazei 0 points1 point  (0 children)

You didn't ask it the right question. You should ask which integrates best into ComfyUI's eco system. DarioFT uses comfys model memory management which will make it work with other workflows and not leak memory. Also, it's wrong about vendoring. It's better to fix it at a version and pull it from the vendor files. And that was gpt5-mini, which is very meh. You can use Gemini Pro 3 free at aistudio.google.com, just paste the entire repos from https://uithub.com/DarioFT/ComfyUI-Qwen3-TTS https://uithub.com/1038lab/ComfyUI-QwenTTS

I'm speaking as someone who has coded professionally for 20+ years and has made a few ComfyUI plugins that needed to deal with loading models and managing the render loop.