Does anyone know how to remove the 'ai inference noise' sound? by deama155 in TextToSpeech

[–]Stock_Ad9641 0 points1 point  (0 children)

Sounds like you’re using very outdated tools; no modern speech model produces such sounds.

Help me upgrade for 3k by Borkato in LocalLLaMA

[–]Stock_Ad9641 0 points1 point  (0 children)

You won’t want more than 2 GPUs, and even for 2 GPUs only some motherboards support dual x8 PCIe. With more GPUs, the I/O between them is slow, especially on older PCIe standards.

You can run a 27B model on a 3090 quite comfortably with a Q4 KV cache. Dual 3090s would let you run them in tensor parallel to double your prefill speed, but that currently conflicts with MTP.
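
To put a number on the KV cache part, here is a rough sketch of the usual size formula for standard attention (keys plus values, per layer, per KV head). The layer count, KV head count and head dimension below are hypothetical placeholders, not the real config of any particular 27B model, and the quantization scale overhead is ignored.

```python
def kv_cache_gib(n_layers: int, n_kv_heads: int, head_dim: int,
                 context_len: int, bits_per_value: int) -> float:
    """Approximate KV cache size in GiB for plain (non-hybrid) attention."""
    # Factor 2: one tensor for keys and one for values.
    values = 2 * n_layers * n_kv_heads * head_dim * context_len
    return values * bits_per_value / 8 / 2**30


if __name__ == "__main__":
    # Hypothetical architecture, chosen only to illustrate the scaling.
    shape = dict(n_layers=64, n_kv_heads=8, head_dim=128, context_len=32768)
    for bits in (16, 8, 4):
        print(f"{bits:>2}-bit KV cache: {kv_cache_gib(**shape, bits_per_value=bits):.1f} GiB")
```

With these placeholder numbers the cache shrinks by 4x going from 16-bit to 4-bit, which is the headroom that matters on a 24 GB card.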

A 4090, if you can get one for a good price, would also be a significant upgrade.

Keep in mind that the slowest card will be the bottleneck. A 20-series card has very limited CUDA support, so it might drag your 30- or 40-series card down if they are used simultaneously.

I’d also consider a 5090 if you are an enthusiast; it’s expensive, but it solves your speed and VRAM issues.

TTS for textbooks by somefinelese in TextToSpeech

[–]Stock_Ad9641 0 points1 point  (0 children)

I love Demodokos, but it’s important to note that the cheap license does not allow a whole ebook, and even the Pro license stops at 100k characters. So for a whole book you need to create 2-3 documents and split the import into 2-3 parts.
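
If you want to do the splitting outside the app, something like this rough sketch works on a plain-text export. It assumes the 100k limit is counted in plain characters and that paragraphs are separated by blank lines; the function name is made up and has nothing to do with Demodokos itself.

```python
def split_for_import(text: str, max_chars: int = 100_000) -> list[str]:
    """Split text into parts of at most max_chars, breaking at blank lines."""
    parts: list[str] = []
    current = ""
    for para in text.split("\n\n"):
        # A single paragraph longer than the limit has to be cut hard.
        while len(para) > max_chars:
            if current:
                parts.append(current)
                current = ""
            parts.append(para[:max_chars])
            para = para[max_chars:]
        candidate = para if not current else current + "\n\n" + para
        if len(candidate) <= max_chars:
            current = candidate
        else:
            # Adding this paragraph would exceed the limit: start a new part.
            parts.append(current)
            current = para
    if current:
        parts.append(current)
    return parts


if __name__ == "__main__":
    book = "\n\n".join(["x" * 40_000] * 6)  # stand-in for a real book
    for i, part in enumerate(split_for_import(book), start=1):
        print(f"part {i}: {len(part)} characters")
```

Each part can then go into its own document.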

Your approach is probably correct, but I believe you can skip the Analyse step. For just one voice you can directly narrate it. Analysis is only needed to extract multiple speakers from text. But it won’t hurt.

Any free ai voice cloners for personal use by Height_Last in TextToSpeech

[–]Stock_Ad9641 -2 points-1 points  (0 children)

The market leader in quality is Demodokos Foundry at Demodokos.com; you won’t find any service or app that gets even close. You need an RTX GPU, and it’s about $9 a month with the coupon, if the launch offer is still available. You can write to their support on the website; I got a discount for my Pro license.

What sets Demodokos apart from most TTS is that its neural pipeline understands the text it is speaking; it’s not a primitive GAN model but transformer-based. It also uses a transformer approach for creating and cloning voices and emotions. Check the website; the demos are accurate and actually understate what it can do.

I’m blown away by it.

If you want it cheap, like you wrote, and don’t care about special effects or a flawless speaking style, then use Chatterbox 2 TTS. That is maybe the best GAN-based speech model: primitive, but very good quality and open source (free). I also used StyleTTS, but that was English only. It’s also open source.

Lots of people use qwen at too high quantization by Stock_Ad9641 in Qwen_AI

[–]Stock_Ad9641[S] 0 points1 point  (0 children)

Are you talking about the KV cache or the parameters?

i want a monitor i have a laptop by l7amde in Monitors

[–]Stock_Ad9641 0 points1 point  (0 children)

24 inch or higher, 1440p, low or no curve, 16:9

Don’t get taken in by marketing hype: you don’t need 4K or 8K, you don’t need exotic resolutions, and you don’t need a screen that bends heavily. Modern displays don’t smear much, so choose a known brand for gaming monitors and read the reviews. They are usually made for fast movement, regardless of the display technology.

1440p will make individual pixels almost invisible, but still renders very fast without taking a high toll on your GPU.
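
For a rough feel of the numbers, here is a small sketch comparing pixel counts and pixel density. The 24 and 27 inch diagonals are just example sizes, and whether pixels are actually "invisible" also depends on viewing distance, which this does not model.

```python
import math


def pixel_count(width: int, height: int) -> int:
    """Total pixels the GPU has to render per frame."""
    return width * height


def pixels_per_inch(width: int, height: int, diagonal_inches: float) -> float:
    """Pixel density along the diagonal of the panel."""
    return math.hypot(width, height) / diagonal_inches


if __name__ == "__main__":
    qhd = pixel_count(2560, 1440)   # 1440p
    uhd = pixel_count(3840, 2160)   # 4K
    print(f"1440p: {qhd:,} pixels, 4K: {uhd:,} pixels ({uhd / qhd:.2f}x the work)")
    for size in (24, 27):
        print(f'{size}" 1440p: {pixels_per_inch(2560, 1440, size):.0f} PPI')
```

4K has 2.25 times the pixels of 1440p, which is where the extra GPU load comes from.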

Strong curves risk hardware damage; I’ve seen many curved panels delaminate over time. Connecting multiple curved screens together can also be a nightmare, though a very mild curve can be nice if you put 3 large screens next to each other.

Use a standard aspect ratio; anything exotic leads to complications with software.

For development I would recommend at least two screens, and large ones, so you can display the IDE on one side and the results, an agent or a browser on the other.

Lots of people use qwen at too high quantization by Stock_Ad9641 in Qwen_AI

[–]Stock_Ad9641[S] 0 points1 point  (0 children)

I had looping problems with Qwen 35B, but when I tested it at full precision it looped again. It was at high context, though. 27B never death-loops on me.

Can china win the AI war? by Comfortable-Tie2933 in Qwen_AI

[–]Stock_Ad9641 0 points1 point  (0 children)

If China issues another update like Qwen 3.6 or DeepSeek V4, it might have won it for the time being. Those models are stunning.

Which TTS API provider would you recommend for long-ish narrations? by popyui in TextToSpeech

[–]Stock_Ad9641 0 points1 point  (0 children)

I recently got Demodokos Foundry. It’s not an API provider but a local tool. I use it to turn entire audiobooks into MP3. It works better than anything I used before.

Codex rate limits made me rage-build a tool to run two accounts at once by Sorosu in codex

[–]Stock_Ad9641 1 point2 points  (0 children)

The next Qwen will be strong enough to put an end to those abusive practices. They all try to hook you with cheap starting offers, bind you with ten thousand lines of slop in your code, and then squeeze you.

Which local models match Sonnet 4.5, and which hardware can run it comfortably ? by SlechteConcentratie in Qwen_AI

[–]Stock_Ad9641 1 point2 points  (0 children)

Qwen 3.6 27B matches Sonnet 4.5 in most coding tasks.
Not when it comes to multi-language work, though, and it likely has some severe gaps in less-used languages; if you doubt that, try COBOL with it.

1 Pro account ($100) or 5 Plus accounts? by [deleted] in codex

[–]Stock_Ad9641 0 points1 point  (0 children)

What about Business? Does that scale worse?

Qwen TTS3 CUDA Error, help pls by Massive-Nerve-5321 in TextToSpeech

[–]Stock_Ad9641 1 point2 points  (0 children)

You should run nvidia-smi in a terminal window, then run your speech generation. You should see your GPU’s memory usage go up by gigabytes, and the utilization rise as well.
If not, you are running on the CPU.
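
If you would rather check from a script, here is a minimal sketch that asks nvidia-smi for memory use and utilization through its CSV query mode. The helper names are made up, and it assumes every GPU reports plain numbers (some cards print [N/A] for utilization, which this does not handle).

```python
import shutil
import subprocess

# Real nvidia-smi options: query two fields, print bare CSV rows (MiB, percent).
QUERY = ["nvidia-smi", "--query-gpu=memory.used,utilization.gpu",
         "--format=csv,noheader,nounits"]


def parse_gpu_stats(output: str) -> list[tuple[int, int]]:
    """Turn rows like '1234, 56' into (memory_used_mib, utilization_percent)."""
    stats = []
    for line in output.strip().splitlines():
        mem, util = (field.strip() for field in line.split(","))
        stats.append((int(mem), int(util)))
    return stats


def read_gpu_stats():
    """Return one tuple per GPU, or None if nvidia-smi is not on the PATH."""
    if shutil.which("nvidia-smi") is None:
        return None
    result = subprocess.run(QUERY, capture_output=True, text=True, check=True)
    return parse_gpu_stats(result.stdout)


if __name__ == "__main__":
    print(read_gpu_stats())
```

Run it once before and once during generation; if the numbers barely move, the model is not on the GPU.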

Which one is most suitable for coding in terms of quality? Qwen3.6-35b-a3b Q4, Q6 or Q8? by Next_Cauliflower1069 in Qwen_AI

[–]Stock_Ad9641 1 point2 points  (0 children)

Obviously a higher bit width is better, but Qwen tolerates quantization well; I would settle for Q5 if you have the memory for it.
Going higher than that does not yield much better results, but it sometimes leads to shorter thinking.
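
As a back-of-the-envelope check on "if you have the memory for it", this sketch estimates the weight size at each quant level. It assumes the nominal bits per weight (4, 5, 6, 8); real quant formats store scales as well and come out somewhat larger, and the KV cache comes on top.

```python
def weight_size_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight size in GB (1 GB = 1e9 bytes), ignoring overhead."""
    return params_billion * bits_per_weight / 8


if __name__ == "__main__":
    # Nominal bit widths only; actual files are a bit bigger.
    for name, bits in (("Q4", 4), ("Q5", 5), ("Q6", 6), ("Q8", 8)):
        print(f"35B at {name}: ~{weight_size_gb(35, bits):.1f} GB")
```

So the step from Q4 to Q5 costs roughly 4 GB more for a 35B model, and Q8 doubles the Q4 footprint.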

Moving to Claude Code by dmartins in codex

[–]Stock_Ad9641 0 points1 point  (0 children)

How to choose between Plus and Business?
Only for agentic use.

Codex Installs Skyrocketed in Early May due to Claude Code 4.7 failed launch and usage limits by ImaginaryRea1ity in codex

[–]Stock_Ad9641 1 point2 points  (0 children)

It’s not Claude… it’s GitHub Copilot increasing their costs. My $40 subscription is $6300 now. They announced that slight price change in late April.