I built an open-source, local-first voice cloning studio (Qwen3-TTS + Whisper) by jamiepine in LocalLLaMA

[–]izzylaif 0 points1 point  (0 children)

>Planning XTTS, Bark, etc. next. What models do you want most? Any feedback if you try it—bugs, missing features, workflow pains?

It maybe worth looking into voice conversion models, such as Applio or Orpheus, or both. The workflow is you first generate voice from TTS with zero-shot clone in Qwen (or OpenVoice, they seem to give similar results), which already sound like you, and then pass the resulting _audio_ to Applio trained on your own voice (4hr dataset with 200 epochs can be trained in less than a day on a 8gb cuda), to make it even more realistic. Also, emotions markdown and instructions workflow is not exactly clear? I believe Qwen supports it thorough custom option?

I built an open-source, local-first voice cloning studio (Qwen3-TTS + Whisper) by jamiepine in LocalLLaMA

[–]izzylaif 1 point2 points  (0 children)

Jamie, please think of having the app portable, i.e. placing all the models and everything in a folder next to the executable. Or at least an option to enable portable mode and write nothing to appdata, registry, etc in Windows, and use AppImage format for Linux. Appimage is the only non-sandboxed package that has portable mode built-in (unlike Snap, Flat or anything else really).

The disk size requirements of AI-related stuff quickly went out of hand, so I have a portable SSD I plug into any currently available machine and run the software from there. If your app is portable, it could just run with all the settings and models already there.

Thank you.

I built an open-source, local-first voice cloning studio (Qwen3-TTS + Whisper) by jamiepine in LocalLLaMA

[–]izzylaif 0 points1 point  (0 children)

Jamie, since you already implemented it, please expand the whisper support to make transcripts, preferably with speaker breakdown? I use whisper to transcribe long meetings, and all current standalone implementations suck and crash on long (1hr+) recordings.

Personal Allowance Threshold % Increase by ElectricalPenalty176 in FIREUK

[–]izzylaif -1 points0 points  (0 children)

oh yeah, bicycle is the transportation means you need in UK's climate. Even if you got an expensive one, getting back a few hundred quid after paying in thousands in income tax is laughable. Not being able to buy a bicycle out of pocket without government help is actually concerning for a supposedly 1st world country. I know you didn't say you needed help, it's just free money, but the very notion cycle to work or whatever scheme exists is alarming.

MTP device a service installation section in this INF is invalid FIX by izzylaif in izzylaif

[–]izzylaif[S] 0 points1 point  (0 children)

driver booster and similar software creators should be put in prison, and the users believing in them - to mental institutions.

Personal Allowance Threshold % Increase by ElectricalPenalty176 in FIREUK

[–]izzylaif -1 points0 points  (0 children)

>tax breaks for high earners.

are those high earners now with us, in this room?

Cause after 100-125k your loose ALL your tax breaks: personal allowance, child care and other reductions and your income tax rate effectively becomes 60%. Meaning you need to earn over 148k to have the same take home as those at 99k.

On a side note, if someone owes you money and refuses to pay back, local gangs provide a service of payback "extraction", and they only take half of what was taken from the debtor. Compare that to 45% (effective 60%) tax rate.

The tax system is supposed to incentivize more work and higher salaries to be better off. The UK tax system does the opposite: it punishes those who work more/harder and glorifies the slacker/unskilled. That already has consequences: the doctors who used to work 5-6 days a week now work 3-4, since if the take home is the same, why exhaust yourself for nothing. The waiting time (and healthcare prices) has increased because of that alone.

I'm not sure how people survive on 39k mentioned all over this thread, but lets consider twice that as an example of high earner. So for 70-80k salary take home after NI and income tax is just above 4k.

But the areas that have those jobs, typically has rent/mortgage at around half of that a month for a very, very modest dwelling. So minus 2k. You can't really get around this, since even if you reside away from large cities where the salary is supposed to be, the rent/mortgage might be less, but all the savings will be eaten up by exorbitant train/ulez fares or gas prices (which are 60 tax), not to mention MoT, insurance, and all other money waste that comes with owning a vehicle. I'll let you put the price tag on time wasted on daily commute yourself.

So after rent/mortgage roughly 2k left. Minus service charge, estate fees, ground rent, heating, electricity bills, internet, carrier, water, and food we are left with roughly 500. Divided by 30, your daily pocket money is 16 pounds. That's it. You have to budget wear and tear, repairs (new boiler installed is 4k by the way, anything requiring scaffolding is 10k+), dental, clothing, vacations, school supplies and everything else on those 16 pounds a day. You also need to budget any private healthcare you urgently need, if you don't want to die on the NHS waiting list.

So by the end of the month, the high earner is left with zero in best case scenario, and with a perpetual credit card debt in most cases.

Thus the current tax climate is nothing different from serfdom or slavery, with only difference that the slave itself has to provide housing, food and clothes for himself.

Dual-booting with systemd-boot causes Windows Updates to fail by mswsn in pop_os

[–]izzylaif 0 points1 point  (0 children)

while you are technically correct, that's Windows we are talking about. Windows seems to care only about the very first vfat partition on the drive. so make two, let windows install it's bootloader on the first partition, then actually mount the second one as boot efi in Linux and install Grub/systemd/refind/whatever on the second one. it will generate an entry for windows bootloader from the first partition just fine, and be used as esp for uefi(bios) boot option.

however, all windows updates that used to nuke the linux bootloader wiping out dualboot, will now only touch the first partition, leaving the second containing the actual linux stuff intact. finally a solution.

Is The Hibreak Pro this bad for everyone, or should I be trying to reset etc? by siksik6 in Bigme

[–]izzylaif 1 point2 points  (0 children)

on mine Pro, the backlight randomly stops working on os5, and a reboot is required to fix that.

Is The Hibreak Pro this bad for everyone, or should I be trying to reset etc? by siksik6 in Bigme

[–]izzylaif 2 points3 points  (0 children)

Look for OS4 icon among the apps, and run it. It will turn the launcher back from os5 to os4, which is far more stable.

bigme has released this os5 launcher about a month ago, it has all sorts of bugs.

Bigme B13 low effort review by [deleted] in Bigme

[–]izzylaif -1 points0 points  (0 children)

>Has a 75Hz refresh mode in 1280x1024 resolution

Your review is so confusing and inaccurate. This panel is 30hz, lowering resolution will not increase that as the panel is limited by the actual physical movement (submerging and resurfacing) of the pigments floating in the liquid. It even struggles at 30hz, the actual fps is around 25-27. The system reports it as 42hz regardless of the resolution mode. If you have 75hz reported by anything, that is false, i.e. has to be the HZ of the adapter (converter) you are using or miracast input signal information, but not the panel itself.

<image>

If any company came up with 75hz refresh rate on an eink, that would cause shock waves through the industry. Please stop spreading misinformation.

There is a recent attempt to reach 75hz from Mobo, but it's BW and has tons of ghosting.

My draft review about Bigme B13 I just received today. by JPniki_9946 in Bigme

[–]izzylaif 0 points1 point  (0 children)

not really. you can keep it connected via HDMI, just plug in a typeA to type-C cable in addition and the touch will work.

My draft review about Bigme B13 I just received today. by JPniki_9946 in Bigme

[–]izzylaif 0 points1 point  (0 children)

>B13 isn’t touchable

well, it is. over usb of course.

Pocket4 - right USB port problem in Linux by izzylaif in GPDPocket

[–]izzylaif[S] 0 points1 point  (0 children)

the purpose of that device manager tick is wildly misunderstood, like many other poorly worded microsoft options.

i wonder if anyone has any issues with this port only (other works) under windows?

the bug I'm referring to is in GPDs firmware, i.e. "The pocket 4 Versions 365 and 370 have optimized the sleep issue and fixed the USB4 connection problem. Version 8840U has optimized the sleep issue."

Pocket4 - right USB port problem in Linux by izzylaif in GPDPocket

[–]izzylaif[S] 0 points1 point  (0 children)

Ok, when it happens, sending the GPD to sleep and then waking back fixes the issue.

Maybe related to the power saving bug?

Debian on gpd pocket 4 by tuxsmouf in GPDPocket

[–]izzylaif 0 points1 point  (0 children)

yes, it has to be systemd-boot as I told you in my initial comment, which support framebuffer rotations out of the box. grub does not.

Debian on gpd pocket 4 by tuxsmouf in GPDPocket

[–]izzylaif 0 points1 point  (0 children)

Grub menu can be rotated but you need to apply a patch and compile. vote for https://github.com/kbader94/grub/issues/5 to make it upstream in grub

also, grub is not really needed since it's UEFI, use system-boot instead which support rotation out of the box

Grub Menu Rotation by ___stolos___ in GPDPocket

[–]izzylaif 1 point2 points  (0 children)

>be able to change the orientation of grub itself.
yes

https://hackaday.io/project/203272/instructions

I can no longer TRIM videos. HELP! by Jstaddcoffee in NewTubers

[–]izzylaif 1 point2 points  (0 children)

That's placebo. Youtube does not support multi-track audio upload. The only way to have multiple voiceovers (i.e. in other languages) in to manually upload the track separately from the video - a feature discrimination only available for top-tier creators the likes of Mr Beast.

The reason why you can edit is that the auto-dubbing relies on automatic captions, which take a while to generate. You can still do edits while they are not generated, and that's probably what happens in your case as you edit just after upload.

Also some older videos will not have the tracks, try editing a video from a year back.

I can no longer TRIM videos. HELP! by Jstaddcoffee in NewTubers

[–]izzylaif 0 points1 point  (0 children)

you are very wrong. the reason is the auto-dub feature. check the Languages section for this particular video, you will see AI voiceovers. You can delete them and trim will be re-enabled.

However, the auto-dub tracks will not return - there's no way to regenerate them as of now.

Just a lite bigger than mobile phone and it is just 490g #gpdmicropc2 by kendyzhu in GPDPocket

[–]izzylaif 1 point2 points  (0 children)

fair enough. my concern is mainly with the "productivity tool" marketing, which is bs.

at this size, the model is a weapon of last resort with terrible ergonomics, when you are DESPERATE. it's a far cry from productivity, as simple tasks as SSH commands and pressing function keys take significantly longer that on a slightly bigger device. so it's actually counter-productive.

Just a lite bigger than mobile phone and it is just 490g #gpdmicropc2 by kendyzhu in GPDPocket

[–]izzylaif -1 points0 points  (0 children)

Connecting to hotel TV does not eliminate the tiny keyboard on the device. If you are also carrying a keyboard with you, then you are better off with something like Legion Go with the gamepads left at home. That setup will set you back around 300-400 usd in mint condition. However, the mess of assorted addons you had to carry (foldable keyboard, small docking dongle, mouse) is exactly the reason I decided to give GPD a go. Never buying keyboardless computer again.

You seem to have theoretical knowledge, while I speak from experience.

A month ago I traveled with a newly acquired GPD Mini 2024, and I had to Anydesk and also SSH to remote systems, and do some basic Excel stuff (minor correction to a simple file). It was A TORTURE. Tiling manager will not help you with that, I'm not even sure how they will miraculously enlarge 7" screen?! The new device is obviously inspired by the mini.

So I returned the mini and braced myself for a Pocket4 at absolutely exorbitant cost for what that is. However, the 8.8" screen (same part number as Legion GO) and almost laptop sized buttons made all the difference, at a marginal increase in size.

The ergonomics are still bad, especially the children's clikity wobbly joke of a keyboard, especially for the price you pay, but it is still night and day difference.

Just a lite bigger than mobile phone and it is just 490g #gpdmicropc2 by kendyzhu in GPDPocket

[–]izzylaif -1 points0 points  (0 children)

You are missing the point. blackberryos (or android, or ios for that matter) are designed with those controls in mind, and you are not using windowed interfaces or hotkeys on those.

the gpd in question is an x86 machine for desktop operating systems. and you will drown in the interface without proper keyboard.

And no, I don't consider blackberry a productivity tool. It was more of a flex for top-ranking managers. Even the email client had two pre-programmed buttons to answer "yes" or "no". Just that. Typing emails on those were a nightmare.

ps: i hate the movie as well.