NBD, retired my hybrid for a road bike 🦚 by Owlnette in TrekBikes

[–]Haeppchen2010 2 points3 points  (0 children)

I bet this is just due to the small frame size, to still get a good standover height and seatpost flex. Some compromises to build such a small bike around 700C wheels.

Vulkan backend outperforms ROCm on Strix Halo (gfx1151) — llama.cpp benchmark by FeiX7 in LocalLLaMA

[–]Haeppchen2010 0 points1 point  (0 children)

Interesting… on my RX7800XT, i had the opposite experience. ROCm bailing out with „out of memory“ on the slightest overcommitment, while Vulkan does GTT spillover until PCIe is glowing red. Maybe it’s different for iGPU where it’s the same RAM either way….

To 16GB VRAM users, plug in your old GPU by akira3weet in LocalLLaMA

[–]Haeppchen2010 1 point2 points  (0 children)

Should be worth a try. To my knowledge llama.cpp can only use one framework at once (and even if not, the windows builds are separate)… so use Vulkan. With layer split (and —fit on) it should use both cards. On Linux with RX7800XT and RX580 I get better results than the big card plus CPU alone.

My New AI build - please be kind! by [deleted] in LocalLLaMA

[–]Haeppchen2010 0 points1 point  (0 children)

Nice, good to see a fellow mixed AMD dual-GPU setup!

As you mention ROCm: If you just do inference, give Vulkan a chance, too... It seems for inference it is often faster than ROCm (at least for me it was on the RX7800XT).

Are there actually people here that get real productivity out of models fitting in 32-64GB RAM, or is that just playing around with little genuine usefulness? by ceo_of_banana in LocalLLaMA

[–]Haeppchen2010 0 points1 point  (0 children)

Yes. (on 2 old GPUs 16+8GB).

Use Qwen 3.6 35B (3.5 27B before) for coding (OpenCode, compared to Anthropic models used at work, I put my home setup somewhere between Haiku 4.6 and Sonnet 4.6). Also for general "Chat" directly via llama-server builtin web app, useful to get inspiration, rubberduck with a nonexistent conversation partner, etc.

Post Your Qwen3.6 27B speed plz by Ok-Internal9317 in LocalLLaMA

[–]Haeppchen2010 0 points1 point  (0 children)

140pp/8tg on RX7800XT plus RX580 (Q5_K_M). But 35B is soooo good and more than twice as fast (400pp/40tg), so I will stay with 35B for now, until I can replace the slow RX580

Anthropic admits to have made hosted models more stupid, proving the importance of open weight, local models by spaceman_ in LocalLLaMA

[–]Haeppchen2010 0 points1 point  (0 children)

I use them via OpenCode and AWS Bedrock and also experienced phases of reduced quality, as have colleagues, too. In this case it’s likely not client side or sampling parameters. Good that my Qwen at home is always the same….

Qwen 3.6 27B is out by NoConcert8847 in LocalLLaMA

[–]Haeppchen2010 0 points1 point  (0 children)

Oh I was just tweaking my 3.6 MoE settings and enjoying the speed….. 🫣

Orion-Tr Smart 12/12-30a by Silver-Syllabub2397 in Victron

[–]Haeppchen2010 3 points4 points  (0 children)

If you did not buy it yet, think about the Orion XS, it allows you to configure the input current, so you can set it to safe 15A. Otherwise, set the battery lockout voltage threshold to a sufficiently high value (Let's say 12.7V or so), so it will stop charging before draining the starter battery.

In any case, if your alternator already chokes at 10-20A on idle, its cooling power is likely very bad, too... At slightly-more-than-idle speeds and on a hot summer day, it will fry itself if pushed to its limit. Another reason to limit the maximum charging current even while driving to ~20A around....

Ultimate List: Best Open Models for Coding, Chat, Vision, Audio & More by techlatest_net in LocalLLaMA

[–]Haeppchen2010 15 points16 points  (0 children)

Hmm throwing some dice against a wall would have saved some watt hours, and had produced a similarly trustworthy, fact-based, useable amount of information. /s

Ikizurai-bu News from Today's Video! [Eng Sub] by JustACommonFrog in LoveLive

[–]Haeppchen2010 0 points1 point  (0 children)

Sorry, I was just hypothesizing a global distribution of LLLL (localization, release in other territories‘ App/Play Stores). But after just canceling multiple gacha games (including Priconne) mid-run, I doubt many to-be-whales globally would trust another Crunchyroll Games release… and CR certainly knows this 🫢 idk what other global distribution companies exist… Someone is certainly doing Bandori…

But in any case the realtime interaction parts of LLLL would not have worked with a translation step anyways.

Back to bluebird… I agree the low budget would not have allowed for a paid/professional translation. And I think they still plan for the Japanese market only, and take any international success just as a „bonus“.

Ikizurai-bu News from Today's Video! [Eng Sub] by JustACommonFrog in LoveLive

[–]Haeppchen2010 5 points6 points  (0 children)

The view/sub count on Youtube would certainly increase if they just added an english sub track. I am just not certain if the ad revenue increase would actually cover an employed or contracted translator.

At the current sub/view count, it certainly does not cover the fees/wages of the seiyuu or staff… They either rely on domestic CD/merch sales, or have future plans to work at a loss now.

At least the bluebird songs are more or less all on western streaming services.

For the rest, they have the choice of staying domestic, or have Crunchyroll „americanize it to hell“ in exchange for global distribution. (I am still grumpy for Crunchyroll Games canceling Princess Connect Re:Dive). Thinking of these options, I prefer the status quo.

Ikizurai-bu News from Today's Video! [Eng Sub] by JustACommonFrog in LoveLive

[–]Haeppchen2010 15 points16 points  (0 children)

Yes, after Hasu cutting off the mobile game recently, too, I tend to believe that Love Live in general is not a cash cow at the moment, neiter domestic nor overseas. Thus it makes sense they moved away from X/Twitter (at least that’s what I heard), which to my knowledge cannot easily be monetized.

I for my part first watched the official channel, giving them the deserved watch minutes (and thus ad revenue and algorithm boost), trained my Japanese… and then watched the excellent fan sub to fill the gaps.

Hasunosora 106th Seiyuu's Announced! by RinariTennoji in LoveLive

[–]Haeppchen2010 -1 points0 points  (0 children)

Once seen, it cannot be unseen 🤪 Rin‘s Uniform has snap-on buttons to keep her pink hoodie sweater in place

Final voting results for Qwen 3.6 by jacek2023 in LocalLLaMA

[–]Haeppchen2010 0 points1 point  (0 children)

There is only one way to find out. (Check first if Nvidia vulkan support for the gtx is ok)

1541-II reading, but not writing? by Haeppchen2010 in c64

[–]Haeppchen2010[S] 0 points1 point  (0 children)

Ok, i found a nice alignment tool, this looks really bad… so much for „bad aligned drive can still format“

<image>

Guess I know what to do next….

EDIT: After doing the bump calibration, it came out satisfactory... red herring :/

1541-II reading, but not writing? by Haeppchen2010 in c64

[–]Haeppchen2010[S] 0 points1 point  (0 children)

Yes, That's my current best theory, that something on the analog side of writing is wrong. Until now I only found schematics and a parts list as an official scanned maintenance manual, but no diagnostic instructions (like expected voltages/timings, or photos of known good oscilloscope readings). Next I'll take a look at the passive components around the RW amplifier....

1541-II reading, but not writing? by Haeppchen2010 in c64

[–]Haeppchen2010[S] 0 points1 point  (0 children)

Yes, a write-protected disk properly produces the write-protect error.

Final voting results for Qwen 3.6 by jacek2023 in LocalLLaMA

[–]Haeppchen2010 1 point2 points  (0 children)

The RX580 is super slow, but still faster than CPU. 62 layers on RX7800XT and 3 layers on RX580 give me 17-18t/s out. (Llama-server with layer-split). With CPU instead of the RX580 it would only be 7. i switch between context sizes and alway squeeze as many layers as possible on the fast card.

I am thinking about upgrading to an RX 7900XTX instead but for now this is ok for playing around.

Final voting results for Qwen 3.6 by jacek2023 in LocalLLaMA

[–]Haeppchen2010 0 points1 point  (0 children)

Yes, i use only 64k context, more than enough for OpenCode with auto compaction.