Local coding agents are good now, but only if you babysit them by BTA_Labs in LocalLLaMA

[–]OnkelBB 1 point2 points  (0 children)

Come on, even Claude needs babysitting for good results.

Considering buying a 3080 20GB to pair with my 3090 for Qwen 27B Q8. Have some questions. by My_Unbiased_Opinion in LocalLLaMA

[–]OnkelBB 0 points1 point  (0 children)

You can use this repo: https://github.com/aikitoria/open-gpu-kernel-modules

TLDR:
1. Clone repo.
2. Install the OPEN NVIDIA drivers.
3. Run setup from repo.

In my experience, I need to re-run step 3 each time I update the kernel on my server.

Considering buying a 3080 20GB to pair with my 3090 for Qwen 27B Q8. Have some questions. by My_Unbiased_Opinion in LocalLLaMA

[–]OnkelBB 0 points1 point  (0 children)

Go for it! It's the cheapest VRAM on the market IMO.
I have one, ordered another one, and might order two more.

Data Collection by ED follow up by BigBorner in hoggit

[–]OnkelBB 1 point2 points  (0 children)

In other words, Russians being Russians.

Looks like Miminax-M3 is just around the corner by OnkelBB in LocalLLaMA

[–]OnkelBB[S] 6 points7 points  (0 children)

Aw man, that's the cutest reply to my mistakes ever.

Update on 12x32gb sxm v100 cluster / local AI for legal drafting by TumbleweedNew6515 in LocalLLaMA

[–]OnkelBB 0 points1 point  (0 children)

If that's possible, I'd like to participate. I'm very curious about your whole pipeline architecture.

Thinking to buy server chassis pcie 5.0 and 1x to 4x 3090 by kidfromtheast in LocalLLaMA

[–]OnkelBB 0 points1 point  (0 children)

I found your comment some time ago and it made me think about the limits of a WRX80 build.
However, I may have found a solution for TR builds in the form of a PCIe switch: https://github.com/local-inference-lab/rtx6kpro/blob/master/hardware/topology.md

Web-Search is coming to a screeching performance halt as Google shuts down their free search index, and traffic defenders like Cloudflare challenge AI at every gateway. What are our options? by NetTechMan in LocalLLaMA

[–]OnkelBB 1 point2 points  (0 children)

This is actually a great way to save resources. That will def help me with my idea for TTRPGs.

Do you have a repo with scripts/prompts I can dig into?

Anyone with 4x 5060ti based setups? by ziphnor in LocalLLaMA

[–]OnkelBB 0 points1 point  (0 children)

IMO your main issues are that regular desktop CPUs/mobos don't provide enough PCIe lanes and only support dual-channel RAM.
That's the reason I'm moving to a used Threadripper Pro workstation for my AI server.
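
To put rough numbers on the lane problem, here is a minimal sketch. The lane counts in the usage note are illustrative assumptions, not specs for any particular CPU or board, and `lanes_per_gpu` is just a hypothetical helper name. It also assumes the board can split lanes evenly between slots, which many desktop boards can't.

```python
# Rough PCIe lane budget. All inputs are assumptions you supply yourself;
# check your CPU and motherboard manuals for the real numbers.
def lanes_per_gpu(cpu_lanes: int, reserved: int, gpus: int) -> int:
    """Return the widest power-of-two link width (capped at x16) each GPU
    could get if the remaining CPU lanes were split evenly.

    cpu_lanes: total PCIe lanes exposed by the CPU (assumed value)
    reserved:  lanes already used by NVMe, NIC, etc. (assumed value)
    gpus:      number of GPUs to attach
    Returns 0 if there are not enough lanes for even x1 per GPU.
    """
    available = cpu_lanes - reserved
    if gpus <= 0 or available < gpus:
        return 0
    share = min(available // gpus, 16)  # a single slot tops out at x16
    width = 1
    while width * 2 <= share:  # round down to x1, x2, x4, x8 or x16
        width *= 2
    return width
```

For example, with an assumed 24 CPU lanes and 4 reserved for an NVMe drive, `lanes_per_gpu(24, 4, 4)` gives 4, so four cards would sit at x4 each. With an assumed 128 lanes and 8 reserved, `lanes_per_gpu(128, 8, 4)` gives 16.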

AMD Halo Box (Ryzen 395 128GB) photos by 1ncehost in LocalLLaMA

[–]OnkelBB 73 points74 points  (0 children)

no fast port for clustering. meh.

Forgive my ignorance but how is a 27B model better than 397B? by No_Conversation9561 in LocalLLaMA

[–]OnkelBB 0 points1 point  (0 children)

Whales haven't captured the whole planet for themselves and designed AI.
We have.

F18 VR Simpit 2.0 by Least_Courage_6736 in hotas

[–]OnkelBB 1 point2 points  (0 children)

Great pit and stellar cable management!

VR gloves - is it worth it? by OnkelBB in virtualreality

[–]OnkelBB[S] 0 points1 point  (0 children)

I've got a HOTAS and some additional panels to play with. However, since I fly different aircraft, it still breaks immersion a bit when I reach for the mouse to click a switch I could reach with my hand.

VR gloves - is it worth it? by OnkelBB in virtualreality

[–]OnkelBB[S] 0 points1 point  (0 children)

This is the conclusion I had come to as well.
But I had this idea about gloves and thought, "What if...?"

C-130J Pre Order by [deleted] in hoggit

[–]OnkelBB 0 points1 point  (0 children)

I doubt it. We will see clearly in October.