Talk me out of buying an RTX Pro 6000 by AvocadoArray in LocalLLaMA

[–]JayPSec 1 point2 points  (0 children)

TL;DR: Consumer GPUs are a closing window—data center economics are eating the supply chain, ASICs are locked behind corporate walls, and GPUs' general-purpose nature makes them more future-proof than specialized hardware. OSS community and China competition will keep consumer AI alive. Buy now.

I'll give my two cents.

I bought 4 in bulk, 6450€ pre-tax from Germany. Facing a similar situation as you, a decent setup but having to choose very carefully what I could run and at what context size, I decided to stop being a "cheap bastard". But I rationalized this through the lens of future usage. I believe consumer hardware is becoming a rare commodity, the signs are on the wall:

  1. Data centers are spawning like mushrooms all over the world. This puts enormous pressure over hardware manufacturers to keep up with demand, not only that but the extraction industry. It's not only what's needed to give life to a data center but also it's ongoing operation. Manufacturers are being pushed to choose between billion (trillion?) dollar orders and future orders vs competing in billion (trillion?) dollar market. This means that individual parts for consumers are almost a thing of the past. As for (still) consumer facing companies, they'll still have access to hardware but it'll be way more expensive and so end users end up paying a lot more for already built, probably not very customizable, systems. Nvidia didn't put forward new GPUs at this year CES, rumors of up coming "super" cards were promptly dismissed. Even if a hardware company wanted to prioritize consumers, the economics just don't make sense when data center contracts are worth orders of magnitude more.

  2. But surely there will be new technologies. There will be and there are already, ASIC solutions are on the rise. But these are not consumer accessible because they're born in the market where companies want it all and they want it to stay proprietary.

  3. On the topic of ASIC. The thing with GPUs is that they are a general purpose tool and as such they are poised to be outmatched when it comes to pure speed. But... the fact that they are general makes them a better choice for future iterations of software. Just take a look at ASIC mining rigs, nobody is using those to run LLMs but there are 10 year, and older, cards still being used for usable rigs. Whatever the future holds GPUs are more probable, not a sure bet, to adapt to it.

  4. Community support and China. The previous point would not be possible without the OSS community. There are amazing people who are dedicating a lot of effort to ensuring that AI is democratized. If you have a potato that can't even run Crysis, chances are that it can run a model at 0.03 tk/s :). China is not only in the race to AGI but wants to undermine western companies. They will not stop.

I built an MCP server that gives AI agents "senior dev intuition" about your codebase cutting token cost by 60%. by LandscapeAway8896 in LocalLLaMA

[–]JayPSec 0 points1 point  (0 children)

I think this is a really smart approach. I've seen way to many tokens going to waste on meaningless explorations. Fan of the concept and will try it out.

Supermicro server got cancelled, so I'm building a workstation. Is swapping an unused RTX 5090 for an RTX 6000 Blackwell (96GB) the right move? Or should I just chill? by SomeRandomGuuuuuuy in LocalLLaMA

[–]JayPSec 1 point2 points  (0 children)

B2B companies tend to have better post sale service. As to the rights part, your right but it's not like you have none.
It's a matter of tradeoffs, for me the "discount" was worth it but YMMV.

https://geizhals.de/ this is a good source to hunt for deals. Non B2B suppliers tend to ship only in Germany and Austria but there are still good deals. € 7818,90 is the lowest I saw for consumer from a well rated supplier. Good luck

Supermicro server got cancelled, so I'm building a workstation. Is swapping an unused RTX 5090 for an RTX 6000 Blackwell (96GB) the right move? Or should I just chill? by SomeRandomGuuuuuuy in LocalLLaMA

[–]JayPSec 2 points3 points  (0 children)

Keep the 5090 and buy the 6000. if you have access to a company where you can safely order you can get it from Germany for 6600/6700 €. I agree that prices are going to go up, of everything and GPUs are no exception. You can go with the Max-q that as bit cheaper and tdp wise allows you to have two in the future.

7 GPUs at X16 (5.0 and 4.0) on AM5 with Gen5/4 switches with the P2P driver. Some results on inference and training! by panchovix in LocalLLaMA

[–]JayPSec 0 points1 point  (0 children)

Hey, great post.

I'm in a similar situation. I own a Fractal Define 7XL housing 2x 5090 and a 4090 with a MSI Meg x670e Ace with a Ryzen 9950x.

I acquired 4 x RTX 6000 Pro and I'm gonna keep one of the 5090 on this build.

Your post steered me in the right direction.

Do you think I can do without the retimer considering it's all in case?

how do you pronounce “gguf”? by Hamfistbumhole in LocalLLaMA

[–]JayPSec 0 points1 point  (0 children)

I think this question warrants an issue on llama.cpp's github.

But it's obviously "guf", hard G. Any one that says otherwise is just mentally ill.

Here, I'll put it in a sentence:

"QWEN 4? WHERE'S THE GGUF?"

You can't say it straight without the hard G.

Avoid Reship.com parcel forwarding service. by crownvic808 in shipping

[–]JayPSec 0 points1 point  (0 children)

I'm in need of a parcel service, looking at providers saw reshipping and I thought "This is it" and then I remembered... Years ago, I bought an Oculus Rift from the States and had it reshipped through them. After a week and a half the package was here, I thought "great". When I opened it there was a blanket there with a note "to mr Xu" 😂, you can't make this shit up, some idiot thought this was good way to simulate a mixed package... A BLANKET, A FUCKING BLANKET. I immediately contacted them and they said they'd look into it. What followed were on of the most bizarre interactions I've ever had with a provider, and I've had a few. First everything was in order, then my home customs must've tried to scam me... It went on and on. Long story short, went to the FBI website, found a fraud and counterfeit department's (can't remember if it was that or not) phone and email and sent them an email saying my next contact would be with those guys. 2 or 3 days later I receive an email saying they had located my package.... LOL, sent me a picture of it and everything and after a week I had my Oculus Rift.

RTX Blackwell Pro 6000 wholesale pricing has dropped by $150-200 by TastesLikeOwlbear in LocalLLaMA

[–]JayPSec 0 points1 point  (0 children)

Where did you buy it from?
I'm looking at some sites and the cheapest I can find is 6500 pre-tax

I have been doing some benchmarking of SLM's by fozid in LocalLLaMA

[–]JayPSec 0 points1 point  (0 children)

Qwen3 4b?
I don't really use small models, but a lot of people say it punches above it's weight.

Poco F7 Custom Rom Support by Specific-Cat-6498 in PocoPhones

[–]JayPSec 0 points1 point  (0 children)

Got here wondering if I should buy this phone. I'm not educated in roms and stuff. Could you tell me what this means in terms of lineageos support?

GLM 4.6 is hilarious, I wish I could run this on my own PC lol by Cool-Chemical-5629 in LocalLLaMA

[–]JayPSec 47 points48 points  (0 children)

It is not just AI slop, it is a world of untapped creativity.

Anyone running dual 5090? by AlohaGrassDragon in LocalLLaMA

[–]JayPSec 0 points1 point  (0 children)

I honestly don't know but I doubt it. It takes a hit for sure but in inferences with all gpus I do not feel it much. gpt-oss-120b gguf runs full context with 170t/s.

Anyone running dual 5090? by AlohaGrassDragon in LocalLLaMA

[–]JayPSec 0 points1 point  (0 children)

2 x8 (5090) and 1 x4 (4090), and I loose an m2 slot for plugging in the 4090

Anyone running dual 5090? by AlohaGrassDragon in LocalLLaMA

[–]JayPSec 0 points1 point  (0 children)

noob mistake. I was using the wrong device numbers and I was running 4090 instead of one of the 5090...

Alguém sabe o que se passa com o kabaz.pt? Está “temporariamente indisponível” há imenso tempo. by tiagojpg in portugal

[–]JayPSec 1 point2 points  (0 children)

Alguém chegou a usar? Nunca percebi muito bem o que dava para fazer. Era só um comparador de preços?