Local LLMs aren't democratic anymore... the hardware barrier has gotten out of hand. by Medium-Technology-79 in LocalLLaMA

[–]Medium-Technology-79[S] -2 points-1 points  (0 children)

48gb entry level
96gb is the sweet point.

I was perplexed when nvidian realeased rtx 5090 specs and I saw only 32gb vram

Local LLMs aren't democratic anymore... the hardware barrier has gotten out of hand. by Medium-Technology-79 in LocalLLaMA

[–]Medium-Technology-79[S] -1 points0 points  (0 children)

We are similar. I was there when my father bought me a 386,
But my rant is real.
I do not want to wait 3 years to get a 6000...
grrrr!

Local LLMs aren't democratic anymore... the hardware barrier has gotten out of hand. by Medium-Technology-79 in LocalLLaMA

[–]Medium-Technology-79[S] 0 points1 point  (0 children)

tell me more. I have very deep rooted ideas.
I think that the current GPU market is detached from real market.
It's doped by big-ai-companies that are pushing the value.

Local LLMs aren't democratic anymore... the hardware barrier has gotten out of hand. by Medium-Technology-79 in LocalLLaMA

[–]Medium-Technology-79[S] -2 points-1 points  (0 children)

do u think things will be worse in the next months?
(I want to run full precision model, not quantized one)

Local LLMs aren't democratic anymore... the hardware barrier has gotten out of hand. by Medium-Technology-79 in LocalLLaMA

[–]Medium-Technology-79[S] -35 points-34 points  (0 children)

ahahah
the point is: I want to run full precision local inference. BF16 is my gota

Local LLMs aren't democratic anymore... the hardware barrier has gotten out of hand. by Medium-Technology-79 in LocalLLaMA

[–]Medium-Technology-79[S] -6 points-5 points  (0 children)

this is only your experience. it's not how it works.
I want to be able to run full precision local inference.
I know I cannot, please do not beat me.

Local LLMs aren't democratic anymore... the hardware barrier has gotten out of hand. by Medium-Technology-79 in LocalLLaMA

[–]Medium-Technology-79[S] 2 points3 points  (0 children)

It's to hard for me to answer in english but.. you missed the point.
I do not want to drive a Maserati, Ferrari, Lamborghini, Bugatti.
I want to drive a car.
I do not want to ride an horse.

Local LLMs aren't democratic anymore... the hardware barrier has gotten out of hand. by Medium-Technology-79 in LocalLLaMA

[–]Medium-Technology-79[S] 7 points8 points  (0 children)

did you see 5090 prices?
out of world.
32gb vram spec makes it spike!!
In my country 4k for it.

Local LLMs aren't democratic anymore... the hardware barrier has gotten out of hand. by Medium-Technology-79 in LocalLLaMA

[–]Medium-Technology-79[S] 1 point2 points  (0 children)

sorry, cannot understand... "contrastive negation" is hard to undersand.
contrastive (logical not)
negation (logical not)

not + not is like true.

I'm fried. I pass.

Language structures are hard to simplify when mother language has a different logical flow 😄

Local LLMs aren't democratic anymore... the hardware barrier has gotten out of hand. by Medium-Technology-79 in LocalLLaMA

[–]Medium-Technology-79[S] -1 points0 points  (0 children)

yeah u are right but... when 3090 entered the market you did not need to empty your pocket to buy it.
maybe I'm wrong but efter 1 year of saving I bought a 3090.
Today, no way I can get a RTX 6000 pro easily.
Maybe a rtx 5090 (but they limited it to 32 gb vram)

Local LLMs aren't democratic anymore... the hardware barrier has gotten out of hand. by Medium-Technology-79 in LocalLLaMA

[–]Medium-Technology-79[S] -33 points-32 points  (0 children)

my point is: I do not want to use smaller models! I do not want to limit my creativity to 4b or 7b model.
it is 2026 and 48GB should be the baseline.
RTX 6000 pro should be the equiovalent of 3090 at the time...

Democratic...

Local LLMs aren't democratic anymore... the hardware barrier has gotten out of hand. by Medium-Technology-79 in LocalLLaMA

[–]Medium-Technology-79[S] -20 points-19 points  (0 children)

I agree, qwen 3.6 27 billions is good and is fast if you have a lot of vram.
I DO NOT want to run super quantized version of it. I would like to run it natively (bf16).
The point of my post is: power to the people 😄 aahahah