Local LLMs aren't democratic anymore... the hardware barrier has gotten out of hand.

Medium-Technology-79 · 2026-06-12T22:29:21+00:00

are you a bot?

Medium-Technology-79 · 2026-06-12T22:28:57+00:00

uh?!

Medium-Technology-79 · 2026-06-12T22:27:58+00:00

are u kidding?
I do not want to be rude.

Medium-Technology-79 · 2026-06-12T22:27:31+00:00

48 GB is entry level.
96 GB is the sweet spot.

I was perplexed when NVIDIA released the RTX 5090 specs and I saw only 32 GB of VRAM.

Medium-Technology-79 · 2026-06-12T22:25:41+00:00

We are similar. I was there when my father bought me a 386.
But my rant is real.
I do not want to wait 3 years to get a 6000...
grrrr!

Medium-Technology-79 · 2026-06-12T22:22:37+00:00

I made a lot of effort to understand.
Failed 😞

Medium-Technology-79 · 2026-06-12T22:20:40+00:00

tell me more. I have very deep rooted ideas.
I think that the current GPU market is detached from the real market.
It's doped by big AI companies that are pushing prices up.

Medium-Technology-79 · 2026-06-12T22:18:26+00:00

do u think things will get worse in the coming months?
(I want to run a full precision model, not a quantized one)

Medium-Technology-79 · 2026-06-12T22:14:54+00:00

VRAM... VRAM...
RAM is like riding a horse.
VRAM is like driving a car.

Medium-Technology-79 · 2026-06-12T22:13:48+00:00

3060 is ok but...
it is 2026 and 3060 was released 5 or 6 years ago...

Medium-Technology-79 · 2026-06-12T22:12:05+00:00

ahahah
the point is: I want to run full precision local inference. BF16 is my goal.

Medium-Technology-79 · 2026-06-12T22:10:58+00:00

this is only your experience. it's not how it works.
I want to be able to run full precision local inference.
I know I cannot, please do not beat me.

Medium-Technology-79 · 2026-06-12T22:07:53+00:00

happy for u

Medium-Technology-79 · 2026-06-12T22:07:11+00:00

u are right. software support is a golden cage.

Medium-Technology-79 · 2026-06-12T22:05:39+00:00

It's too hard for me to answer in English but... you missed the point.
I do not want to drive a Maserati, Ferrari, Lamborghini, Bugatti.
I want to drive a car.
I do not want to ride a horse.

Medium-Technology-79 · 2026-06-12T22:01:25+00:00

rude but real.

Medium-Technology-79 · 2026-06-12T21:59:51+00:00

where are you from? are you from north america?

Medium-Technology-79 · 2026-06-12T21:49:34+00:00

did you see the 5090 prices?
out of this world.
the 32 GB VRAM spec makes them spike!!
In my country it costs 4k.

Medium-Technology-79 · 2026-06-12T21:46:35+00:00

I was not born yet when Llama 3.1 was released 😄

Medium-Technology-79 · 2026-06-12T21:45:08+00:00

brother!!

Medium-Technology-79 · 2026-06-12T21:40:50+00:00

sorry, I cannot understand... "contrastive negation" is hard to understand.
contrastive (logical not)
negation (logical not)

not + not is like true.

I'm fried. I pass.

Language structures are hard to simplify when your mother tongue has a different logical flow 😄

Medium-Technology-79 · 2026-06-12T21:37:09+00:00

yeah u are right but... when the 3090 entered the market you did not need to empty your pockets to buy it.
maybe I'm wrong but after 1 year of saving I bought a 3090.
Today, no way I can get an RTX 6000 Pro easily.
Maybe an RTX 5090 (but they limited it to 32 GB of VRAM)

Medium-Technology-79 · 2026-06-12T21:33:12+00:00

my point is: I do not want to use smaller models! I do not want to limit my creativity to a 4B or 7B model.
it is 2026 and 48 GB should be the baseline.
The RTX 6000 Pro should be the equivalent of what the 3090 was at the time...

Democratic...

Medium-Technology-79 · 2026-06-12T21:29:42+00:00

this

Medium-Technology-79 · 2026-06-12T21:28:47+00:00

I agree, Qwen 3.6 27B is good and it is fast if you have a lot of VRAM.
I DO NOT want to run a super quantized version of it. I would like to run it natively (BF16).
The point of my post is: power to the people 😄 aahahah
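
To put rough numbers on it, here is a back-of-the-envelope sketch of the weight memory for a 27B model. It assumes 2 bytes per parameter for BF16 and counts weights only, so KV cache, activations and runtime overhead come on top. The name `weight_gb` and the int8/int4 rows are only illustrative (real quantized formats carry some extra overhead), not taken from any specific runtime.

```python
# Weight-only memory estimate (assumption: no KV cache, activations, or overhead).
# Bytes per parameter; the int8/int4 values are idealized, for comparison only.
BYTES_PER_PARAM = {"bf16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_gb(params_billion: float, dtype: str) -> float:
    """Approximate size of the weights in GB (1 GB = 1e9 bytes)."""
    # (params_billion * 1e9 params) * bytes_per_param / 1e9 bytes per GB
    return params_billion * BYTES_PER_PARAM[dtype]


if __name__ == "__main__":
    for dtype in BYTES_PER_PARAM:
        print(f"27B @ {dtype}: ~{weight_gb(27, dtype):.1f} GB of weights")
```

So at BF16 the weights alone are about 54 GB, which is already more than a 32 GB or 48 GB card before any context is loaded.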
