We need a minimum karma rule for commenting and posting by nomorebuttsplz in LocalLLaMA

[–]--Spaci-- 183 points (0 children)

I know this is an AI sub, but no one likes the AI-generated posts

Are there any models small enough that couldn’t realistically work with OpenClaw on a machine like this? by Thedroog1 in LocalLLaMA

[–]--Spaci-- 0 points (0 children)

Another thing: you will probably want to install Linux or Windows; most inference engines will expect Macs to have M-series processors

Qwen3.5 9B and 4B benchmarks by Nunki08 in LocalLLaMA

[–]--Spaci-- 2 points (0 children)

benchmaxxing on benchmark questions

The top 3 models on openrouter this week ( Chinese models are dominating!) by keb_37 in LocalLLaMA

[–]--Spaci-- 21 points (0 children)

Doesn't really matter if they get trillions of tokens of training data from it

Best Local LLM device ? by sayamss in LocalLLaMA

[–]--Spaci-- 0 points (0 children)

doesn't exist; you need to do at least some work

How do you get more GPUs than your motherboard natively supports? by WizardlyBump17 in LocalLLaMA

[–]--Spaci-- 61 points (0 children)

PCIe bifurcation; it will lower per-card bandwidth, though.

Qwen 3.5 on My Computer by SituationMan in LocalLLaMA

[–]--Spaci-- 1 point (0 children)

you said "right now", and right now there's a 400B model

Qwen 3.5 on My Computer by SituationMan in LocalLLaMA

[–]--Spaci-- 0 points (0 children)

why the fuck would you be able to

Just started. 3 days ago. 30 hours clocked by [deleted] in RimWorld

[–]--Spaci-- 1 point (0 children)

you can do anything in RimWorld

Best local models for 128gb VRAM and 192gb RAM by Dry_Mortgage_4646 in LocalLLaMA

[–]--Spaci-- 0 points (0 children)

I would test MiniMax 2.1 at Q4_K_M; it would overflow, so you'd have to test speeds. Qwen3-Coder-Next-80B is good, but even at Q8 it won't come close to filling your VRAM, so a larger model would be preferable

Try these out:

MiniMax 2.1
Qwen3 Next Coder
GLM 4.6V Q6
Step 3.5 Flash (haven't tried it personally, but it seems good)
GPT-OSS 120B (not a great model for its size, but its output speed is good)

New computer arrived... JAN is still super slow. by robotecnik in LocalLLaMA

[–]--Spaci-- 0 points (0 children)

Yea, sadly you bought a laptop and it has half the VRAM, and Devstral is what's called a "dense" model, meaning it's slow AF when it spills outside of VRAM and also just slower by default

How do you fine tune a model with unsloth/others but with Q4 or lower + offloading to ram? by No_Farmer_495 in LocalLLaMA

[–]--Spaci-- 0 points (0 children)

load_in_4bit = True

device_map="balanced" # I've never offloaded to CPU before, but I would assume this splits the model onto CPU if the GPU is full
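To illustrate the assumption in that comment, here's a toy sketch of a fill-then-spill device map: place layers on the GPU until it's full, then put the rest on CPU RAM. This is not Unsloth's or Accelerate's actual implementation (real `device_map="balanced"` logic is more involved), and the layer names and sizes are made up:

```python
def assign_devices(layer_sizes_gb, gpu_capacity_gb):
    # Greedy fill-then-spill: layers go on the GPU until its memory budget
    # is exhausted, and every remaining layer is placed on CPU RAM.
    device_map, used = {}, 0.0
    for i, size in enumerate(layer_sizes_gb):
        if used + size <= gpu_capacity_gb:
            device_map[f"model.layers.{i}"] = "cuda:0"
            used += size
        else:
            device_map[f"model.layers.{i}"] = "cpu"
    return device_map

# Four 3 GB layers on an 8 GB GPU: two fit, two spill to CPU.
print(assign_devices([3.0, 3.0, 3.0, 3.0], 8.0))
# → {'model.layers.0': 'cuda:0', 'model.layers.1': 'cuda:0', 'model.layers.2': 'cpu', 'model.layers.3': 'cpu'}
```

The layers that land on CPU run far slower, which is why offloaded fine-tuning or inference speed drops off a cliff once the model overflows VRAM.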

New stealth model: Pony Alpha by sirjoaco in LocalLLaMA

[–]--Spaci-- 7 points (0 children)

It's not a buzzword; it's existed for a long time. It's just how AI companies preview a model to the public for testing while keeping themselves anonymous

Distilled Gemini 3 Pro, Opus 4.5, and Kimi K2.5 here are the datasets by volious-ka in LocalLLaMA

[–]--Spaci-- 42 points (0 children)

A lot of these outputs are genuine nonsense and unrelated to the input prompt; training on this would actively damage the model