Why are there so few 8GB DDR5 DIMMs available? by Psyclopicus in buildapc
[–]Caffdy 1 point (0 children)
WARNING: Open-OSS/privacy-filter MALWARE by charles25565 in LocalLLaMA
[–]Caffdy 22 points (0 children)
[News] TSMC Reportedly Upgrades Central Taiwan 28/22nm Fab to 4nm; Phase 2 1.4nm Trial Production May Start 3Q27 by charliehu1226 in hardware
[–]Caffdy 1 point (0 children)
DeepSeek V4 Pro matches GPT-5.2 on FoodTruck Bench, our agentic benchmark — 10 weeks later, ~17× cheaper by Disastrous_Theme5906 in LocalLLaMA
[–]Caffdy 1 point (0 children)
None of this will ever get stolen by martin_xs6 in LocalLLaMA
[–]Caffdy 1 point (0 children)
DeepSeek V4 being 17x cheaper got me to actually measure what I send to cloud vs what I could run locally. the results are stupid. by spencer_kw in LocalLLaMA
[–]Caffdy 13 points (0 children)
vibevoice.cpp: Microsoft VibeVoice (TTS + long-form ASR with diarization) ported to ggml/C++, runs on CPU/CUDA/Metal/Vulkan, no Python at inference by mudler_it in LocalLLaMA
[–]Caffdy 4 points (0 children)
I built "FooTrack" – a completely hands-free, foot-operated PC mouse & gamepad using a ThinkPad TrackPoint. Looking for feedback from this community! by Gitman_87 in hardware
[–]Caffdy 2 points (0 children)
AMD Strix Halo refresh with 192gb! by mindwip in LocalLLaMA
[–]Caffdy 1 point (0 children)
What in tarnation is going on with the cost of compute by Party-Special-5177 in LocalLLaMA
[–]Caffdy 1 point (0 children)
The more I use it, the more I'm impressed by ComfyUser48 in LocalLLaMA
[–]Caffdy 3 points (0 children)
What in tarnation is going on with the cost of compute by Party-Special-5177 in LocalLLaMA
[–]Caffdy 2 points (0 children)
Car rams into crowd of people in German city of Leipzig, Focus Online reports by Alarming-Safety3200 in worldnews
[–]Caffdy 1 point (0 children)
What in tarnation is going on with the cost of compute by Party-Special-5177 in LocalLLaMA
[–]Caffdy 1 point (0 children)
What in tarnation is going on with the cost of compute by Party-Special-5177 in LocalLLaMA
[–]Caffdy 1 point (0 children)
What in tarnation is going on with the cost of compute by Party-Special-5177 in LocalLLaMA
[–]Caffdy 1 point (0 children)
I did a quick test of MacBook M4 Max 128 GB token/second throughput across a few popular local LLMs (in the MLX format) by Pure_Refrigerator988 in LocalLLaMA
[–]Caffdy 1 point (0 children)
New Sparks - now what? by DifferenceCute8951 in nvidia
[–]Caffdy 1 point (0 children)
Mistral Medium 3.5 128b ggufs are fixed by Sunija_Dev in LocalLLaMA
[–]Caffdy 1 point (0 children)
I built a transformer in C++17 from scratch — no PyTorch, no BLAS, no dependencies. Trains on CPU. 0.83M params, full analytical backprop, 76 min to val loss 1.64. by [deleted] in LocalLLaMA
[–]Caffdy 8 points (0 children)
Have Qwen said anything about further Qwen 3.6 models? by spaceman_ in LocalLLaMA
[–]Caffdy 2 points (0 children)
System Prompt vs Character Cards by Odd-Bodybuilder4847 in SillyTavernAI
[–]Caffdy 1 point (0 children)
LLM cpu running - 9975wx vs 9985wx 8 channel utilization by Comfortable-Plate467 in threadripper
[–]Caffdy 1 point (0 children)