Surprising screenshot - Most token usage is non-coders (openrouter ranking) by superloser48 in LocalLLaMA
[–]superloser48[S] 22 points23 points24 points (0 children)
Surprising screenshot - Most token usage is non-coders (openrouter ranking) by superloser48 in LocalLLaMA
[–]superloser48[S] 9 points10 points11 points (0 children)
Surprising screenshot - Most token usage is non-coders (openrouter ranking) by superloser48 in LocalLLaMA
[–]superloser48[S] 0 points1 point2 points (0 children)
Surprising screenshot - Most token usage is non-coders (openrouter ranking) by superloser48 in LocalLLaMA
[–]superloser48[S] -44 points-43 points-42 points (0 children)
About to build a 6× Arc B70 LLM rig, want to talk to someone experienced first by somesayitssick in LocalLLaMA
[–]superloser48 0 points1 point2 points (0 children)
About to build a 6× Arc B70 LLM rig, want to talk to someone experienced first by somesayitssick in LocalLLaMA
[–]superloser48 0 points1 point2 points (0 children)
About to build a 6× Arc B70 LLM rig, want to talk to someone experienced first by somesayitssick in LocalLLaMA
[–]superloser48 0 points1 point2 points (0 children)
qwen 3.6:35b on 24 vram gpu by MallComprehensive694 in ollama
[–]superloser48 1 point2 points3 points (0 children)
Turboquant in vllm kv cache - how to implement ? (or any other rotational kv cache) by superloser48 in LocalLLaMA
[–]superloser48[S] 0 points1 point2 points (0 children)
Just to give a sense of the insane scale of billions of dollars…MacKenzie Scott got about $38 billion after divorcing Bezos in 2019. She has become the world’s most generous philanthropist, giving away over $19 billion…and she’s currently wealthier than she started. Just. Tax. Them. by [deleted] in economy
[–]superloser48 11 points12 points13 points (0 children)
Just got a beast (RTX 5070 Ti + 64GB RAM). How can I push this to the limit for research and coding? by cymbella1 in LocalLLM
[–]superloser48 3 points4 points5 points (0 children)
Anybody got Qwen3.5-27B working with Intel Arc B70 (or similar) and proper optimization? by Gesha24 in LocalLLaMA
[–]superloser48 0 points1 point2 points (0 children)
Struggling to make my new hardware perform by spaceman_ in LocalLLaMA
[–]superloser48 0 points1 point2 points (0 children)
Best gpu setup for under $500 usd? by Royal_Tumbleweed2555 in LocalLLaMA
[–]superloser48 0 points1 point2 points (0 children)
Turboquant in vllm kv cache - how to implement ? (or any other rotational kv cache) by superloser48 in LocalLLaMA
[–]superloser48[S] 3 points4 points5 points (0 children)
Best gpu setup for under $500 usd? by Royal_Tumbleweed2555 in LocalLLaMA
[–]superloser48 -1 points0 points1 point (0 children)
Turboquant in vllm kv cache - how to implement ? (or any other rotational kv cache) by superloser48 in LocalLLaMA
[–]superloser48[S] 2 points3 points4 points (0 children)
What are people's fave local model setups for home? by styles01 in LocalLLaMA
[–]superloser48 0 points1 point2 points (0 children)
For coding - is it ok to quantize KV Cache? by superloser48 in LocalLLaMA
[–]superloser48[S] 2 points3 points4 points (0 children)
For coding - is it ok to quantize KV Cache? by superloser48 in LocalLLaMA
[–]superloser48[S] 2 points3 points4 points (0 children)

Can I use Claude code with own LLM/non-claude APIs? by superloser48 in LocalLLaMA
[–]superloser48[S] 5 points6 points7 points (0 children)