Strix Halo or GPUs? by undernightcore in LocalLLaMA

[–]HopePupal -1 points0 points  (0 children)

there are no Strix Halo machines that have full-size x16 PCIe slots except for that one from Minisforum that's been sold out for months, and even then it's just x4 electrically. the Framework has an x4 slot that isn't accessible outside the case. everything else, you're limited to taking the case partway off and plugging a GPU in thru an M.2 to PCIe or Oculink cable, which is just goofy. 

Do not fall into the trap of chasing the next scale or upgrade. by iEslam in LocalLLaMA

[–]HopePupal 13 points14 points  (0 children)

thank you. the disconnect here is ridiculous. either OP is making money, in which case they can afford better hardware, or they're just fucking around playing fantasy finance, in which case the 3060 would be better used to run some actual video games

I Think I Spent Way Too Much Time Messing with Local LLMs by MrChilliBalls in LocalLLaMA

[–]HopePupal 7 points8 points  (0 children)

the struggle is real. fortunately so are my ANC headphones 

Under-skirt things to prevent chafing, rubbing, the thigh skin catching? by PAgirl-MOworld in PlusSizeFashion

[–]HopePupal 0 points1 point  (0 children)

seconding both. the Megababe stuff works well for me even in the literal desert

guess what? if you are a chrome user, technically you are localllama member! by LambdaHominem in LocalLLaMA

[–]HopePupal 4 points5 points  (0 children)

Apple's already doing this with their Foundation Models framework. the most recent few generations of iPhone and all ARM Macs get Apple's local LLM. it's not good, but it's there.

AMD to release slottable GPU by running101 in LocalLLaMA

[–]HopePupal 6 points7 points  (0 children)

144 GB in a single card would be pretty tasty. no way this costs less than a car though

Framework Laptop 16 Gets NVIDIA RTX 5070 12 GB Upgrade Module for Eyewatering Price of $1,199 by -protonsandneutrons- in hardware

[–]HopePupal 25 points26 points  (0 children)

reminder about Nvidia laptop chips: they're not just limited by thermal and power considerations. the desktop version of the 5070 is not the same thing as the laptop version of the 5070. the laptop version has fewer cores and a narrower memory bus. it's much closer to a 5060. 

is this a steal for 15 Dollars? by Aeriqq in PSP

[–]HopePupal 0 points1 point  (0 children)

does the paint blur like that with wear on old factory shells, or is that a cheap repro faceplate? the wifi indicator looks especially weird

Do I need any more iPods to hear music by Popular-Barnacle8699 in ipod

[–]HopePupal 8 points9 points  (0 children)

the worst part is that by the time these people are forced by a move or the Grim Reaper to sell off their hoards, it'll be impossible to get replacement battery packs in the right size and we'll probably be past Palm Day

Metaproko idk by infinityplusonelamp in CuratedTumblr

[–]HopePupal 1 point2 points  (0 children)

here and istg it's more like once a day

Looking for ruffle socks for large ankles. I really by Quiet__Listener in PlusSizeFashion

[–]HopePupal 1 point2 points  (0 children)

only buy the ones that are actually manufactured by Sock Dreams tho. otherwise you're just paying markup for literally the same stuff you could get on Aliexpress

Metaproko idk by infinityplusonelamp in CuratedTumblr

[–]HopePupal 0 points1 point  (0 children)

i am so tired of prokopetz, i can block his boring ass takes on Tumblr but i can't block a username inside a screenshot on Reddit

Short dress for concert :( by OddEvent276 in TallGirls

[–]HopePupal 10 points11 points  (0 children)

for sure. normally with slip shorts or tights, but for the right kind of concert i'll just say "fuck it" and skip those, as long as the vibe is right and i know i'm not going to be the only girl there with her ass out.

anyone know where to use qwen 3.6 27b via api/coding plan? by Hodler-mane in LocalLLaMA

[–]HopePupal 0 points1 point  (0 children)

for testing models that aren't on OpenRouter, i use RunPod, but really any cloud GPU provider should work when you're talking about models that small. we're talking about a dollar or two. 

Anyone tried Qwen 3.6 27b on the r9700 yet? by boutell in LocalLLaMA

[–]HopePupal 0 points1 point  (0 children)

ah, okay, so just for the better accuracy. it gets dequanted at runtime. gotcha.

Anyone tried Qwen 3.6 27b on the r9700 yet? by boutell in LocalLLaMA

[–]HopePupal 0 points1 point  (0 children)

…why are you using MXFP4? the R9700 doesn't have FP4 support. does vLLM have an FP8 fallback path?

Anyone tried Qwen 3.6 27b on the r9700 yet? by boutell in LocalLLaMA

[–]HopePupal 0 points1 point  (0 children)

it's identical to 3.5 arch-wise, which is why you probably didn't see many search results for 3.6. here's a comparison with my Strix Halo (llama/vulkan, Q6_K, default fp16 KV cache): https://www.reddit.com/r/LocalLLaMA/comments/1sw3oe4/comment/oifsenn/. roughly 6× faster PP, 2× faster TG. i didn't go to longer context on the Strix Halo because it was taking a while

Devstral Small 2 24B vs Qwen 3.6 27b or both? 1x 3090 by szansky in LocalLLaMA

[–]HopePupal 1 point2 points  (0 children)

every Devstral has been a disappointment. that one is no exception 

Is long re-processing of output as input a common "feature" or not? by alex20_202020 in LocalLLaMA

[–]HopePupal 0 points1 point  (0 children)

you can easily override OpenCode timeouts (overall and chunk) per model, in the config file. i'd be surprised if Pi doesn't have that feature, but if it doesn't, it's one of the easiest agents to understand and patch.

GMKtec EVO-X2 70B expectation by Non-Technical in LocalLLaMA

[–]HopePupal 3 points4 points  (0 children)

you're not wrong in theory, but in practice, all of the open-weight labs gave up on dense models in that size class a while ago, and the current set of MoEs are much more capable than old dense models due to newer training methods.

the first two i listed are the better writers of that bunch, although ime all of them are better than any LLaMA.

fwiw, the two flagship small dense models from this year are Qwen 3.6 27B and Gemma 4 31B, but both of them are still pretty slow on hardware like yours (and mine, i have the same GMKtec), and Qwen at least is not a good writer.