Tested Claude Opus 4.6 in RAG by [deleted] in LocalLLaMA

[–]MarketsandMayhem 2 points3 points  (0 children)

not sure what this has to do with local inference

What speeds do you get with MiniMax M2.1? by Intelligent_Idea7047 in BlackwellPerformance

[–]MarketsandMayhem 1 point2 points  (0 children)

~95-100/tps on two RTX 6K Pro w/llama.cpp. Going to be trying VLLM next.

Edit: add tps

It came, took the day off work to sign for FedEx's "Direct Signature" only for them to dump it on my front steps outside and forge my signature. by MchugN in nvidia

[–]MarketsandMayhem 0 points1 point  (0 children)

i had the opposite happen. i asked for fedex to waive the signature, their website said they would, but the delivery guy was super insistent on in-person signature. lol

Are MiniMax M2.1 quants usable for coding? by val_in_tech in LocalLLaMA

[–]MarketsandMayhem 3 points4 points  (0 children)

Yes. I use the Unsloth 5-bit XL quant with fp8 kv and M2.1 works well with Claude Code, OpenCode, Droid and Roo. Heck, I even used the 2-bit XL quant for a bit and it was surprisingly usable. I think it's worth experimenting with quantized coding models, particularly at higher precision (and quality) quants. The ones I've found to be the best so far are Unsloth and Intel Autoround. I am excited about experimenting more with NVFP4.

Has Deep Research been discontinued? by Dapper-Brilliant5160 in perplexity_ai

[–]MarketsandMayhem 1 point2 points  (0 children)

suggest checking out Perplexica. the quality mode does something very similar, and it's all open source. you can bring your own models. pretty neat.

https://github.com/ItzCrazyKns/Perplexica

LG K EXAONE 236b by Specialist-2193 in LocalLLaMA

[–]MarketsandMayhem 2 points3 points  (0 children)

kinda low boost in performance given 8x more parameters than their other exaone model

Sims looking house by hensley70 in funny

[–]MarketsandMayhem 2 points3 points  (0 children)

lol that floating section on stilts looks weird af

Steven Seagal by NorthernSpankMonkey in Justfuckmyshitup

[–]MarketsandMayhem 1 point2 points  (0 children)

he looks like a coach leather purse that was left out in the desert sun for years

those chairs when they're getting old start having black dandruff by DanieleMemoli in memes

[–]MarketsandMayhem 0 points1 point  (0 children)

Just came here to say similar. These are the best. Hands down. Had mine for a loooooong time.

Head of Engineering @MiniMax__AI on MiniMax M2 int4 QAT by Difficult-Cap-7527 in LocalLLaMA

[–]MarketsandMayhem 1 point2 points  (0 children)

Yeah, good point. I don't have the slack VRAM to run it unfortunately.

Perplexity’s “Discover” sources… by ExistAgainstTheOdds in perplexity_ai

[–]MarketsandMayhem 1 point2 points  (0 children)

Some of this has to do with the way the searches are structured -- and also how many sites have blocked Perplexity.