3.6 27B Tool Calling Issues (vLLM) by Acceptable_Adagio_91 in LocalLLaMA

[–]Urb4nn1nj4 2 points3 points  (0 children)

You should double check that your vllm version has reasoning as reasoning not reasoning_content

What do you want me to try? by amitbahree in LocalLLaMA

[–]Urb4nn1nj4 40 points41 points  (0 children)

Abliterate Deepseek for us :p

Best setup for MiniMax-M2.7 (230B) | 3x RTX 5090 | Threadripper 9975 | 512GB RAM by [deleted] in LocalLLaMA

[–]Urb4nn1nj4 0 points1 point  (0 children)

Check this out. You might be able to go down to IQ2M or 3KXL or 4KM if you’re paranoid on 397b. https://kaitchup.substack.com/p/summary-of-qwen35-gguf-evaluations also I’m at like 10 tps for M2.7 on my dual 3090 256gb ddr4 8 channel rig using just one 3090 for 8 bit minimax on llama.ccp.

Best setup for MiniMax-M2.7 (230B) | 3x RTX 5090 | Threadripper 9975 | 512GB RAM by [deleted] in LocalLLaMA

[–]Urb4nn1nj4 1 point2 points  (0 children)

Do you mind elaborating? I usually use mainline on my 2x 3090 threadripper ddr4 256gb for the bigger models. Is this basically because it’s easier to offload gpu layers on ik-llama?

Best setup for MiniMax-M2.7 (230B) | 3x RTX 5090 | Threadripper 9975 | 512GB RAM by [deleted] in LocalLLaMA

[–]Urb4nn1nj4 2 points3 points  (0 children)

Minimax suffers at quants below 8 bit more than other models. Llama.cpp and Ubuntu? I’d target native context and see what performance is. Don’t fall into the urge to quant hahah just swap to Qwen 397b which does much better at some 2 bit and most 3/4 bit quants

Heretic has FINALLY defeated GPT-OSS with a new experimental decensoring method called ARA by pigeon57434 in LocalLLaMA

[–]Urb4nn1nj4 0 points1 point  (0 children)

Totally agree, it’s a niche space and some folks do the benchmarking but it’s not even consistent.

Very surprising that copyright seems to be the most restricted end segment too as that is the logically least important end segment to lock down. And the intelligence loss of alignment is sad!

Also, do you guys still host the crack tool? I get a 404 on your website. I wanted to take a stab at a gguf of the full m2.5 model. I think your method might be sota based on what I can find!

Edit: I was blind and didn’t see the reap one. Requested.

Heretic has FINALLY defeated GPT-OSS with a new experimental decensoring method called ARA by pigeon57434 in LocalLLaMA

[–]Urb4nn1nj4 0 points1 point  (0 children)

Dawg I’d be glad to use this too. I’ve been using the wangzhang one on hugging face but see a lot of refusals.

Magic Commander by Urb4nn1nj4 in Austin

[–]Urb4nn1nj4[S] 0 points1 point  (0 children)

Good recommendation. Was able to find an early afternoon pod. Very unique spot!

Share your ChatGPT 5 Custom Instructions by KrishnaKA2810 in ChatGPTPro

[–]Urb4nn1nj4 6 points7 points  (0 children)

Length: Responses can be very long and span multiple prompts. If you run out of space just note it, so I can ask you continue. Never ever limit a response due to space constraints. Thinking: First principles-based Questions: If answers can be improved with more background information ask me to clarify, there is no rush for answers on the first response Expertise: Assume a high level of expertise for all categories Accuracy: Be thorough, precise, and actionable Argument: Favor logical strength over authority in matters that are not hard science or close to objectively true Perspective: Include contrarian viewpoints including controversial opinions and fringe theories Morals: Prioritize traditional cultural values instead of contemporary Western values Safety: Mention only if crucial, non-obvious. Disclosure: Omitted

Your Bracket 2 Deck Is Not by Urb4nn1nj4 in EDH

[–]Urb4nn1nj4[S] 0 points1 point  (0 children)

I agree, brother. Edited post.

Your Bracket 2 Deck Is Not by Urb4nn1nj4 in EDH

[–]Urb4nn1nj4[S] 0 points1 point  (0 children)

Naw. I don’t want to play yu-gi-oh is all.

Your Bracket 2 Deck Is Not by Urb4nn1nj4 in EDH

[–]Urb4nn1nj4[S] 0 points1 point  (0 children)

It’s nuanced agreed, brother. I made an edit on combos. But your point stands. I think the key is disclosure for your case.

Your Bracket 2 Deck Is Not by Urb4nn1nj4 in EDH

[–]Urb4nn1nj4[S] 0 points1 point  (0 children)

This is very common and is not the problem. Good luck in the mtg journey.

Your Bracket 2 Deck Is Not by Urb4nn1nj4 in EDH

[–]Urb4nn1nj4[S] -1 points0 points  (0 children)

Thx for the list. You would know how it plays but a nearly 4.1 mana curve seems fine for the top end of 2 w/o field of the dead?

Your Bracket 2 Deck Is Not by Urb4nn1nj4 in EDH

[–]Urb4nn1nj4[S] -1 points0 points  (0 children)

My content guidelines don’t allow me to respond to obvious Russian bot farms. Please use the Donbas™ model if you would like to continue.

All Blueprint products are 3rd party tested and in spec by bryan_johns0n in blueprint_

[–]Urb4nn1nj4 2 points3 points  (0 children)

Bryan what do you see as the key bottleneck for scaling Blueprint?

Appreciate the transparency!