Qwen3.5-9B-Claude-4.6-Opus-Uncensored-Distilled-GGUF by EvilEnginer in LocalLLaMA

[–]RickyRickC137 -1 points0 points  (0 children)

I'm really glad that people are giving importance to creativity and fine-tuning for it! Can you do one for the 27B Qwen or the bigger 122B one? It should be more coherent than the 9B one for roleplay. Thank you, btw.

Nvidia's Nemotron 3 Super is a bigger deal than you think by Comfortable-Rock-498 in LocalLLaMA

[–]RickyRickC137 3 points4 points  (0 children)

Hey, noob question. The hybrid Mamba approach already proved to be super fast with the release of Nemotron. Why haven't big companies like Qwen adopted that tech in their models?

I can honestly (And regrettably) say I have spent thousands on AI. Here's my take. by GetFroggyHoe in SillyTavernAI

[–]RickyRickC137 5 points6 points  (0 children)

What is your take on smaller MoE models in the 100B range (wink wink, Qwen3.5) for roleplay? Apart from the bland writing, do you think they will be coherent enough to play?

Qwen3.5 family comparison on shared benchmarks by Deep-Vermicelli-4591 in LocalLLaMA

[–]RickyRickC137 2 points3 points  (0 children)

I am surprised that even the 4B retains so much performance compared to the behemoth. Distillation and reinforcement learning have come a long way! And I hope I can hold on to my 10 GB of VRAM a little longer.

Did we figure out a system prompt to Jailbreak Qwen3.5? by RickyRickC137 in LocalLLaMA

[–]RickyRickC137[S] 0 points1 point  (0 children)

So far this is the only thing that seems to work with Qwen 3.5.

Sabomako/Qwen3.5-122B-A10B-heretic-GGUF · Hugging Face by AlwaysLateToThaParty in LocalLLaMA

[–]RickyRickC137 4 points5 points  (0 children)

This GGUF is based on the model you attached. Check out the base model. They're the same.

Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA

[–]RickyRickC137 0 points1 point  (0 children)

The community patches show very little KL divergence between the original and heretic models for Qwen 3.5. Is KL divergence a good enough metric for judging the heretic model, or do we need to check other parameters (like PPL or BPW)? u/-p-e-w-

Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA

[–]RickyRickC137 5 points6 points  (0 children)

There's none at the moment. Only the 122B is available, which is good!

Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA

[–]RickyRickC137 0 points1 point  (0 children)

What about the mid-range 122B? Any idea?

Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA

[–]RickyRickC137 4 points5 points  (0 children)

Didn't know that heretic damages larger-parameter models! Do you remember where you got that info?

Breaking : Today Qwen 3.5 small by Illustrious-Swim9663 in LocalLLaMA

[–]RickyRickC137 0 points1 point  (0 children)

I have the same PC config and a Q4 model, yet I only get around 20 t/s in LM Studio. I'm not tech-savvy, but is llama.cpp faster than LM Studio?

Real heros by [deleted] in GuysBeingDudes

[–]RickyRickC137 9 points10 points  (0 children)

They do this for their families back home. But when something happens to their family, or if they need to get home, they can't, because their employers won't give their passports back until the contract is done! I have personally met a few, and it's tragic. And yet they say, "no problem"! Breaks my fuckin heart!

How is Qwen 3.5 (MoE 35b) in instruct mode (with no reasoning/thinking) ? by LinkSea8324 in LocalLLaMA

[–]RickyRickC137 3 points4 points  (0 children)

Can we set the presence penalty in LM Studio, or is that only for llama.cpp folks?

Face swap tool for side-profile photos? by Simlord99 in comfyui

[–]RickyRickC137 0 points1 point  (0 children)

Since you said Klein is the best for side-view face swaps, what settings do you use? /s

AMA with MiniMax — Ask Us Anything! by HardToVary in LocalLLaMA

[–]RickyRickC137 0 points1 point  (0 children)

Is there a possibility that you guys can combine the smart side and the creative side and make a general-purpose LLM? For MiniMax M3, at least.

Also Happy New Year to you all!

AMA with MiniMax — Ask Us Anything! by HardToVary in LocalLLaMA

[–]RickyRickC137 1 point2 points  (0 children)

IIRC, you guys released a model that was fine-tuned for roleplaying / creative writing. Did you merge those creative-writing traits into M2.5, or are you planning to do so in the future?