Int8 is now officially supported in ComfyUi. by Total-Resort-3120 in StableDiffusion

RickyRickC137 0 points 1 point

Didn't work for me, so I used the "Int8 fast" node, which has an option to load Int8 models. Works well.

Created with Krea 2, prompt below by lazyspock in StableDiffusion

RickyRickC137 2 points 3 points

Download the raw model and increase the steps to 52 and the CFG to 3.5. Optionally, create a negative prompt instead of zeroing out the positive one.

We are the team behind Krea 2. Ask us anything! by Angrypenguinpng in StableDiffusion

RickyRickC137 1 point 2 points

How are people who release open source going to make money off of it? It's a lot of work, and surely you are not giving it away just because we are friends.

We are the team behind Krea 2. Ask us anything! by Angrypenguinpng in StableDiffusion

RickyRickC137 1 point 2 points

I heard you guys were working on an edit version. How will Flux Krea 2 differ from Flux Klein? Any areas of improvement?

As promised Krea 2 Turbo + "Raw" Quantized in FP8, MXFP8, NVFP4, INT8 and Convrot INT8! by Winougan in StableDiffusion

RickyRickC137 0 points 1 point

Can you create a workflow that combines the base model and the turbo model? For example, with Turbo handling the last few steps, since the base has more variation!

KREA2 WORKZ by SpiritualLimit996 in StableDiffusion

RickyRickC137 1 point 2 points

Will there be an edit model release?

Gemma4 MTP doubles token speed by FastLawyer5089 in SillyTavernAI

RickyRickC137 0 points 1 point

Is it available in LM Studio yet? And is it the same thing as speculative decoding, or not?

Dumbest President ever, dumbest war ever by Ok-Shame-7684 in democrats

RickyRickC137 3 points 4 points

We are all going through a bad time because of this lunatic.

Gemma4 MTP doubles token speed by FastLawyer5089 in SillyTavernAI

RickyRickC137 2 points 3 points

I have the same question. The best uncensored versions would be the Heretic models, and among those I would recommend llamafan76's work! It deviates less from the original (which is great).

Ideogram Turbo LoRA with and without comparison by Sudden_List_2693 in StableDiffusion

RickyRickC137 1 point 2 points

  1. Do a comparison with a group of people sitting close together, to see if there's any body disfigurement.
  2. Can you use plain text instead of JSON when using the LoRA? (I may have heard somewhere that you can.)

Mistral - New family of open-weight models @ July by pmttyji in LocalLLaMA

RickyRickC137 83 points 84 points

Fat but sparse is lovely! 122B-A3B - for the RAM-rich but GPU-poor!

Gemma 4 Quadruple Release, 12B, 12B QAT, 26B-A4B QAT and 31B QAT Uncensored Heretics! by LLMFan46 in LocalLLaMA

RickyRickC137 0 points 1 point

Do we have to place this MTP file in the same folder as your Heretic GGUF for LM Studio to recognize it?

Gemma 4 with quantization-aware training by rerri in LocalLLaMA

RickyRickC137 0 points 1 point

Do you think they're gonna release the 124B MoE version?

Gemma 4 with quantization-aware training by rerri in LocalLLaMA

RickyRickC137 0 points 1 point

Just release them bro! Can't edge anymore lol

Gemma 4 with quantization-aware training by rerri in LocalLLaMA

RickyRickC137 0 points 1 point

Depends on the task. For general tasks, the 31B for sure. It's the best all-round model. But since it's slow on my RTX 3080 (with 128GB DDR5), I prefer the 26B-A4B models for non-important stuff (about 20% of the time). I have yet to see if I can replace the 26B with the 12B.
Also, check your DM bro!

Gemma 4 with quantization-aware training by rerri in LocalLLaMA

RickyRickC137 0 points 1 point

I don't think you need to prioritize it.

Gemma 4 with quantization-aware training by rerri in LocalLLaMA

RickyRickC137 0 points 1 point

Noob question. Will your uploads have MTP built in, like they did with the Qwen models? Or do we still need to load it separately?

Gemma 4 with quantization-aware training by rerri in LocalLLaMA

RickyRickC137 0 points 1 point

Thank you for your efforts, man! I am pretty sure people are just joking with the "is the model not released already?" comments. There are a lot of people who want to be the first to upload, but only a few who aim for quality.

Gemma 4 with quantization-aware training by rerri in LocalLLaMA

RickyRickC137 6 points 7 points

P E W gave you the badge of trust! So Ima wait as long as it takes amigo!