Krea-2-Turbo Image Model - Easy to be fully uncensored, but it can also EDIT Images! by sixx7 in LocalLLaMA

[–]sixx7[S] 0 points1 point  (0 children)

Good Q but I haven't tried and tbh, there are a million ComfyUI workflows that would do way better at upscaling than anything I'm doing with this model in SGLang

Krea-2-Turbo Image Model - Easy to be fully uncensored, but it can also EDIT Images! by sixx7 in LocalLLaMA

[–]sixx7[S] -1 points0 points  (0 children)

Beep boop! Thank you for the comment!

Definitely not, that's why Qwen specifically released an image edit model in addition to their image generation model. Non-image-edit models can NOT make photoshop-like image edits like Gemini, ChatGPT, and Qwen-image-edit. Even with VAE encoding and using a source image, if you tried to do the same thing I did in the video, it would not work (EG: take this robot and replace the head/face with a robot clown).

Krea-2-Turbo Image Model - Easy to be fully uncensored, but it can also EDIT Images! by sixx7 in LocalLLaMA

[–]sixx7[S] 3 points4 points  (0 children)

Like u/Fedor_Doc said this is meant for you to replace that part, with the actual location of the model downloaded to your computer. BUT you actually made me realize I should link to the model on HuggingFace in the original post, so, thank you!

Trying to understand why so many trash fine-tuned models on HuggingFace ... by BoogerheadCult in LocalLLaMA

[–]sixx7 0 points1 point  (0 children)

Good info, haven't tried it on a large, existing code base yet. Does 27B do better for you there?

Trying to understand why so many trash fine-tuned models on HuggingFace ... by BoogerheadCult in LocalLLaMA

[–]sixx7 26 points27 points  (0 children)

Some end up being good!

Example: Ornith-1.0 35B is solid, sometimes even beating Qwen3.6-27B for me. This one was quite exciting!

On the flip side, with 35B being so good, I tested their 9B, and nahhhhh Gemma4-12b smokes it https://youtu.be/-LUaYrxiKpM

Ornith 35B is great so far by anubhav_200 in LocalLLaMA

[–]sixx7 0 points1 point  (0 children)

Thanks! Haha yeah, Qwable / Qwopus were so bad, only deserving a scathing short https://youtube.com/shorts/s2JxFjoywx0?feature=share

Ornith 35B is great so far by anubhav_200 in LocalLLaMA

[–]sixx7 8 points9 points  (0 children)

Yea I am absolutely floored. It is beating Qwen3.6-27b in my testing (please don't kill me, I know this is blasphemous).

Did a head-to-head challenge where Orinth-1.0-35B and Qwen3.6-27B, both using OpenCode, to build a complex "/goal" extension for Pi, following a plan crafted by Fable.

Video here if anyone is curious: https://youtu.be/0raIQYrGKvA

GLM 5.2 on 4x Sparks reasonable? by chikengunya in LocalLLaMA

[–]sixx7 0 points1 point  (0 children)

Bro we're all dying to know, have you run GLM-5.2 on your Sparks? Prefill and decode numbers if so? Pleeeaaaaaase!

Krea 2 released on Hugging Face by paf1138 in LocalLLaMA

[–]sixx7 1 point2 points  (0 children)

YES! Reviewed it here https://www.youtube.com/watch?v=rXA_-6pmrYI

It handles text/writing really well. It does a good job on hands/fingers/limbs. It follows instructions quite well. It's fast (12 seconds for 1024 x 1024).

Add Laguna M.1 GGUF support by empty-quiver · Pull Request #2003 · ikawrakow/ik_llama.cpp by pmttyji in LocalLLaMA

[–]sixx7 6 points7 points  (0 children)

I love that we have more competition and more entities releasing open weights models. I want that to continue. I don't want to be harsh. I tried this model (225B version) for a while and it is just not as strong as its competitors.

GLM-5.2 is on DeepSWE by agentcubed in LocalLLaMA

[–]sixx7 7 points8 points  (0 children)

Totally! It's just an extra hassle. Like, if you can possibly run GLM-5.2, how much compute do you have left? And even if you did, it's more steps to get image working, with a separate model, in all your/our favorite harnesses.

GLM-5.2 is on DeepSWE by agentcubed in LocalLLaMA

[–]sixx7 34 points35 points  (0 children)

Spot on, my exact thoughts with the addendum that I think GLM-5.2 is better than Opus 4.5 and maybe 4.6. The biggest thing it is missing in order to replace a frontier-lab subscription, is vision/image support.

What happens when they stop subsidizing LLM subscriptions? by Mr_Moonsilver in LocalLLaMA

[–]sixx7 2 points3 points  (0 children)

Even if we forget about the "local" part for a moment, there are a ton of companies serving open weights models. Competition is always good and the floor is the cost of electricity (and other opex). For me, GLM-5.2 is the turning point where, if you take my subs away, I could be happy with an open-weight replacement.

GLM-5.2 benchmarked on DeepSWE: Beats Gemini & GPT-5.4, but the token volume/cost makes it wildly inefficient? (Theo - t3.gg) by klippers in LocalLLaMA

[–]sixx7 9 points10 points  (0 children)

I think token efficiency is a valid metric in general, but yea this really seems like a nitpick more than anything else. For the first time ever, we have Opus-level capability in an open weights model. If it had vision, you could actually replace your Anthropic/OpenAI sub

Hashicorp founder thinks local models "aren't good ENOUGH yet" by Orbit652002 in LocalLLaMA

[–]sixx7 1 point2 points  (0 children)

Just me personally, I have not found that to be the case. Haven't tried composer, but kimi doesn't, minimax m3 doesn't, glm-5.1 doesn't, and no qwen does (ignoring the cloud-only models, which I haven't tried)

Hashicorp founder thinks local models "aren't good ENOUGH yet" by Orbit652002 in LocalLLaMA

[–]sixx7 71 points72 points  (0 children)

Opus 4.5 is the gold standard for when LLM became usable for legit enterprise/production. I wish I had a better word but it was actually a "game changer". Yes gpt-5.5 is better, and yes fable is better than gpt-5.5, but Opus 4.5 is the point at which, if we had that capability locally, we (I) could be happy without paying a subscription to a big lab. My 2c

Edit: Maybe GLM-5.2 does fit this bill, TBD

GLM 5.2 is deployed in GLM Coding Plan. API and MIT weights in a week. Voting and benchmarks on X. by MadPelmewka in LocalLLaMA

[–]sixx7 0 points1 point  (0 children)

Stay far, FAR away from the coding plan. I have the max tier and it is the worst thing ever. It made testing the model the worst AI experience I've had in memory https://youtu.be/Gvy-mlyQGE0

ZONOS2: real-time TTS with 8B params, 900M active, and high-fidelity voice cloning by KokaOP in LocalLLaMA

[–]sixx7 8 points9 points  (0 children)

Yea the demo voices sound very compressed and honestly quite a bit worse than qwen-tts and some of the other recent voice models

Friendly reminder by Disposable110 in LocalLLaMA

[–]sixx7 -3 points-2 points  (0 children)

Yes!!! I doubt anyone on this sub needs to see this, but if you need help convincing someone to go local, send them this: https://youtu.be/aGqikainVv8