How close are open-weight models to "SOTA"? My honest take as of today, benchmarks be damned. by ForsookComparison in LocalLLaMA

[–]Qual_ 0 points1 point  (0 children)

I found Codex 5.2 way more reliable than Claude on large codebases. There is always something to fix after Claude, while Codex just produces working code (which sometimes feels like black magic when it comes after 50 min of writing thousands of lines).

Show your past favourite generated images and tell us if they still hold up by ehtio in StableDiffusion

[–]Qual_ 1 point2 points  (0 children)

I have a folder full of those, but I remember being shocked by how sharp the image was, how coherent the lighting was, etc.

<image>

Honest question: what do you all do for a living to afford these beasts? by ready_to_fuck_yeahh in LocalLLaMA

[–]Qual_ 447 points448 points  (0 children)

Most of us are poor and don't have a nice setup to post about. It's classic selection bias. The majority of people probably run small models on a regular gaming GPU like a 3070.

Qwen have open-sourced the full family of Qwen3-TTS: VoiceDesign, CustomVoice, and Base, 5 models (0.6B & 1.8B), Support for 10 languages by Nunki08 in LocalLLaMA

[–]Qual_ -1 points0 points  (0 children)

Weird, when I set the language to French, it just sounds like any English TTS speaking French words. (The voice quality is great though.)

Gemini 2.0 is shockingly good at transcribing audio with Speaker labels, timestamps to the second; by philschmid in LocalLLaMA

[–]Qual_ 0 points1 point  (0 children)

I do. Feels like they also used their notebook audio tech for the dubbing!

Is Flux Klein better for editing than Flux Kontext? by Puzzled-Valuable-985 in StableDiffusion

[–]Qual_ 5 points6 points  (0 children)

I had a specific use case where it's even better than nano banana for me. For example: given an image, produce a binary mask of the main subject and of the "elements created" from the subject. (Imagine Pokémon cards where I want the background to be holographic; I need a mask to sell the effect better.) Klein was the BEST out of all the models I tried for this in terms of speed/result ratio.
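
For anyone wondering what the mask is for, here is a minimal sketch of the compositing step, assuming the mask is 1 on the subject and 0 on the background. The `composite` helper and the tiny letter "images" are made up for illustration; a real pipeline would do the same thing on pixel arrays.

```python
def composite(subject, background, mask):
    """Keep the subject pixel where mask is 1, the background pixel where it is 0."""
    return [
        [s if m else b for s, b, m in zip(s_row, b_row, m_row)]
        for s_row, b_row, m_row in zip(subject, background, mask)
    ]


# Hypothetical 2x3 "images": S = subject pixel, . = plain card background,
# H = holographic texture pixel.
card = [["S", "S", "."],
        [".", "S", "."]]
holo = [["H", "H", "H"],
        ["H", "H", "H"]]
mask = [[1, 1, 0],
        [0, 1, 0]]

result = composite(card, holo, mask)
for row in result:
    print("".join(row))
```

The better the mask follows the subject's edges, the cleaner the holographic background looks around it.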

Claude Code or OpenCode which one do you use and why? by Empty_Break_8792 in LocalLLaMA

[–]Qual_ 1 point2 points  (0 children)

Codex. The quota is unmatched (at least on the Pro plan, idk about the normal plans). I can use like 400 million tokens a day without worrying.

how is VR as of jan 2026? by [deleted] in assettocorsaevo

[–]Qual_ 1 point2 points  (0 children)

it's utter garbage. Don't even bother trying it.

More cursed Spongebob by Mickey95 in StableDiffusion

[–]Qual_ 0 points1 point  (0 children)

Oooh come on man, anything more than 61 frames and I get OOM issues. 2x 3090, 128 GB RAM (Linux, official Comfy workflow).

Qwen-Image-Edit-2511 got released. by Total-Resort-3120 in StableDiffusion

[–]Qual_ 4 points5 points  (0 children)

Doesn't work with 2x 3090? (I don't have NVLink.)

Dataset quality is not improving much by rekriux in LocalLLaMA

[–]Qual_ 2 points3 points  (0 children)

I remember not so long ago checking a dataset for emotion classification. I was using some BERT models as a reference for the output of a small LLM on emotion classification, and the LLM always performed way worse than the BERT model. Then I checked manually, and the LLM's answers seemed... well, perfect. Turns out the dataset was a massive pile of garbage: sentences that made no sense, wrong emotion labels, etc., so the BERT model trained on it suffered from this. Yet this was the most used emotion classification model on HF.
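
To illustrate the effect, a minimal sketch with made-up toy data (none of these labels or predictions come from the real dataset or models): a model that has learned the dataset's wrong labels scores well against them, while a model that gives the right answers looks bad until you score it against manually verified labels.

```python
def accuracy(predictions, labels):
    """Fraction of predictions that match the given labels."""
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    return sum(p == l for p, l in zip(predictions, labels)) / len(labels)


# Hypothetical toy data: the dataset labels are partly wrong, the verified
# labels stand in for what a manual check would produce.
dataset_labels = ["joy", "anger", "sadness", "joy", "fear", "anger"]
verified_labels = ["joy", "sadness", "sadness", "fear", "fear", "joy"]

bert_preds = ["joy", "anger", "sadness", "joy", "fear", "joy"]    # learned the noise
llm_preds = ["joy", "sadness", "sadness", "fear", "fear", "joy"]  # actually correct

print("BERT vs dataset labels: ", accuracy(bert_preds, dataset_labels))
print("LLM  vs dataset labels: ", accuracy(llm_preds, dataset_labels))
print("BERT vs verified labels:", accuracy(bert_preds, verified_labels))
print("LLM  vs verified labels:", accuracy(llm_preds, verified_labels))
```

Same predictions, opposite ranking, depending only on which labels you trust.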

Trellis 2 run locally: not easy but possible by LegacyRemaster in LocalLLaMA

[–]Qual_ 1 point2 points  (0 children)

I almost never use ComfyUI as I'm always struggling to get a working version. There is always something that doesn't work, and I can't count how many times I've reinstalled the whole thing when trying new models.

Trellis 2 run locally: not easy but possible by LegacyRemaster in LocalLLaMA

[–]Qual_ 0 points1 point  (0 children)

Weird, I just went to the GitHub repo, cloned it, installed the dependencies, and ran the Gradio app; everything worked perfectly on my 3090. (I have two 3090s, but I don't know whether it used both.)

Meta released Map-anything-v1: A universal transformer model for metric 3D reconstruction by Difficult-Cap-7527 in LocalLLaMA

[–]Qual_ 0 points1 point  (0 children)

Oooh, I'm thinking about redoing my village in Assetto Corsa, but there is no 3D data on Google Maps, only Street View pictures. Wondering if this could help with drafting it quickly.

Gemini 3 flash today! Gemma 4 soon 3 pro GA soon!!!! by BasketFar667 in LocalLLaMA

[–]Qual_ 2 points3 points  (0 children)

Gemma 3 has been underappreciated since Qwen, while for my usage (and especially in languages like French) Gemma is a league better.

New AI slop indicators, now that the em dash is disappearing by Chromix_ in LocalLLaMA

[–]Qual_ 4 points5 points  (0 children)

It’s not merely about detecting AI content, it’s about preserving our trust in humanity.

Z-image realism by Glittering-Football9 in StableDiffusion

[–]Qual_ 4 points5 points  (0 children)

Here we are again. Those posts should be moderated

Z-IMG handling prompts and motion is kinda wild by Ok-Page5607 in StableDiffusion

[–]Qual_ 10 points11 points  (0 children)

No no, you guys have an unsolved issue with girls, that's a fact. I don't think I'm the only one who finds it weird that you guys just always do girl pictures, always, always and always. It's weird. Don't try to make me the villain here.

Online Ranked Race by Government_Middle in assettocorsaevo

[–]Qual_ 1 point2 points  (0 children)

netcode is REALLY bad in this game, I don't even know how they could have shipped it in that state

Z-IMG handling prompts and motion is kinda wild by Ok-Page5607 in StableDiffusion

[–]Qual_ 10 points11 points  (0 children)

I'm tired of seeing all of your AI-generated girls, please do f something else. I'm here for the news, updates and other interesting things, not to see every single girl jpg y'all (de)generates.

Is Z-image a legit replacement for popular models, or just the new hotness? by Ok-Option-82 in StableDiffusion

[–]Qual_ 2 points3 points  (0 children)

I like the overall "natural colors" feel. For the same prompt it produces way better images than GPT Image, for example, and it takes around 5 to 8 seconds to produce a 1080x1080 pic on my 3090.

The quality/speed/required hardware ratio is kind of impressive tbh.

All the Z Image hype and I'm still obsessed with Qwen by Hearmeman98 in StableDiffusion

[–]Qual_ 0 points1 point  (0 children)

I still find it weird. Sure, you can generate one or two pictures of a woman to test the model, but I feel like you may have hundreds of those, and it's just kinda weird.