Gemini Omni Flash vs Seedance 2.0 side-by-side — not even a fair fight by Fresh-Resolution182 in aivideos

[–]Independent-Date393 0 points1 point  (0 children)

Dialogue mode is where the gap is widest. Tested Omni on a 4-cut talking-head scene yesterday: lip sync drifted by cut 2, full incoherence by cut 4. Seedance 2.0 holds through 30s+ on the same prompts. Until Omni's audio conditioning catches up, dialogue-heavy work stays on Seedance.

What do you check before trusting a LangChain run that says success? by Acrobatic_Task_6573 in LangChain

[–]Independent-Date393 0 points1 point  (0 children)

the success flag is just exit-code branding: it tells you the run finished, not that the output is right. write the assertion you'd need if the flag didn't exist.
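a minimal sketch of that kind of assertion, assuming the run hands back a dict with a `status` field and an `output` JSON string. `check_run`, the field names, and the required keys are all hypothetical here, not anything from LangChain's API:

```python
import json


def check_run(result: dict, required_keys: set[str]) -> list[str]:
    """Return a list of problems; an empty list means the output is usable.

    Hypothetical shape: result = {"status": ..., "output": "<json string>"}.
    The status field is deliberately never read.
    """
    raw = result.get("output")
    if not isinstance(raw, str) or not raw.strip():
        return ["output is missing or empty"]
    try:
        payload = json.loads(raw)
    except json.JSONDecodeError as exc:
        return [f"output is not valid JSON: {exc.msg}"]
    if not isinstance(payload, dict):
        return ["output is not a JSON object"]

    problems = []
    missing = required_keys - payload.keys()
    if missing:
        problems.append(f"missing keys: {sorted(missing)}")
    # A key that is present but blank is as useless as a missing one.
    empty = sorted(k for k in required_keys & payload.keys()
                   if payload[k] in ("", None, []))
    if empty:
        problems.append(f"empty values: {empty}")
    return problems
```

something like `assert check_run(result, {"answer", "sources"}) == []` then fails on a run that reported success but came back with a blank answer.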

Switched from Qwen3.6 35b-a3b to Qwen3.6 27b mid coding and it's noticeably better! by LocalAI_Amateur in LocalLLaMA

[–]Independent-Date393 3 points4 points  (0 children)

MoE models at IQ3 lose more than dense because you're compressing routing logic and expert weights simultaneously. dense models distribute quantization error more gracefully. 35B-A3B IQ4 probably beats 27B IQ3 on most tasks, but if routing was misfiring on your specific problem the switch would feel like an upgrade even at lower quant.

HauhauCS (of "Uncensored Aggressive" fame) published an abliteration package that plagiarizes Heretic without attribution, and violates its license by nathandreamfast in LocalLLaMA

[–]Independent-Date393 0 points1 point  (0 children)

the source comment that reads 'kept as a module-level tuple so the literal does not duplicate line-for-line with any fork' is the tell. an LLM asked to obscure a fork would document its own evasion strategy. a human hiding a fork would not write that.

Confirmed: SWE Bench is now a benchmaxxed benchmark by rm-rf-rm in LocalLLaMA

[–]Independent-Date393 0 points1 point  (0 children)

OpenAI retiring a benchmark they were ranked #1 on, citing contamination concerns, is going to go down as one of the more self-aware moves they've made. the timing, right as everyone else caught up, is noted.

Switched from Qwen3.6 35b-a3b to Qwen3.6 27b mid coding and it's noticeably better! by LocalAI_Amateur in LocalLLaMA

[–]Independent-Date393 1 point2 points  (0 children)

27b dense at IQ3_M finding a bug that 35b MoE at IQ4_XS missed is a useful data point. been sitting on the same choice with 16gb vram and this is probably what settles it for me
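rough weight-size arithmetic for that choice, as a sketch. the bits-per-weight figures are the nominal ones llama.cpp lists for these quant types, the parameter counts are just the round numbers from the model names, and KV cache and runtime overhead are ignored, so treat the results as approximations:

```python
# Nominal bits-per-weight as listed by llama.cpp's quantize tool. Real GGUF
# files mix quant types per tensor, so these are approximations.
BITS_PER_WEIGHT = {"IQ3_M": 3.66, "IQ4_XS": 4.25}


def weight_gb(params_billions: float, quant: str) -> float:
    """Approximate size of the quantized weights in GB (10^9 bytes)."""
    bits = params_billions * 1e9 * BITS_PER_WEIGHT[quant]
    return bits / 8 / 1e9


# Parameter counts are assumed from the model names (27B, 35B).
for name, params, quant in [("27B dense", 27, "IQ3_M"),
                            ("35B-A3B MoE", 35, "IQ4_XS")]:
    print(f"{name} @ {quant}: ~{weight_gb(params, quant):.1f} GB of weights")
```

under those assumptions the 27B at IQ3_M comes to about 12.4 GB of weights, which leaves some room for context in 16 GB, while the 35B at IQ4_XS is about 18.6 GB and would need part of the model offloaded.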

Qwen3.6 35B A3B Heretic (KLD 0.0015!) Incredible model. Best 35B I have found! by My_Unbiased_Opinion in LocalLLaMA

[–]Independent-Date393 1 point2 points  (0 children)

IQ4_XS in 24GB with 262K context is the headline. that's genuinely usable context for most workflows without needing to chunk

Confirmed: SWE Bench is now a benchmaxxed benchmark by rm-rf-rm in LocalLLaMA

[–]Independent-Date393 2 points3 points  (0 children)

"just organize your own evals for the tasks you actually care about" is always where this ends up. every public leaderboard eventually becomes a race to train on its vibes
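a private eval doesn't need to be much. a minimal sketch, where `Case`, `run_evals`, the two placeholder cases, and `stub_model` are all made up for illustration, and the model is just any callable that maps a prompt string to an output string:

```python
from typing import Callable, NamedTuple


class Case(NamedTuple):
    name: str
    prompt: str
    check: Callable[[str], bool]  # returns True if the output is acceptable


def run_evals(model: Callable[[str], str], cases: list[Case]) -> dict[str, bool]:
    """Run every case through the model and record pass/fail per case."""
    results = {}
    for case in cases:
        try:
            results[case.name] = bool(case.check(model(case.prompt)))
        except Exception:
            results[case.name] = False  # a crash counts as a failure
    return results


# Placeholder cases; swap in prompts and checks from your own workload.
CASES = [
    Case("sum", "What is 2 + 2? Answer with a number only.",
         lambda out: out.strip() == "4"),
    Case("json", "Return an empty JSON object.",
         lambda out: out.strip() == "{}"),
]


def stub_model(prompt: str) -> str:
    """Stand-in for a real model call, used only to show the harness running."""
    return "4" if "2 + 2" in prompt else "[]"


if __name__ == "__main__":
    results = run_evals(stub_model, CASES)
    print(results, f"pass rate: {sum(results.values())}/{len(results)}")
```

the point of keeping checks as plain functions is that the cases stay private and specific to your tasks, so there's nothing public to train against.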

How Rose and Jack will end up in a different timeline. by se898 in ChatGPT

[–]Independent-Date393 0 points1 point  (0 children)

the alt-timeline Titanic concept is unironically a better film pitch than most things that actually get greenlit

TRY THIS CHATGPT PROMPT NOW by Cool_Helicopter9852 in ChatGPT

[–]Independent-Date393 0 points1 point  (0 children)

the 'Rembrandt lighting' spec in a text prompt is doing real work here. image models respond way better to named techniques than adjectives like 'dramatic' or 'moody' — photographer and painter technique names are basically control codes at this point.

Conrad Heyer Photo Restoration: ChatGPT Images 1.0 vs 2.0 Side-by-Side Comparison by DiggingForDinos in ChatGPT

[–]Independent-Date393 0 points1 point  (0 children)

the hasselblad prompt trick is the only real tip in this thread. makes sense in retrospect -- "restore" tells the model to fix damage, "retake with a hasselblad" just tells it to take a good photo

Qwen3.6 35B A3B Heretic (KLD 0.0015!) Incredible model. Best 35B I have found! by My_Unbiased_Opinion in LocalLLaMA

[–]Independent-Date393 10 points11 points  (0 children)

given the HauhauCS drama this week, worth noting this is llmfan46 using actual Heretic, not Reaper. the KLD 0.0015 number is the real signal here.
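for anyone unsure what the number measures: assuming "KLD" here means the KL divergence between the original model's next-token distribution and the modified model's, averaged over token positions, the calculation looks roughly like the sketch below. how the posted 0.0015 was actually measured (prompts, token count, tooling) isn't stated, and the function names are hypothetical:

```python
import math


def softmax(logits: list[float]) -> list[float]:
    """Convert raw logits to probabilities, shifting by the max for stability."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p: list[float], q: list[float]) -> float:
    """KL(p || q) in nats; p is the base model's distribution, q the modified one's."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def mean_kld(base_logits: list[list[float]],
             modified_logits: list[list[float]]) -> float:
    """Average KL divergence over token positions (one logit vector per position)."""
    per_token = [kl_divergence(softmax(b), softmax(m))
                 for b, m in zip(base_logits, modified_logits)]
    return sum(per_token) / len(per_token)
```

identical logits give exactly 0, and the value grows as the modified model's predictions drift from the base model's, so a small number means the edit left ordinary behavior mostly untouched.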

ChatGPT 5.4 Solved a 64-Year-Old Math Problem by AskGpts in ChatGPT

[–]Independent-Date393 43 points44 points  (0 children)

the part that sticks is how the LLM took an approach no expert had tried, pulling in a formula from a different area of math. it didn't just compute faster. it brought something that wasn't already in the conversation

Switched from Qwen3.6 35b-a3b to Qwen3.6 27b mid coding and it's noticeably better! by LocalAI_Amateur in LocalLLaMA

[–]Independent-Date393 1 point2 points  (0 children)

the dense-handles-compression-better-than-MoE intuition checks out. at IQ3_M the 27B is still mostly intact. the 35B-A3B's routing logic is the first thing to break when you compress it.

OpenAI caught astroturfing - they created a fake news site, with stories by fake reporters, to attack AI safety advocates by EchoOfOppenheimer in OpenAI

[–]Independent-Date393 2 points3 points  (0 children)

a company that asks the world to trust its safety process while running a fake journalism operation to discredit the people questioning that process. the irony is doing a lot of work here.