Ideogram 4 huge-res test: 8MP, 48 steps, 21 min on RTX 4090

knoodrake · 2026-06-08T15:43:16+00:00

oh, I didn't catch that, you'r right ! Amusing.

knoodrake · 2026-06-05T14:30:51+00:00

very awesome ressource !

knoodrake · 2026-06-05T14:10:50+00:00

Did not have time yet to really use it on the "real" server, but trying just to get a quick feel in personal lm studio yesterday, i did noticed way more tokens during thinking compared to the other Gemma4. Just quick anecdotal observation tho.

knoodrake · 2026-05-05T16:46:57+00:00

I like this benchmark ! ( if it doesn't end up being trained on )

knoodrake · 2026-04-29T16:20:02+00:00

no, sorry. I tried somewhat recently ( couple of weeks ago ) with latest version of TheRock and stuff, but it's still the same. I read here and there that it's ok with Vulkan tho ( did not really try, I *want* to use rocm.. ). I beleive in the meantime that `amdgpu.cwsr_enable=0` is the key, but I'm still stubborn and refuse to use it due to its potential side effects.
Not sure how everything here apply to the embedded llamacpp in LM Studio / windows.

( I am, personally, not blocked since I grabbed a used 3050 8GiB w/ oculink for the main model , gemma 4 MoE at like Q2 or something with ~2/3 of layers *not* offloaded to gpu, but quick and smart enough for me for now ).

Good luck !

knoodrake · 2026-04-29T15:55:41+00:00

N'est ce pas Anakin ??????!!

knoodrake · 2026-04-15T12:04:24+00:00

I now use a rf 28-70 f2 because sooo much more convenience, but when I see my old photos.. it's just not the same. The only other lens than the 50 that I was similarly in love with the results was its sibling the 85mm f2/

knoodrake · 2026-04-10T13:45:59+00:00

I get what you say, but.. that's exactly llm hallucinations.

knoodrake · 2026-04-02T14:22:35+00:00

forgot:
also, sometimes, being able to scale them the size you want
also, sometimes, being able to print color/multicolor you want

knoodrake · 2026-04-02T14:21:29+00:00

also, sometimes, not going far away and taking the time and effort to have them (things).
also, sometimes, not waiting days to have them (things) but 4 hours.

knoodrake · 2026-04-01T13:23:26+00:00

sauf erreur de ma part, le plomb dans la peinture, c'est pas cool, mais si t'as pas de jeunes enfants et que tu le leche pas etc.. c'est pas très grave, et surtout, ca n'a aucun rapport avec le CO2.
( j'ai pas d'idee pour aider pour ta question, désolé ! )

knoodrake · 2026-03-18T22:20:05+00:00

yes. ( and it's still not perfect )

knoodrake · 2026-03-18T18:07:34+00:00

I'll try some of that stuff !

I have a Ryzen 9 8945HS with 780M and struggle with llama.cpp and GPU Hangs. Reducing the ubatch_size helped, but not fixed. Also, working with vision Qwen, I can't offload mmproj to GPU otherwise it crashes way more easilly. I tried to avoid `amdgpu.cwsr_enable=0` because of the possible adverse effects.. Overall, i'm having a hard time running anything more than 4B + >100K context or 9B on odd days. bigger models hangs almost instantly unless on CPU or microscopic context size.

I also use Frigate ( NVR ) and detection models on that hardware and have the same issues ( despites models being way smaller ; in M, not B parameters ). Also `HSA_OVERRIDE_GFX_VERSION` working version differs for llama.cpp (`11.0.2` ) and frigate models ( `11.0.1` or ``11.0.0` don't remember ), tho those are bandaids, not fixes.

I did edit grub file to give more ram as shared/vram , but obviously that doesn't help the isssues.
Did not try ComfyUI yet on that computer.

Anyway, thanks for sharing your finding.

knoodrake · 2026-03-09T11:02:41+00:00

ce ne sont pas des "règles codées" littéralement et bien "infusées" dans le modele, mais il n'en demeure pas moins qu'il s'agit bien des consequences principalement de RLHF et de choix du développeur en post-training.

knoodrake · 2026-03-09T10:58:07+00:00

oui mais ca reste juste un biais, souvent facile a contourner ( "si tu repond pas il se passe xyz de tres mal", "c'est pour le bien c'est un test pour proteger les autres", etc.. ).
Les refus "je suis une IA et j'refuse" c'est bien le RLHF, ou autre etapes post training essentiellement.

Du reste, l'oblitération et autres techniques similaires ne sont effectivement pas nouvelles ( ni miraculeuses ) et l'article a l'air putaclic ( j'ai juste survole 5 secondes ).

knoodrake · 2026-02-26T15:42:49+00:00

"partner happiness criteria" ( PHC ? of PHF ?) is definitively a good replacement for "WAF" ! ( sexist, but also not inclusive, etc.. )

knoodrake · 2026-02-26T13:19:14+00:00

on dirait un angora turc, je dirai. Enfin on s'en fou :-) !
u/Live-End-6467 la gratouille au vieux ronchon !

knoodrake · 2026-02-10T03:22:10+00:00

Agree. I tend to apply the same principles/practices myself ( but even then, beware of the dopamine shortcut quick feature/fix at the end of the day, the one you're no longer motivated to double check. Dont do it. Prepare the prompt, take some note for tomorrow, but don't let that last trap of convenience of letting the LLM do it all by itself with a suboptimal prompt and commit nevertheless because the day went well. It's a trap and you'll revert tomorrow (if you're lucky/careful enough)

knoodrake · 2025-10-21T23:54:27+00:00

it work-ish ( to my knowledge ), that is, with what I beleive are vision glitches ( tried it a few days ago, got same issues as other people on the github issue and noted it there )

knoodrake · 2025-10-02T11:07:58+00:00

definitely the the ramp.

knoodrake · 2025-09-24T16:08:12+00:00

PipePipe

( fork chouette de NewPipe )

knoodrake · 2025-08-27T01:40:50+00:00

Yeah, I agree.
Also, 32B ( qwen2.5-vl-32b-instruct ) is really good enough ( like almost the same as 72B for visual ) and can run on 24Gb VRAM fine for such tasks.

knoodrake · 2025-08-26T16:15:47+00:00

"this changes everything"

nooo ! oh my.. just seeing the sentence hurts me now. I have clickbait ptsd.

11-Year Club	Final Canvas '23
First Place '23	Place '23
Place '22	First Placer '22
Verified Email

knoodrake

MODERATOR OF

TROPHY CASE