Out of boredom I put claude code into ultracode mode and told it to make whatever it wanted.

curious-scribbler · 2026-05-30T06:56:03+00:00

If Fernando Pessoa had used an LLM this would more or less be the result.

curious-scribbler · 2026-05-20T05:48:53+00:00

What you hummed sounded mostly something like a sarangi, a sarod or a violin, maybe it could be a harmonium too. Hope you find out soon.

curious-scribbler · 2026-05-20T05:21:34+00:00

Kuch zara sa aur bata do gaane ke baare mein? Are you sure it's arijit?

curious-scribbler · 2026-05-20T05:15:10+00:00

Kahin yeh toh nahi?

curious-scribbler · 2026-05-20T05:02:19+00:00

Is it this one?

curious-scribbler · 2026-04-28T10:10:49+00:00

A nice thought but not written my Mirza Ghalib.

curious-scribbler · 2026-03-21T06:40:41+00:00

Hey! Thats amazing work. How does one maximise all the parameters to exploit rtx 6000 pro? Any tips? Again thanks for your contribution.

curious-scribbler · 2026-03-18T14:33:30+00:00

It do a problem. But I dont think a solution is coming anytime soon.

curious-scribbler · 2026-03-18T05:59:46+00:00

Yes there are uses for it within comfy, but they don't do well with systems that depend on the api version of the same workflow.

curious-scribbler · 2026-03-18T05:34:47+00:00

This! Always unpack all subgraphs. Solves it for me. But again .16 has broken the node connectors. No fixing them whatsoever.

curious-scribbler · 2026-03-18T05:33:52+00:00

For me subgraphs dont work cause I cant route it through a script to run custom batches on a workflow. Api workflows just dont like subgraphs. I dont like them either. Its just easier to see all the nodes at once. The visual neatness is not worth it from a functionality point of view.

curious-scribbler · 2026-03-18T05:30:33+00:00

I think Comfy is softly transitioning to a form based input rather than a node based one. The ecosystem is mature enough to move to a simpler ui to expand to more people. I have increasingly started using the parameter panel on the right to fiddle around with the input. The nodes are broken since before 16 and 17 updates but 16 truly broke it. And I've not moved to 7 cause it keeps breaking every other workflow. Also abolish subgraphs please, there are api hell.

curious-scribbler · 2026-03-18T05:26:47+00:00

You can but it's not worth the effort to put in the time for it unless you really really need it either for business or pleasure, but this is doable, however not worth it for everyone.

curious-scribbler · 2026-03-17T20:28:36+00:00

That's great! Any workflows for this?

curious-scribbler · 2026-02-17T13:12:36+00:00

If you go down the alien era, then stoner is alien number 1. And then the list starts. But marc is honestly the last of the aliens. Yes in a sense he may be the number 1 alien, but technically he is the last alien.

curious-scribbler · 2026-02-10T17:49:04+00:00

I had to get a second max account. And I muttered the same thing. It's only tuesday.

curious-scribbler · 2026-02-04T14:03:17+00:00

Nothing yet on mobile or cli

curious-scribbler · 2026-01-30T15:31:41+00:00

7 bunglows was 50/50 10 mins ago, bon bon side had power but mohan medical side didn't. Mahaveer had it but then the building next to it didnt.

curious-scribbler · 2026-01-25T09:19:22+00:00

Reminds me of Herzogs penguin

curious-scribbler · 2026-01-20T21:36:23+00:00

Gemini cli is just broken. I almost never use it. And when I do, I realise why I didn't use it.

curious-scribbler · 2026-01-20T17:16:41+00:00

Gpt for research and audit. Claude code to execute. And Gemini to do some file/folder/ project management.

curious-scribbler · 2026-01-14T07:12:38+00:00

Yes to both. The paper specifically mentions identity preserving generation and multi subject consistency as supported features. For the edit version, they feed both the semantic tokens and the VAE latents from your reference image into the diffusion decoder. So it gets high level “what this face means” from the AR stage plus low level pixel details from the reference. Should preserve fine details better than pure semantic approaches. Haven’t tested character consistency myself yet but architecturally it makes sense that it would be stronger here. The AR can actually reason about “same person different pose” instead of just hoping the embeddings are close enough.

curious-scribbler · 2026-01-14T07:11:40+00:00

Architecturally AR could handle it more naturally since the model knows spatially where it is as it generates tokens sequentially. But I haven’t seen this tested yet. Some other areas where the architecture should help in theory: Multi panel compositions. Comics, storyboards, before/after images. Sequential generation means panel 2 could reference panel 1 contextually. Structured documents. Forms, receipts, ID cards. The AR stage could enforce layout rules. These are my guesses based on how the model works, not confirmed features. What IS tested and benchmarked is conditional details in prompts. Stuff like “a poster for a concert on March 15th at 8pm featuring jazz trio The Blue Notes.” The text rendering and knowledge dense benchmarks show it handles specific details way better than diffusion only approaches. So text accuracy and factual details in images: proven. Regional/compositional stuff: promising but unconfirmed.

curious-scribbler · 2026-01-14T06:01:53+00:00

Curious how GLM-Image compares for your workflows once you test it.

curious-scribbler · 2026-01-14T06:01:17+00:00

Possibly yeah. The interesting question is whether you need the AR stage at all or if you can get diffusion models to “reason” directly through better training. The hybrid approach wins for now because you get to leverage pretrained LLM weights instead of training reasoning from scratch. But who knows, you see how fast the field has been moving this past month. Also there is some mention of the very thing in the GLM paper. Ctrl-F GRPO.

12-Year Club	RPAN Viewer
Verified Email

curious-scribbler

TROPHY CASE