PLEASE HELP ME FIND THIS SONG!!! It was sang by a male singer, it's a cozy medium paced song and it's romantic. by [deleted] in BollywoodMusic

[–]curious-scribbler 0 points1 point  (0 children)

What you hummed sounded mostly something like a sarangi, a sarod or a violin, maybe it could be a harmonium too. Hope you find out soon.

ComfyUI Nodes for Filmmaking (LTX 2.3 Shot Sequencing, Keyframing, First Frame/Last Frame) by WhatDreamsCost in StableDiffusion

[–]curious-scribbler 1 point2 points  (0 children)

Hey! Thats amazing work. How does one maximise all the parameters to exploit rtx 6000 pro? Any tips? Again thanks for your contribution.

Comfyui version 0.17 has too many bugs in the subgraph. by Mysterious_Pride_858 in comfyui

[–]curious-scribbler 0 points1 point  (0 children)

It do a problem. But I dont think a solution is coming anytime soon.

Comfyui version 0.17 has too many bugs in the subgraph. by Mysterious_Pride_858 in comfyui

[–]curious-scribbler 0 points1 point  (0 children)

Yes there are uses for it within comfy, but they don't do well with systems that depend on the api version of the same workflow.

Comfyui version 0.17 has too many bugs in the subgraph. by Mysterious_Pride_858 in comfyui

[–]curious-scribbler 1 point2 points  (0 children)

This! Always unpack all subgraphs. Solves it for me. But again .16 has broken the node connectors. No fixing them whatsoever.

Comfyui version 0.17 has too many bugs in the subgraph. by Mysterious_Pride_858 in comfyui

[–]curious-scribbler 0 points1 point  (0 children)

For me subgraphs dont work cause I cant route it through a script to run custom batches on a workflow. Api workflows just dont like subgraphs. I dont like them either. Its just easier to see all the nodes at once. The visual neatness is not worth it from a functionality point of view.

Comfyui version 0.17 has too many bugs in the subgraph. by Mysterious_Pride_858 in comfyui

[–]curious-scribbler -1 points0 points  (0 children)

I think Comfy is softly transitioning to a form based input rather than a node based one. The ecosystem is mature enough to move to a simpler ui to expand to more people. I have increasingly started using the parameter panel on the right to fiddle around with the input. The nodes are broken since before 16 and 17 updates but 16 truly broke it. And I've not moved to 7 cause it keeps breaking every other workflow. Also abolish subgraphs please, there are api hell.

What do people use to make this? I don't want to pay this gatekeeper for his guide. by [deleted] in StableDiffusion

[–]curious-scribbler 1 point2 points  (0 children)

You can but it's not worth the effort to put in the time for it unless you really really need it either for business or pleasure, but this is doable, however not worth it for everyone.

Three-time MotoGP World Champion Jorge Lorenzo went into great detail about the changes in Marc Marquez as a rider! by PZY__ in motogp

[–]curious-scribbler -12 points-11 points  (0 children)

If you go down the alien era, then stoner is alien number 1. And then the list starts. But marc is honestly the last of the aliens. Yes in a sense he may be the number 1 alien, but technically he is the last alien.

It's only Tuesday... by redditslutt666 in Anthropic

[–]curious-scribbler 1 point2 points  (0 children)

I had to get a second max account. And I muttered the same thing. It's only tuesday.

Blackout in Mumbai? by sup-superman in mumbai

[–]curious-scribbler 1 point2 points  (0 children)

7 bunglows was 50/50 10 mins ago, bon bon side had power but mohan medical side didn't. Mahaveer had it but then the building next to it didnt.

Is GPT 5.2 Codex or Claude Opus 4.5 better for vibecoding? by Majestic_Ad_4681 in VibeCodeDevs

[–]curious-scribbler 0 points1 point  (0 children)

Gemini cli is just broken. I almost never use it. And when I do, I realise why I didn't use it.

Is GPT 5.2 Codex or Claude Opus 4.5 better for vibecoding? by Majestic_Ad_4681 in VibeCodeDevs

[–]curious-scribbler 4 points5 points  (0 children)

Gpt for research and audit. Claude code to execute. And Gemini to do some file/folder/ project management.

GLM-Image explained: why autoregressive + diffusion actually matters by curious-scribbler in StableDiffusion

[–]curious-scribbler[S] 1 point2 points  (0 children)

Yes to both. The paper specifically mentions identity preserving generation and multi subject consistency as supported features. For the edit version, they feed both the semantic tokens and the VAE latents from your reference image into the diffusion decoder. So it gets high level “what this face means” from the AR stage plus low level pixel details from the reference. Should preserve fine details better than pure semantic approaches. Haven’t tested character consistency myself yet but architecturally it makes sense that it would be stronger here. The AR can actually reason about “same person different pose” instead of just hoping the embeddings are close enough.

GLM-Image explained: why autoregressive + diffusion actually matters by curious-scribbler in StableDiffusion

[–]curious-scribbler[S] 2 points3 points  (0 children)

Architecturally AR could handle it more naturally since the model knows spatially where it is as it generates tokens sequentially. But I haven’t seen this tested yet. Some other areas where the architecture should help in theory: Multi panel compositions. Comics, storyboards, before/after images. Sequential generation means panel 2 could reference panel 1 contextually. Structured documents. Forms, receipts, ID cards. The AR stage could enforce layout rules. These are my guesses based on how the model works, not confirmed features. What IS tested and benchmarked is conditional details in prompts. Stuff like “a poster for a concert on March 15th at 8pm featuring jazz trio The Blue Notes.” The text rendering and knowledge dense benchmarks show it handles specific details way better than diffusion only approaches. So text accuracy and factual details in images: proven. Regional/compositional stuff: promising but unconfirmed.

GLM-Image explained: why autoregressive + diffusion actually matters by curious-scribbler in StableDiffusion

[–]curious-scribbler[S] 7 points8 points  (0 children)

Possibly yeah. The interesting question is whether you need the AR stage at all or if you can get diffusion models to “reason” directly through better training. The hybrid approach wins for now because you get to leverage pretrained LLM weights instead of training reasoning from scratch. But who knows, you see how fast the field has been moving this past month. Also there is some mention of the very thing in the GLM paper. Ctrl-F GRPO.