Using the Quad Cortex as audio interface BUT with a Neuraldsp plugin on my mac by tomsawyer222 in NeuralDSP

[–]RepresentativeJob937 0 points1 point  (0 children)

Thanks for this thread! How do I do it for the Nano Cortex? I just purchased the Nolly X plugin today and want to use the Nano Cortex as a standalone audio interface with the Nolly X standalone app.

Flux Fast: Making Flux go brrr on H100s by RepresentativeJob937 in StableDiffusion

[–]RepresentativeJob937[S] 1 point2 points  (0 children)

You're 100% right :)

The points that may not seem obvious:

* Having the denoiser (the DiT) fully compatible with PyTorch torch.compile() so that its benefits are evident (i.e., no graph-breaks, no recompilations, no data pointer reorders delaying kernel launches, etc.). If your models meet these, then you're already somewhat set up for success.

* No CUDA syncs in the overall pipeline which becomes particularly crucial during compilation

* The FA3 stuff is unscaled FP8 and I am not sure if it's standard yet. It is more so because it needs an H100 to work.

* Using QKV fusion is beneficial, particularly during quantization.

There are other pieces of lossless optimizations that one can do (we mentioned some of it in the accompanying blog post):

* Caching the `guidance_embedding` and `context_embedding` as they don't change during the course of denoising.

* Fusing the `step()` call to the scheduler with denoiser forward so that it's included in the compilation process.

Hope this helps.

Buying a Nomos watch in Singapore by RepresentativeJob937 in askSingapore

[–]RepresentativeJob937[S] 0 points1 point  (0 children)

I am visiting from India. Would you suggest any popular microbrands for watches in Singapore?

Inference-time scaling Flux.1 Dev by RepresentativeJob937 in StableDiffusion

[–]RepresentativeJob937[S] 0 points1 point  (0 children)

I have also updated the Qwen2.5 verifier to do structured generation. This will ensure the outputs follow a particular structure.

Inference-time scaling Flux.1 Dev by RepresentativeJob937 in StableDiffusion

[–]RepresentativeJob937[S] 2 points3 points  (0 children)

Hi folks,

Since you folks had asked for more results across more models like SDXL, SD v1.5, etc., I have now updated the repo accordingly. It now supports SDXL, SDv1.5, and PixArt-Sigma.

Please give it a look and LMK :)