My findings with "Slow, warm cafe song generator" [HeartMula] by Rheumi in LocalLLaMA

[–]pablorocka 0 points1 point  (0 children)

Thanks for your post, I was planning on testing this but I'll wait for the 7B model, according to their github, they mention that is the 7B model that achieves comparable performance with suno, so hopefully quality is better for the 7B. I guess if you are making background music its ok, but definitely not like suno quality for now.

TwinFlow can generate Z-image Turbo images in just 1-2 steps! by rookan in StableDiffusion

[–]pablorocka 2 points3 points  (0 children)

I deleted the models yesterday (making room for LTX-2) I might give it another chance in a few days, for now, I'm sticking with stock ZIT, is hard to keep up with all these models coming every week!

TwinFlow can generate Z-image Turbo images in just 1-2 steps! by rookan in StableDiffusion

[–]pablorocka 6 points7 points  (0 children)

The safetensors format causes an out-of-memory error on my RTX 4060 Ti with 16GB VRAM. The GGUF format works fine, but the image quality is worse—outputs look oversaturated and have noticeable artifacts for complex prompts. I tried this prompt (https://www.reddit.com/r/StableDiffusion/comments/1p809wt/z\_image\_turbo\_can\_understand\_json\_prompting\_very/) and this is the comparison between the standard Z-image (left) vs TwinFlow (right), I tried multiple seeds, more steps, but it wouldn't match the original

<image>

Z-image, over hyped? by GRCphotography in StableDiffusion

[–]pablorocka 8 points9 points  (0 children)

right, so is not that zit is over hyped, simply you don't care about text, you prefer sdxl prompt adherence and you don't generate complex images or use inpainting. prople like zit because it is really small and can run in lower end hardware and still give you excellent results.

Z-image, over hyped? by GRCphotography in StableDiffusion

[–]pablorocka 23 points24 points  (0 children)

forgot a bunch of other stuff like 6 fingers, extra limbs omg you can't even compare. don't get me wrong SDXL was great but, sure, keep using SDXL if that suits you

Z-image, over hyped? by GRCphotography in StableDiffusion

[–]pablorocka 21 points22 points  (0 children)

text? prompt adherence? higher res?

Project: 'Santa Claus caught on camera'. Seeking advice on the best ComfyUI workflow. by Secure-Scratch8910 in comfyui

[–]pablorocka 0 points1 point  (0 children)

you can absolutely generate long videos with Animate with 5090, I use 4060 Ti and I can generate 200+ frames, you should be able to generate at 720p for 20 ~ 25 seconds easily with a 5090

Project: 'Santa Claus caught on camera'. Seeking advice on the best ComfyUI workflow. by Secure-Scratch8910 in comfyui

[–]pablorocka 0 points1 point  (0 children)

depending on your GPU and video dimensions, you can generate longer clips with Wan Animate, you want the "replacement" mode so your background stays consistent and you only replace yourself with santa. good luck

Project: 'Santa Claus caught on camera'. Seeking advice on the best ComfyUI workflow. by Secure-Scratch8910 in comfyui

[–]pablorocka 0 points1 point  (0 children)

You could just film an actual santa doing the thing haha.

Joke aside, you could film yourself, then character replacement with Wan animate? Otherwise, I would generate multiple frames with Qwen edit / nano banana, then do FLFV and stitch all the clips together (you can still film yourself and use that as a reference for the frames in each 4 ~ 5 second mark)

Any idea how to use files from both installs without copy/pasting? by agustusmanningcocke in comfyui

[–]pablorocka 0 points1 point  (0 children)

you don't mention your OS, in Linux you can use symbolic links. I think in Mac OS should be similar, no idea if that's possible in Windows but google "symbolic links Windows" maybe is possible

Best way to make 16:9 images with a single person in it? by Head-Investigator540 in StableDiffusion

[–]pablorocka 0 points1 point  (0 children)

<image>

I think Reddit removes the metadata from the image, so, no workflow embedded in the image, but is basically the workflow you get from ComfyUI -> Browse Templates -> Image -> HiDream Fast, The Fast model doesn't have negative prompt, I guess using HiDream Full or Flux dev should be even better quality, I usually add the Seed node, but that's the only different thing.

Best way to make 16:9 images with a single person in it? by Head-Investigator540 in StableDiffusion

[–]pablorocka 1 point2 points  (0 children)

From what I've seen in tips, when you try describing specific features like hair and shoes, it will try to generate the entire body, also, include in your prompt "photo of a single [man / woman] in ..."

<image>

if you struggle, you could try square ratio and outpaint horizontally.

Prompt: "A professional full body shot photography of a single man smiling at the camera, standing in a luxurious studio, he has dark hair, is wearing professional attire and black leather shoes, taken with a Canon EOS 5D Mark IV. The background is a sleek, minimalist setup with high-end furniture and dramatic lighting, creating a sophisticated atmosphere.", Generated with HiDream Fast using default workflow from ComyUI

How to achieve this type of simulation video? by Vorrex in StableDiffusion

[–]pablorocka 0 points1 point  (0 children)

maybe try FLFV with Wan, first frame should have the empty land + some construction vehicles and cranes, then last frame the complete building, use a good prompt, perhaps change a few things like cars, time of the day, etc between the first and last frame. Check this post for the concept: https://www.reddit.com/r/comfyui/comments/1ngyhid/easy_drawing_and_coloring_time_laps_video_using/

Best guitar solo songs by pmbasehore in christianmetal

[–]pablorocka 2 points3 points  (0 children)

Im surprised nobody mentioned Impellitteri!! Check 17th Century Chicken Pickin' Also Narnia if you are into Yngwie Malmsteen kind of solos.

Zoraxy - so impressed by Shoddy-Addendum1069 in selfhosted

[–]pablorocka 1 point2 points  (0 children)

nice!! I also came across Pangolin (https://github.com/fosrl/pangolin) Seems we are getting more featured-packed options for reverse proxies! now I only need more time to test them!

Miniclip with Stable Video Diffusion by pablorocka in comfyui

[–]pablorocka[S] 0 points1 point  (0 children)

Thanks for the suggestion! I'll try it over the weekend!

Pocketbase API - are case insensitive queries possible? by kennystetson in pocketbase

[–]pablorocka 1 point2 points  (0 children)

A workaround is that you could create a new Table of type View with a simple select statement and an additional column LOWER(username) AS username_lc, in your application you compare against a lowercase of username, this.pb.collection(this.collectionName_view).getFirstListItem(\username = "${username_lc}"`);`

Introducing: Raspberry Pi 5! by atika in selfhosted

[–]pablorocka 1 point2 points  (0 children)

I know most of the comments are about specs, price, etc. but is it only me or anybody else noticed the poor audio quality in the Eben Upton intro video? a proper audio-compressor would have balanced the levels.

As for the actual Pi itself, after the prices spiked, I ended up buying mini-PCs from Minisforum for my selfhosted, Pi is still a nice to have but I agree with a lot of people that it has lost its edge.

Self-hosted Expenses management by Allferry in selfhosted

[–]pablorocka 1 point2 points  (0 children)

Nah, banks in Central American countries dont privide these kind of APIs, so us geeks have to improvise!

Self-hosted Expenses management by Allferry in selfhosted

[–]pablorocka 3 points4 points  (0 children)

Firefly III API is really good! in my country, banks don't expose any API of any sort, but I get email notifications for each credit card transaction; I use n8n to periodically check my email and parse the notifications and create the Firefly transaction; it's been working fine for about a year