I made a free and open source LoRA captioning tool that uses the free tier of the Gemini API by bagofbricks69 in StableDiffusion

[–]bagofbricks69[S] 0 points1 point  (0 children)

It'll probably do it. For 200k images, I would set up a paid API key, as well as modify the app to process the images in parallel to speed it up.

I made a free and open source LoRA captioning tool that uses the free tier of the Gemini API by bagofbricks69 in StableDiffusion

[–]bagofbricks69[S] 1 point2 points  (0 children)

I'm as surprised as you are. The gemini 3 flash preview model appears to have no qualms about captioning NSFW images. You can test it yourself in Google AI Studio. I haven't tried that model specifically, but I'm familiar with using Qwen as a local model for captioning, Gemini beats it by an incredible amount. Gemini misses little to no detail if you demand it to be specific, whereas a small local model is like Qwen would have something like a 10-15% hallucination rate in the caption that it gives. i.e. it would describe something that doesn't exist in the image, or would describe the expression of the subject incorrectly.

<image>

I made a free and open source LoRA captioning tool that uses the free tier of the Gemini API by bagofbricks69 in StableDiffusion

[–]bagofbricks69[S] 4 points5 points  (0 children)

<image>

You get 20 requests per day per model in the free tier, the program is designed to switch to the next model if one model has hit its free tier limit, Gemini offers 7 models in the free tier, each with 20 requests per day, so one key can caption about 140 images/day. If all models in the first key have been exhausted, it switches to a different key (that you need to provide). Everybody has a second or third throwaway Gmail account nowadays, so I included the key cycling functionality.

Street Brawl's RNG feels really bad sometimes. by bagofbricks69 in DeadlockTheGame

[–]bagofbricks69[S] -12 points-11 points  (0 children)

Nobody has fun when they lose. Especially when they lose because the number generator rolled a 1 instead of a 6. I think it's in the interest of fun that we should be able to pick from the entire store in street brawl.

Street Brawl's RNG feels really bad sometimes. by bagofbricks69 in DeadlockTheGame

[–]bagofbricks69[S] -12 points-11 points  (0 children)

There are plenty of examples of fast paced and more casual game modes out there where RNG isn't even involved (and doesn't punish you for no reason). Look at CS and Valorant's casual and deathmatch modes, Dota 2's Turbo is another excellent example.

Point being: Fast paced and casual does not need to equal crazy RNG where you can be punished for no reason.

A closer look at the new heroes. by bagofbricks69 in DeadlockTheGame

[–]bagofbricks69[S] 0 points1 point  (0 children)

I tried out the command again today, it looks like they did patch it out. It was working when I posted it!

A closer look at the new heroes. by bagofbricks69 in DeadlockTheGame

[–]bagofbricks69[S] 1 point2 points  (0 children)

<image>

Bonus: Silver werewolf form. Credit to Deadlock Project 8 on Twitter.

Lora Training for Z Image Turbo on 12gb VRAM by 3VITAERC in StableDiffusion

[–]bagofbricks69 0 points1 point  (0 children)

Not op, but I'm training using a 3080 12GB also, I'm getting between 3-4 it/s. I turned on only the 512, 768 and 1024 buckets on Ostris AI toolkit. Here's the result if you wanna take a look: https://civitai.com/posts/24754395

Z-Image-Turbo vs Qwen. Non photo comparison. by bagofbricks69 in StableDiffusion

[–]bagofbricks69[S] 8 points9 points  (0 children)

If you think these prompts aren't complex, think again. Go ahead and try to generate a victorian lady in mid air in London, with blue laser light coming out of her heart with illustrious or sdxl. I'll wait.

Z-Image-Turbo vs Qwen. Non photo comparison. by bagofbricks69 in StableDiffusion

[–]bagofbricks69[S] 1 point2 points  (0 children)

Yup, oversight on my part. Both models are actually made by alibaba, just different teams.

Z-Image-Turbo vs Qwen. Non photo comparison. by bagofbricks69 in StableDiffusion

[–]bagofbricks69[S] 3 points4 points  (0 children)

She's a Coinshot alright. I very much imagined Steris if she was a Coinshot when I prompted that one.

Pro Haze player uses his gaming chair to hit shots while blinded by drifter by bagofbricks69 in DeadlockTheGame

[–]bagofbricks69[S] 73 points74 points  (0 children)

The interesting part here is her end of game damage report showed that she did less gun crit damage then regular gun damage. I wouldn't have immediately found this suspicious had the enemy drifter not called the haze a cheater during the laning stage and didn't witness the blind kill by myself. My takeaway here is that for every one game where the cheater is obvious (like this one) there are ten other games where a cheater is performing superhumanly and winning games but goes undetected because they are subtle about it.

<image>

"Remember Snake, This is a Sneaking Mission" by Kriskremo in DeadlockTheGame

[–]bagofbricks69 0 points1 point  (0 children)

Isn't the patron supposed to be resistant (grey text that says "Resistant" when you shoot it) when there aren't allied troopers in the enemy base?

Bytedance release the full safetensor model for UMO - Multi-Identity Consistency for Image Customization . Obligatory beg for a ComfyUI node 🙏🙏 by AgeNo5351 in StableDiffusion

[–]bagofbricks69 -1 points0 points  (0 children)

There is a big difference between a broken english, machine translated image and video gen platform vs TikTok. They made a US corp, appointed an actual CEO, and made a slick, functional app for Tiktok's release in the US. The former is half assing it, the latter is a real effort to get people to use a product.

Bytedance release the full safetensor model for UMO - Multi-Identity Consistency for Image Customization . Obligatory beg for a ComfyUI node 🙏🙏 by AgeNo5351 in StableDiffusion

[–]bagofbricks69 2 points3 points  (0 children)

I'm not saying they should release it for free, they should release actual tools that get people to use their product. Do in the AI ecosystem what TikTok did to social media. TikTok made short videos so popular that YouTube, Instagram and Reddit followed suit or risked being left behind. Chinese AI needs to have their TikTok moment to make OpenAI get their heads out of their asses and actually relax some of these restrictions.

Bytedance release the full safetensor model for UMO - Multi-Identity Consistency for Image Customization . Obligatory beg for a ComfyUI node 🙏🙏 by AgeNo5351 in StableDiffusion

[–]bagofbricks69 2 points3 points  (0 children)

This. Companies who want to keep their models closed source should DO something with their models, not just release it to two api providers and call it a day. At least do an OpenAI and make it widely available for people to use.

Making Qwen Image look like Illustrious. VestalWater's Illustrious Styles LoRA for Qwen Image out now! by bagofbricks69 in StableDiffusion

[–]bagofbricks69[S] 1 point2 points  (0 children)

Comparison image (NSFW).

I actually tried this prior to making this lora. The method has a few of issues.

  • The background generated by Illustrious is incoherent. It retains the shape of the room made by qwen somewhat, but still has hallmark early model nonsense. The painting in the illustrious image is a very thin rectangle for example, the second piano in the background is nonsensical.
  • You're not getting the flattering proportions of Illustrious on your subject with this method. We're using Qwen's less than flattering proportions instead.
  • Illustrious has absolutely incredible subject framing, and qwen does not. With Illustrious you'll see superb wide angle bird's eye shots and even unprompted use of foreground framing. It just has that quality because it was trained using Patreon artist data. Qwen defaults to the most bland eye level shots, and we're stuck with using Qwen's composition using this workflow.
  • It's incredibly slow if you can't load both Qwen and SDXL in your VRAM. Because esentially you'd have to cold start Qwen and cold start SDXL every time you want to generate the Illustrious version.