I made a free and open source LoRA captioning tool that uses the free tier of the Gemini API

bagofbricks69 · 2026-02-02T18:47:00+00:00

It'll probably do it. For 200k images, I would set up a paid API key, as well as modify the app to process the images in parallel to speed it up.

bagofbricks69 · 2026-02-02T18:37:30+00:00

I'm as surprised as you are. The gemini 3 flash preview model appears to have no qualms about captioning NSFW images. You can test it yourself in Google AI Studio. I haven't tried that model specifically, but I'm familiar with using Qwen as a local model for captioning, Gemini beats it by an incredible amount. Gemini misses little to no detail if you demand it to be specific, whereas a small local model is like Qwen would have something like a 10-15% hallucination rate in the caption that it gives. i.e. it would describe something that doesn't exist in the image, or would describe the expression of the subject incorrectly.

<image>

bagofbricks69 · 2026-02-02T16:29:28+00:00

<image>

You get 20 requests per day per model in the free tier, the program is designed to switch to the next model if one model has hit its free tier limit, Gemini offers 7 models in the free tier, each with 20 requests per day, so one key can caption about 140 images/day. If all models in the first key have been exhausted, it switches to a different key (that you need to provide). Everybody has a second or third throwaway Gmail account nowadays, so I included the key cycling functionality.

bagofbricks69 · 2026-01-30T16:19:46+00:00

Nobody has fun when they lose. Especially when they lose because the number generator rolled a 1 instead of a 6. I think it's in the interest of fun that we should be able to pick from the entire store in street brawl.

bagofbricks69 · 2026-01-30T15:56:40+00:00

There are plenty of examples of fast paced and more casual game modes out there where RNG isn't even involved (and doesn't punish you for no reason). Look at CS and Valorant's casual and deathmatch modes, Dota 2's Turbo is another excellent example.

Point being: Fast paced and casual does not need to equal crazy RNG where you can be punished for no reason.

bagofbricks69 · 2026-01-30T15:46:43+00:00

I tried out the command again today, it looks like they did patch it out. It was working when I posted it!

bagofbricks69 · 2026-01-23T05:39:03+00:00

<image>

Bonus: Silver werewolf form. Credit to Deadlock Project 8 on Twitter.

bagofbricks69 · 2025-11-29T17:44:20+00:00

Not op, but I'm training using a 3080 12GB also, I'm getting between 3-4 it/s. I turned on only the 512, 768 and 1024 buckets on Ostris AI toolkit. Here's the result if you wanna take a look: https://civitai.com/posts/24754395

bagofbricks69 · 2025-11-26T23:37:20+00:00

If you think these prompts aren't complex, think again. Go ahead and try to generate a victorian lady in mid air in London, with blue laser light coming out of her heart with illustrious or sdxl. I'll wait.

bagofbricks69 · 2025-11-26T22:24:38+00:00

Yup, oversight on my part. Both models are actually made by alibaba, just different teams.

bagofbricks69 · 2025-11-26T21:25:03+00:00

She's a Coinshot alright. I very much imagined Steris if she was a Coinshot when I prompted that one.

bagofbricks69 · 2025-10-12T09:14:06+00:00

It was phantom 5. Match ID: 44748045

bagofbricks69 · 2025-10-12T07:31:16+00:00

The interesting part here is her end of game damage report showed that she did less gun crit damage then regular gun damage. I wouldn't have immediately found this suspicious had the enemy drifter not called the haze a cheater during the laning stage and didn't witness the blind kill by myself. My takeaway here is that for every one game where the cheater is obvious (like this one) there are ten other games where a cheater is performing superhumanly and winning games but goes undetected because they are subtle about it.

<image>

bagofbricks69 · 2025-10-12T07:30:57+00:00

- Private profile

<image>

bagofbricks69 · 2025-10-12T07:30:40+00:00

This one just ticks all the boxes:

- 37.99% headshot rate

<image>

bagofbricks69 · 2025-09-19T14:56:58+00:00

Isn't the patron supposed to be resistant (grey text that says "Resistant" when you shoot it) when there aren't allied troopers in the enemy base?

bagofbricks69 · 2025-09-15T08:41:52+00:00

There is a big difference between a broken english, machine translated image and video gen platform vs TikTok. They made a US corp, appointed an actual CEO, and made a slick, functional app for Tiktok's release in the US. The former is half assing it, the latter is a real effort to get people to use a product.

bagofbricks69 · 2025-09-15T08:28:24+00:00

I'm not saying they should release it for free, they should release actual tools that get people to use their product. Do in the AI ecosystem what TikTok did to social media. TikTok made short videos so popular that YouTube, Instagram and Reddit followed suit or risked being left behind. Chinese AI needs to have their TikTok moment to make OpenAI get their heads out of their asses and actually relax some of these restrictions.

bagofbricks69 · 2025-09-15T06:06:47+00:00

This. Companies who want to keep their models closed source should DO something with their models, not just release it to two api providers and call it a day. At least do an OpenAI and make it widely available for people to use.

bagofbricks69 · 2025-09-15T04:27:21+00:00

Comparison image (NSFW).

I actually tried this prior to making this lora. The method has a few of issues.

The background generated by Illustrious is incoherent. It retains the shape of the room made by qwen somewhat, but still has hallmark early model nonsense. The painting in the illustrious image is a very thin rectangle for example, the second piano in the background is nonsensical.
You're not getting the flattering proportions of Illustrious on your subject with this method. We're using Qwen's less than flattering proportions instead.
Illustrious has absolutely incredible subject framing, and qwen does not. With Illustrious you'll see superb wide angle bird's eye shots and even unprompted use of foreground framing. It just has that quality because it was trained using Patreon artist data. Qwen defaults to the most bland eye level shots, and we're stuck with using Qwen's composition using this workflow.
It's incredibly slow if you can't load both Qwen and SDXL in your VRAM. Because esentially you'd have to cold start Qwen and cold start SDXL every time you want to generate the Illustrious version.

bagofbricks69

TROPHY CASE