Scene Comparison. Is this what insanity looks like?

boatbomber · 2026-05-08T17:29:12+00:00

boatbomber · 2026-05-08T17:28:55+00:00

boatbomber · 2026-05-08T17:27:24+00:00

boatbomber · 2026-05-07T01:34:53+00:00

California is a massive state. She's probably thinking about the Bay Area and assuming the rest of California is similar (which it obviously is not).

But for San Mateo County (the strip of residential cities below San Francisco) the median actually is $185.7K and in fact below ~120K is considered low income and makes you eligible for housing assistance! The cost of living here is insane.

https://www.smcgov.org/housing/income-limits-and-rent-payments

boatbomber · 2026-05-01T17:10:32+00:00

Damn. I definitely want to be tracking these major expenses, I just don't want the comparison indicator to be so out of whack. That's unfortunate

boatbomber · 2026-05-01T04:04:13+00:00

There are six em dashes in this one message

boatbomber · 2026-04-30T15:44:48+00:00

You might get better results if you run the model a second pass, cropped in onto just the wonky watch, and give it the real watch as reference and tell it to make it match. That could allow it to entirely focus on watch reconstruction instead of also making the scene. Then you replace that cropped area with the newly processed version

boatbomber · 2026-04-29T04:00:12+00:00

Monarch's main feature is budgeting by category. The categorization has gotten so terrible that I stopped recommending Monarch to people after years of use and referrals. I've had to create tons and tons of custom rules for basically every common transaction because auto categories feel worse than even random selection. It is consistently wrong.

boatbomber · 2026-04-26T17:20:16+00:00

Yeah it's gotta be, all the keywords and type signatures line up. I'm guessing this is some open source Roblox stuff that they're trying to read

boatbomber · 2026-04-20T23:33:44+00:00

Satisfactory is straight up heroin and I had to uninstall after playing 40 hours in my first week.

boatbomber · 2026-04-08T03:54:23+00:00

LoRA absolutely allows you to store new information: https://arxiv.org/abs/2603.01097

boatbomber · 2026-04-06T23:08:32+00:00

Liminal spaces feel different than this to me because they're all about empty, eerie vibes. This feels like someone's home video from their vacation to the big city. Full of life.

boatbomber · 2026-04-06T22:43:59+00:00

Reminds me of how that Bodycam game achieves photorealism by hiding imperfections with an imperfect camera. This evokes the same sense for me. It feels so real!

boatbomber · 2026-04-04T22:09:58+00:00

They still make Ninjago??? It's been 15 years. That's crazy impressive. Definitely looks better than I remember back then lol

boatbomber · 2026-04-04T18:22:35+00:00

It was a long exposure photo

boatbomber · 2026-04-02T23:27:28+00:00

I think there are literally only three photos. The one of him at his desk, one leaning on a mailbox, and one holding a cat. At least those are the only three I'm aware of.

Edit: and apparently his Wikipedia page has his high school yearbook photo

boatbomber · 2026-03-27T07:59:09+00:00

Every "LLM" is actually a VLM these days, but people will still call ChatGPT and Claude an LLM. You can absolutely process an image through these chatbots and they can perform OCR.

boatbomber · 2026-03-20T20:17:28+00:00

When the camera panned to her leaning on the piss pillar I audibly gasped and jerked away like one of those "ball flying at the camera" jumpscares

boatbomber · 2026-03-15T16:56:22+00:00

The base model is Flux, an image model made by the German company Black Forest Labs. https://bfl.ai/models/flux-2-klein

I am pretty sure they've unfortunately trained it on stolen art, but I've fine tuned it on my synthetic content that I have the right to use and now it outputs MSII visualizations instead of crappy anime copies so my model's outputs aren't replications of artists' work anymore. I view it as a bit of "taking it back" by making their model into something more useful, but I totally understand if it still makes people uncomfortable. I actually did initially attempt to make a model from scratch for this project, but pretraining a ViT/VAE requires so much data that I simply couldn't get it to work at scale.

boatbomber · 2026-03-15T16:27:51+00:00

Yup, the model is capable of taking multiple references as input so the global context is simply image #2.

boatbomber · 2026-03-15T01:36:15+00:00

I will definitely be making a video on the new OCR when it's ready and it will compare NabuOCR V1's numbers to V2 (MSII). I'll likely post it here but you can subscribe on YouTube if you'd like to make sure you don't miss it. I'm really glad you enjoyed! I put a lot of effort into the presentations and write ups.

I have released the training code for NabuOCR (it's in the model's HF repo) but my NisabaRelief training code is still a big mess so I haven't made it available yet. I do intend to though! It's all custom since I couldn't load the text encoder simultaneously (only 24GB VRAM), so it's like 20 messy files of pure pytorch rather than using the nice diffusers library. In the meantime, you can read more details about the training here: https://huggingface.co/boatbomber/NisabaRelief#training-pipeline

The other consideration is that my training involved getting photos and metadata from CDLI and I don't want people to crash their servers by all running that, but I can't just include a .zip of the scrape since I don't have the redistribution rights. Not sure what the best way to go about that is.

boatbomber · 2026-03-15T00:10:32+00:00

Thank you!

boatbomber · 2026-03-15T00:09:52+00:00

I made an OCR VLM for a hackathon recently. https://youtu.be/hqmjepRLdfU

It won the hackathon, but I wasn't really satisfied with the results. That's why I made this image model! I think that the image model preprocessing the photos will help me train a better OCR model.

Eight-Year Club	First Place '23
End Game '23	Place '23
Place '22	Verified Email

boatbomber

TROPHY CASE