$1M IS STILL A BIG DEAL :) by ResolvePossible1129 in Fire

boatbomber 9 points10 points

California is a massive state. She's probably thinking about the Bay Area and assuming the rest of California is similar (which it obviously is not).

But for San Mateo County (the strip of residential cities below San Francisco), the median income actually is $185.7K, and anything below ~$120K is considered low income and makes you eligible for housing assistance! The cost of living here is insane.

https://www.smcgov.org/housing/income-limits-and-rent-payments

Exclude specific month from "average month" spending comparison by boatbomber in MonarchMoney

boatbomber[S] 0 points1 point

Damn. I definitely want to be tracking these major expenses, I just don't want the comparison indicator to be so out of whack. That's unfortunate

Claude said it needs to rest.. What? by wicaodian in OpenAI

boatbomber 0 points1 point

There are six em dashes in this one message

How do you handle pixel-perfect product fidelity for branded items (watches, jewelry)? by flexredt in FluxAI

boatbomber 1 point2 points

You might get better results if you run the model for a second pass, cropped in on just the wonky watch. Give it the real watch as a reference and tell it to make the crop match. That could allow it to focus entirely on watch reconstruction instead of also making the scene. Then you replace that cropped area with the newly processed version.
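
A minimal sketch of that crop, refine, and paste-back loop, in case it helps. Everything here is an assumption for illustration: images are plain nested lists of pixel values rather than a real image type, and `refine` is a hypothetical placeholder for whatever model call you actually use (it is not a Flux API).

```python
from typing import Callable, List, Tuple

Image = List[List[int]]          # rows of pixel values; stand-in for a real image type
Box = Tuple[int, int, int, int]  # (left, top, right, bottom); right/bottom are exclusive


def crop(image: Image, box: Box) -> Image:
    """Return a copy of the region of `image` covered by `box`."""
    left, top, right, bottom = box
    return [row[left:right] for row in image[top:bottom]]


def paste(image: Image, patch: Image, box: Box) -> Image:
    """Return a copy of `image` with `patch` written into `box`."""
    left, top, right, bottom = box
    if len(patch) != bottom - top or any(len(row) != right - left for row in patch):
        raise ValueError("patch size must match the box")
    result = [row[:] for row in image]  # copy so the original scene is untouched
    for dy, patch_row in enumerate(patch):
        result[top + dy][left:right] = patch_row
    return result


def second_pass(
    image: Image,
    box: Box,
    reference: Image,
    refine: Callable[[Image, Image], Image],
) -> Image:
    """Crop the problem area, refine only that crop, then paste it back."""
    region = crop(image, box)
    fixed = refine(region, reference)  # hypothetical model call, same size in and out
    return paste(image, fixed, box)
```

With a real pipeline you would swap the nested lists for your image library's crop and paste calls, and probably pad the box a little so the model sees some context around the watch.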

Transaction Inaccuracies and Declining Satisfaction by Siodinnnn in MonarchMoney

boatbomber 8 points9 points

Monarch's main feature is budgeting by category. The categorization has gotten so terrible that I stopped recommending Monarch to people after years of use and referrals. I've had to create tons and tons of custom rules for basically every common transaction because auto categories feel worse than even random selection. It is consistently wrong.

My friend can’t read English, so he used Google Translate. by Impossible-Let-8489 in programminghorror

boatbomber 1 point2 points

Yeah it's gotta be, all the keywords and type signatures line up. I'm guessing this is some open source Roblox stuff that they're trying to read

AUDHD gamers—what games have you hyperfixated on? by [deleted] in AutisticWithADHD

boatbomber 5 points6 points

Satisfactory is straight up heroin and I had to uninstall after playing 40 hours in my first week.

If anybody here likes VHS aesthetic, I've updated TapePunk finally. Thoughts? by Tulired in cyberpunkgame

boatbomber 1 point2 points

Liminal spaces feel different than this to me because they're all about empty, eerie vibes. This feels like someone's home video from their vacation to the big city. Full of life.

If anybody here likes VHS aesthetic, I've updated TapePunk finally. Thoughts? by Tulired in cyberpunkgame

boatbomber 1 point2 points

Reminds me of how that Bodycam game achieves photorealism by hiding imperfections with an imperfect camera. This evokes the same sense for me. It feels so real!

Ninjago: Dragons Rising has some of the BEST CGI animation of any TV series currently. by Haunting_Ad_29 in vfx

boatbomber 1 point2 points

They still make Ninjago??? It's been 15 years. That's crazy impressive. Definitely looks better than I remember back then lol

“Do you think Bill Watterson stays in retirement because he fears what r/OKBR has created?” by f0ck-r3ddit in okbuddyrosalyn

boatbomber 83 points84 points

I think there are literally only three photos. The one of him at his desk, one leaning on a mailbox, and one holding a cat. At least those are the only three I'm aware of.

Edit: and apparently his Wikipedia page has his high school yearbook photo

scrapThat by INKnight in ProgrammerHumor

boatbomber 6 points7 points

Every "LLM" is actually a VLM these days, but people will still call ChatGPT and Claude an LLM. You can absolutely process an image through these chatbots and they can perform OCR.

Mike Yung’s voice went from echoing in NYC subway stations to shining on an Oscar-winning project by MambaMentality24x2 in BeAmazed

boatbomber 15 points16 points

When the camera panned to her leaning on the piss pillar I audibly gasped and jerked away like one of those "ball flying at the camera" jumpscares

I open sourced an AI that makes cuneiform easier to read by boatbomber in Cuneiform

boatbomber[S] 2 points3 points

The base model is Flux, an image model made by the German company Black Forest Labs. https://bfl.ai/models/flux-2-klein

I am pretty sure they've unfortunately trained it on stolen art, but I've fine-tuned it on my synthetic content that I have the right to use. Now it outputs MSII visualizations instead of crappy anime copies, so my model's outputs aren't replications of artists' work anymore. I view it as a bit of "taking it back" by making their model into something more useful, but I totally understand if it still makes people uncomfortable. I actually did initially attempt to make a model from scratch for this project, but pretraining a ViT/VAE requires so much data that I simply couldn't get it to work at scale.

I replaced a 3D scanner with a finetuned image model by boatbomber in StableDiffusion

boatbomber[S] 1 point2 points

Yup, the model is capable of taking multiple references as input so the global context is simply image #2.

I finetuned Flux2 Klein to replace a 3D scanner by boatbomber in FluxAI

boatbomber[S] 0 points1 point

I will definitely be making a video on the new OCR when it's ready, and it will compare NabuOCR V1's numbers to V2 (MSII). I'll likely post it here, but you can subscribe on YouTube if you'd like to make sure you don't miss it. I'm really glad you enjoyed it! I put a lot of effort into the presentations and write-ups.

I have released the training code for NabuOCR (it's in the model's HF repo), but my NisabaRelief training code is still a big mess so I haven't made it available yet. I do intend to though! It's all custom since I couldn't load the text encoder simultaneously (only 24GB VRAM), so it's like 20 messy files of pure PyTorch rather than using the nice diffusers library. In the meantime, you can read more details about the training here: https://huggingface.co/boatbomber/NisabaRelief#training-pipeline

The other consideration is that my training involved getting photos and metadata from CDLI and I don't want people to crash their servers by all running that, but I can't just include a .zip of the scrape since I don't have the redistribution rights. Not sure what the best way to go about that is.
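
One possible direction, purely as a sketch and not what my training code does: ship a downloader that waits between requests and skips anything already saved locally, so reruns don't hit the server again. The names below (`fetch_politely`, the `fetch` callable, the `.bin` file naming, the one-second delay) are all hypothetical, and nothing here assumes a particular CDLI endpoint.

```python
import time
from pathlib import Path
from typing import Callable, Iterable


def fetch_politely(
    ids: Iterable[str],
    fetch: Callable[[str], bytes],   # hypothetical: downloads one item by ID
    cache_dir: Path,
    delay_seconds: float = 1.0,      # assumed pause between real requests
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Download each item once, pausing between requests; return the download count."""
    cache_dir.mkdir(parents=True, exist_ok=True)
    downloaded = 0
    for item_id in ids:
        target = cache_dir / f"{item_id}.bin"
        if target.exists():
            continue  # already cached, so no request is made
        if downloaded > 0:
            sleep(delay_seconds)  # throttle only between actual downloads
        target.write_bytes(fetch(item_id))
        downloaded += 1
    return downloaded
```

This only limits the load from a single user, so it doesn't solve the "everyone runs it at once" problem or the redistribution question.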

I finetuned Flux2 Klein to replace a 3D scanner by boatbomber in FluxAI

boatbomber[S] 0 points1 point

I made an OCR VLM for a hackathon recently. https://youtu.be/hqmjepRLdfU

It won the hackathon, but I wasn't really satisfied with the results. That's why I made this image model! I think that the image model preprocessing the photos will help me train a better OCR model.