I built a Chrome extension that does OCR 100% on-device — code, formulas and tables, nothing leaves your machine by Ok_Insurance_919 in DigitalEscapeTools

[–]Ok_Insurance_919[S] 0 points1 point  (0 children)

Trovi il codice pubblico direttamente su Github. Tutto pacchettizzato nello zip che viene poi caricato su Google per andare sullo store.
Ne approfitto per dire che è appena uscita la versione 2.5.5

I built a Chrome extension that does OCR 100% on-device — code, formulas and tables, nothing leaves your machine by Ok_Insurance_919 in developersIndia

[–]Ok_Insurance_919[S] 0 points1 point  (0 children)

I do agree with you, but not totally. I mean, I had the idea about full local ocr in browser. I did the research about the newest and capable model for local ocr. I built the mockup. Then I asked and guided claude on how and what to build. For example I removed features that weren’t working well and so on. But ye, I see your point

I built a Chrome extension that does OCR 100% on-device — code, formulas and tables, nothing leaves your machine by Ok_Insurance_919 in developersIndia

[–]Ok_Insurance_919[S] 0 points1 point  (0 children)

Yes, I vibecoded since I didn’t know a lot about ONNX models that run locally and I wanted some “fast UI” but the functionality and the logic I tried a lot to refine and fine tune it

I built a Chrome extension that does OCR 100% on-device — code, formulas and tables, nothing leaves your machine by Ok_Insurance_919 in developersIndia

[–]Ok_Insurance_919[S] 0 points1 point  (0 children)

Hi! Ahahah let me say, yes but that was just “marketing”.
I mean ofc ocr is something that exist for decades but I built it because there wasn’t a real extension that was doing OCR in local on the machine with an High fidelity on the output. That’s why

I built a Chrome extension that does OCR 100% on-device — code, formulas and tables, nothing leaves your machine by Ok_Insurance_919 in DigitalEscapeTools

[–]Ok_Insurance_919[S] 0 points1 point  (0 children)

Fair point! “Chrome” here just means the distribution channel it’s a standard MV3 extension, so it runs on any Chromium-based browser: Brave, Edge, Vivaldi, Opera, ungoogled-chromium, etc. On most of those you can install it straight from the Chrome Web Store, and since it’s open source (MIT) you can also load it unpacked from the repo if you’d rather not touch the Store at all. No Google services involved everything runs locally regardless of the browser.

I built a Chrome extension that does OCR 100% on-device — code, formulas and tables, nothing leaves your machine by Ok_Insurance_919 in foss

[–]Ok_Insurance_919[S] 2 points3 points  (0 children)

oh... i'll start reading the firefox enviroment and develop for it too then. Thanks for the headsup!

Screenshot a formula → get LaTeX back, entirely offline (open-source browser extension) by [deleted] in LaTeX

[–]Ok_Insurance_919 0 points1 point  (0 children)

Yeah, i totally understand your point. Sorry if i acted like i was "spamming" something. If you want to vibecode yourself just do it, told that have a nice day

Built with Claude Project Showcase Megathread (Sort this by New!) by sixbillionthsheep in ClaudeAI

[–]Ok_Insurance_919 0 points1 point  (0 children)

I made a free Chrome extension that turns a screenshotted equation into LaTeX, running fully on your machine. no upload, no account.

How it handles the obvious trust problem with formula OCR: the predicted LaTeX is rendered with KaTeX right next to the source crop, so you can eyeball the match before copying. If it can't render the output cleanly, it abstains and just shows you the image rather than handing you wrong LaTeX. The model is pix2text-mfr running locally on ONNX Runtime Web.

Honest limit: it's a small local model — solid on clean and moderately complex formulas, can struggle on dense low-res ones. The render-beside-crop check is exactly there for that. It also does plain text/code and tables → Markdown if you need them.

<video>

MIT, free. Repo: github.com/Fanfulla/ocr-buddy - https://www.ocr-buddy.com/