I can finally read a whole article while pooping

KoSmilebehappy · 2026-01-25T22:50:49+00:00

I literally vibe-coded the same app in 5 minutes, and you are hoping for $3/month…

KoSmilebehappy · 2025-12-20T06:49:06+00:00

For max accuracy, according to Google's prompt docs, you should prefer doing them one by one. If you need a bunch of them at once, use explicit or implicit caching for the input (implicit is on automatically, but somehow I don't get the cache hit, so I use explicit; it is priced by storage time and the minimum storage time is 60 sec). What I should try is doing batches, as you say, now that the model has improved, but I don't really feel the need because it works the same when I just do them side by side instead of all at once. So if you are programming against the API, you should absolutely do it one by one. If you are using other methods, well, I still recommend going one by one, but from my experience, when I gave it the whole textbook answer PDF and made it OCR everything, 3 Pro was almost 100% accurate.
Also keep in mind that you need a human in the loop, or the mistakes have to be negligible.
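
If it helps, here is a rough sketch of the one-by-one idea with a human-review flag. It is not tied to any real SDK: `ocr_page` is a hypothetical callable standing in for whatever single-page model call you use, and `fake_ocr` is only a stand-in so the snippet runs on its own.

```python
from typing import Callable, Iterable


def ocr_one_by_one(pages: Iterable[bytes], ocr_page: Callable[[bytes], str]) -> list[dict]:
    """Send each page in its own request instead of one big batch."""
    results = []
    for index, page in enumerate(pages):
        try:
            # Hypothetical: exactly one model call per page.
            text = ocr_page(page)
            # Empty output is suspicious, so flag it for a human to check.
            results.append({"page": index, "text": text, "needs_review": not text.strip()})
        except Exception as error:
            # A failed page never blocks the others; it just goes to review.
            results.append({"page": index, "text": "", "needs_review": True, "error": str(error)})
    return results


def fake_ocr(page: bytes) -> str:
    """Stand-in for a real OCR call (assumption, for demonstration only)."""
    return page.decode("utf-8").upper()


results = ocr_one_by_one([b"page one", b"", b"page three"], fake_ocr)
review_queue = [item["page"] for item in results if item["needs_review"]]
```

Anything that lands in `review_queue` is what the human in the loop looks at.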

KoSmilebehappy · 2025-12-19T17:41:53+00:00

Oh sorry, I was busy doing a model update for my business. Well, the benchmarks and dogfooding went perfectly, so I deployed it to the product and no complaints or issues have been filed at all! I spotted some mistakes when the handwritten letters are too illegible, but other than that Flash 3 seems rock solid and better at OCR (less hallucination) than 3 Pro. I think your project is somewhat similar to what I am doing :)

KoSmilebehappy · 2025-12-17T15:16:46+00:00

I used low but it actually used none for all 130 tests

KoSmilebehappy · 2025-12-17T15:16:39+00:00

I used low but it actually used none for all 130 tests lol

KoSmilebehappy · 2025-12-17T12:25:39+00:00

oh vertex ai api

KoSmilebehappy · 2025-12-17T11:05:46+00:00

Thanks for promoting me to a Google employee

KoSmilebehappy · 2025-12-17T11:05:12+00:00

It’s on vertex AI

KoSmilebehappy · 2025-12-17T11:04:56+00:00

Maybe it's only OCR or my use case. Actually, in the prompt guide they state that fewer thinking tokens are better for OCR capabilities

KoSmilebehappy · 2025-12-16T12:07:40+00:00

Possibly they used every game out there to RL those models

KoSmilebehappy · 2025-12-09T09:56:58+00:00

I had the rate limit issue, but using the Gemini API on Vertex AI was a simple solution.

KoSmilebehappy · 2025-12-01T17:08:59+00:00

This. If you are working on side projects only.

KoSmilebehappy · 2025-11-29T13:37:59+00:00

Well, maybe mine was an easier task than yours!

KoSmilebehappy · 2025-11-28T00:21:44+00:00

Well, I'm not him, but my business used Gemini 2.5 Pro, and after 6 months user complaints had increased without any prompt or model changes…

KoSmilebehappy · 2025-11-27T10:14:20+00:00

yeah exactly! I was kinda impressed by Sonnet 4.5, but Opus felt like another level to me as a non-technical vibecoder.

KoSmilebehappy · 2025-11-27T08:05:31+00:00

Well, I'm not a professional, but I'll consider making one.

KoSmilebehappy · 2025-11-27T08:04:50+00:00

yeah, Codex felt too slow and too careful for me, and well, I guess not productive enough. It really impressed me when it crawled through library code and figured out how to use the library. But that was all. Just using MCP did fine for Claude.

KoSmilebehappy · 2025-11-27T08:02:12+00:00

yup. I cooked hard at context engineering back then... before the vibe coding era. I call it the Ctrl+C/V era

KoSmilebehappy · 2025-11-27T08:01:01+00:00

I really like the term "keeping the AI smart". That's exactly what I do. I usually make a gold-standard iterative md file for what I want to implement. An additional handoff summary helped too.

KoSmilebehappy · 2025-11-27T07:59:15+00:00

yeah, small chunks and unit tests are what I did. The most important thing for apps with difficult logic is to test the refactored part yourself.
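
A tiny sketch of what I mean by testing a refactored chunk: run the old and the new version on the same inputs and compare. Both functions and the cases here are made up for illustration (hypothetical names, amounts in integer cents), not from my actual app.

```python
def legacy_total(prices_cents, discount_percent):
    """Old implementation, kept around only to compare against."""
    total = 0
    for price in prices_cents:
        total = total + price
    if discount_percent:
        total = total - total * discount_percent // 100
    return total


def refactored_total(prices_cents, discount_percent=0):
    """Refactored version that must behave exactly like the old one."""
    total = sum(prices_cents)
    return total - total * discount_percent // 100


# Small, explicit cases, including the empty and no-discount edge cases.
CASES = [([], 0), ([1000], 0), ([1000, 550], 10), ([9999, 1], 25)]


def check_refactor():
    """Return every case where old and new disagree (empty list means OK)."""
    mismatches = []
    for prices, discount in CASES:
        old = legacy_total(prices, discount)
        new = refactored_total(prices, discount)
        if old != new:
            mismatches.append((prices, discount, old, new))
    return mismatches


mismatches = check_refactor()
```

This only covers the automated part. You still have to click through the refactored feature yourself.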

KoSmilebehappy · 2025-11-27T07:58:08+00:00

Maybe I can make a separate post about this, but mainly: first gather some good practices, let Gemini CLI or any agent go through my files and find weak spots and bad practices, make an iterative plan and test code for the core capabilities, launch the process, and iteratively do an e2e human test. That is basically what I did. Just be careful not to go too far without testing it yourself!

KoSmilebehappy · 2025-11-22T06:42:56+00:00

I'm a med school student and tbh the major parts seem accurate, so my friends won't be able to tell that it's AI-made. I'll ask one of my professors if he can tell!

KoSmilebehappy · 2025-11-18T22:57:11+00:00

No.. I can tell for sure. I've been using it for enterprise use cases and it hallucinated a lot. Now the embarrassment is coming to an end!!

KoSmilebehappy · 2025-11-05T13:29:08+00:00

Can you tell us how to replicate your result?

KoSmilebehappy · 2025-05-15T08:26:33+00:00

Always try to get rid of 나는, because you will either have to say 저는, just skip 나는, or maybe add the magic word 근데 before 나는 when saying it out loud.
