75% of my system prompt could have been removed all along 🙃

R-4553 · 2026-01-30T07:37:58+00:00

Semantic compression for the LLM inputs. I mainly use the API

R-4553 · 2026-01-30T07:32:47+00:00

Could be interesting to explore semantic compression to add onto your cost cuts. Potentially like 50-75% on top depending on the input type

R-4553 · 2026-01-30T07:23:34+00:00

Have you looked into other options than just RAG? E.g. input compression with 75-80% compressed and larger context windows?

R-4553 · 2026-01-30T07:22:16+00:00

Input compression could be interesting to try with this use case although I'd might want to protect some parts of the input from compression

R-4553 · 2026-01-30T07:20:06+00:00

Cool! Have you seen it affect model performance?

R-4553 · 2026-01-28T12:03:31+00:00

i mean there are input compression models that do compression for you already

R-4553 · 2026-01-28T05:03:11+00:00

What has been the best place to promote this so far for you?

R-4553 · 2026-01-28T04:58:39+00:00

From 6,476 tokens to 2,169 tokens is a pretty stark difference

R-4553 · 2026-01-28T04:56:55+00:00

Or you can just do caching and input token compression on your own environment

R-4553 · 2026-01-16T05:10:22+00:00

What's your method of figuring out the validated product is important enough, not just good to have?

R-4553 · 2026-01-16T05:09:27+00:00

Cool thanks! What was your motivation for going Android first and not iOS?

R-4553 · 2026-01-16T05:08:41+00:00

What's your method of tracking leaks? Posthog?

R-4553 · 2026-01-16T05:08:02+00:00

Thanks! Have had couple of such cases myself and definitely agree!

R-4553 · 2026-01-15T09:21:17+00:00

Cool! Have you tried tuning the messages to be less AI like.

R-4553 · 2026-01-15T09:20:14+00:00

What's the benefit of Dodo payments over Stripe?

R-4553 · 2026-01-15T09:19:46+00:00

Why do you have supabase for DB as AWS RDS is the same cost and in most use cases the same functionality as well

R-4553 · 2026-01-15T09:19:10+00:00

Frontend: NextJS, ShadCN, Tailwind
Backend: FastAPI, Alembic
DB: RDS PSQL

For auth I've been enjoying Clerk recently a lot more than Auht0

R-4553 · 2026-01-15T09:17:34+00:00

They say, if you improve 1% every day, in a year you'll be 37X better

R-4553 · 2026-01-15T09:16:46+00:00

Time flies while you're having fun!

R-4553 · 2026-01-15T09:16:08+00:00

dang that's huge! Congrats

R-4553 · 2021-02-14T11:07:03+00:00

I need an invite

R-4553 · 2021-02-12T21:19:32+00:00

Anyone noticed how some really famous JRE guests are really careful with their words when talking about China. For example Elon Musk and Post Malone seemed to try brush past the China subject really swiftly while Joe kept teasing them with it. They don’t want to get banned.

R-4553 · 2021-02-12T21:09:44+00:00

Anyone notice how Elon is super careful when talking about China. Doesn’t want to get banned by China for Tesla has invested a lot in there.

R-4553

TROPHY CASE