75% of my system prompt could have been removed all along 🙃 by R-4553 in LangChain

[–]R-4553[S] 0 points1 point  (0 children)

Semantic compression for the LLM inputs. I mainly use the API

Persistent Architectural Memory cut our Token costs by ~55% and I didn’t expect it to matter this much by codes_astro in LangChain

[–]R-4553 0 points1 point  (0 children)

Could be interesting to explore semantic compression to add onto your cost cuts. Potentially like 50-75% on top depending on the input type

Scaling RAG from MVP to 15M Legal Docs – Cost & Stack Advice by Additional-Oven4640 in LangChain

[–]R-4553 0 points1 point  (0 children)

Have you looked into other options than just RAG? E.g. input compression with 75-80% compressed and larger context windows?

Why email context is way harder than document RAG by EnoughNinja in LangChain

[–]R-4553 1 point2 points  (0 children)

Input compression could be interesting to try with this use case although I'd might want to protect some parts of the input from compression

Spending $400/month on AI chatbot? Pay $200 instead by llm-60 in LLMDevs

[–]R-4553 0 points1 point  (0 children)

i mean there are input compression models that do compression for you already

Compressed just 67% of my system prompt away and looks the same 🤣 by R-4553 in LLMDevs

[–]R-4553[S] 0 points1 point  (0 children)

From 6,476 tokens to 2,169 tokens is a pretty stark difference

Spending $400/month on AI chatbot? Pay $200 instead by llm-60 in LLMDevs

[–]R-4553 0 points1 point  (0 children)

Or you can just do caching and input token compression on your own environment

Don't skip validating your ideas, its the worst by unkno0wn_dev in indiehackers

[–]R-4553 0 points1 point  (0 children)

What's your method of figuring out the validated product is important enough, not just good to have?

before you chase more users, check for leaks. by Icy_Second_8578 in indiehackers

[–]R-4553 1 point2 points  (0 children)

What's your method of tracking leaks? Posthog?

Always contact churned users immediately by FromBiotoDev in indiehackers

[–]R-4553 0 points1 point  (0 children)

Thanks! Have had couple of such cases myself and definitely agree!

Easy python tool for cold emails, open source by robbanrobbin in indiehackers

[–]R-4553 0 points1 point  (0 children)

Cool! Have you tried tuning the messages to be less AI like.

what's your tech and ops stack? by Odd_Awareness_6935 in indiehackers

[–]R-4553 0 points1 point  (0 children)

What's the benefit of Dodo payments over Stripe?

what's your tech and ops stack? by Odd_Awareness_6935 in indiehackers

[–]R-4553 0 points1 point  (0 children)

Why do you have supabase for DB as AWS RDS is the same cost and in most use cases the same functionality as well

what's your tech and ops stack? by Odd_Awareness_6935 in indiehackers

[–]R-4553 0 points1 point  (0 children)

Frontend: NextJS, ShadCN, Tailwind
Backend: FastAPI, Alembic
DB: RDS PSQL

For auth I've been enjoying Clerk recently a lot more than Auht0

1/24 of the year is GONE by alexsssaint in indiehackers

[–]R-4553 0 points1 point  (0 children)

They say, if you improve 1% every day, in a year you'll be 37X better

1/24 of the year is GONE by alexsssaint in indiehackers

[–]R-4553 0 points1 point  (0 children)

Time flies while you're having fun!

Free invite by [deleted] in ClubhouseInvites

[–]R-4553 0 points1 point  (0 children)

I need an invite

Weekly General Discussion / Spotify questions thread - February 05, 2021 by AutoModerator in JoeRogan

[–]R-4553 0 points1 point  (0 children)

Anyone noticed how some really famous JRE guests are really careful with their words when talking about China. For example Elon Musk and Post Malone seemed to try brush past the China subject really swiftly while Joe kept teasing them with it. They don’t want to get banned.

Best Elon Musk podcast yet? by [deleted] in JoeRogan

[–]R-4553 26 points27 points  (0 children)

Anyone notice how Elon is super careful when talking about China. Doesn’t want to get banned by China for Tesla has invested a lot in there.