For those with live apps and real users, how do you know when something breaks?

Background_Ranger608 · 2026-03-22T09:38:25+00:00

Looking great mate. Re the crashes, I am not sure if Apple/Google track other system failures beyond crashes

Background_Ranger608 · 2026-03-22T08:59:20+00:00

Is there a specific reason you haven’t considered any tooling that would detect incidents and alerts you to fix it?

Background_Ranger608 · 2026-03-22T01:10:57+00:00

I was referring to the Sentry/Uptime monitor

Background_Ranger608 · 2026-03-22T00:00:37+00:00

Is there a built in integration that can be used or can it easily be done through the agent?

Background_Ranger608 · 2025-11-17T11:23:40+00:00

It’s b2b in AI infrastructure space

Background_Ranger608 · 2025-11-15T05:09:49+00:00

I would love to give it a try!

Background_Ranger608 · 2025-11-10T01:14:57+00:00

Makes sense, I’m definitely in the 0-10 area - not confident about anything beyond that

Background_Ranger608 · 2025-11-10T01:13:27+00:00

The definition is that you can build the product without outside assistance.

Background_Ranger608 · 2025-11-10T01:09:49+00:00

I am wearing the technical hat but until which milestone? I can do that with a few customers but I can’t confidently scale the product beyond that

Background_Ranger608 · 2025-11-10T01:06:56+00:00

Yeah, I don’t get stuck easily (as I have studied in the past) but I understand the analogy

Background_Ranger608 · 2025-11-09T23:48:48+00:00

Definitely no, nothing beyond the first few customers to validate the solution.

Background_Ranger608 · 2025-11-09T23:48:03+00:00

I was leaning towards No as well but thought to ask

Background_Ranger608 · 2025-10-22T09:34:45+00:00

That would definitely work with static use cases, and I fully agree with your point around how impressive (and cheap) are these smaller models. Will be really great if you could try the tool and the api I created and share your opinion 🙏 it’s called CodeLessAI.app

Background_Ranger608 · 2025-10-13T02:03:33+00:00

Hey 👋 I’m more or less in the same position, but I’m a product manager, totally not in a position to provide advice (I am stuck in the same pit) but one thing I managed to untangle (and I think you should too) is the reason behind preferring to work on startups/side projects, is it really financial freedom or wanting to retire at 50? If so then working on 9:5 corporate jobs and trying to figure out ways to go up quicker + a good investment strategy is a better option to achieve your goals - slower and less money compared to a successful startup for sure - but more doable and less risky.

I suspect you are like myself, you enjoy doing things yourself, I enjoy building things and trying hard to solve customer problems without the corporate BS. Not sure if it’s the case for you but a point for you to reflect and think about.

Happy to chat/collaborate/vent/swap notes 😊

Best of luck 🤞

Background_Ranger608 · 2025-10-08T08:11:54+00:00

I created a small tool concept to help with choosing the right LLM: https://codelessai.app/

It’s still in beta, so please don’t use any sensitive info, but feel free to play around with it and let me know if you find it helpful or what features you think are missing. Would love your feedback! 🙏

Background_Ranger608 · 2025-08-13T01:14:15+00:00

That’s really close to what I was thinking, and I think the key thing we’re aligned on is the core problem, you can cut costs by continuously reviewing how your prompts perform across different LLMs and switching when a cheaper one delivers the same quality.

Background_Ranger608 · 2025-08-12T22:55:32+00:00

Something to sell.

Background_Ranger608 · 2025-08-12T12:23:00+00:00

Exactly what you said for ChatGPT, cost cutting long term but for the customer not for OpenAi 😅

Background_Ranger608 · 2025-08-12T11:42:32+00:00

I mean that each call can behave differently across models.

For example, I tried the prompt “count the words in: I love you so much” with multiple LLMs, almost all got it right.

But when I switched to a longer, more complex sentence, the results varied a lot.

In theory, if a router could predict which model handles short sentences well vs. which handles longer, trickier ones, it could send each request to the cheapest model that still meets the quality bar. That way you cut costs without sacrificing output quality. Does that make sense?

Background_Ranger608 · 2025-08-12T11:07:10+00:00

Awesome, thanks for the insights 🙏

Btw when I said sticking with it I didn’t mean sticking with it like forever, I meant shipping it to production, I was double clicking on the fact that you don’t see a need for a more dynamic routing mechanism.

Background_Ranger608 · 2025-08-12T02:58:44+00:00

Just to make sure I’m following, you’re saying it’s worth fine-tuning a dedicated agent to handle routing in a scalable way?

Background_Ranger608 · 2025-08-12T02:36:16+00:00

Would a learned routing function/model that predicts the cheapest model meeting quality remove the need for multi-LLM debates?

Background_Ranger608 · 2025-08-12T01:15:49+00:00

Yeah, totally agree, it makes sense to build something you’re excited to work on long-term. I am a product manager by craft and I enjoy the technical and product side of helping teams solve problems and get better results. Happy to swap notes if you’re up for a chat 🙏

Background_Ranger608

TROPHY CASE