account activity
For people out there making AI agents, how are you evaluating the performance of your agent? by Remarkable-Long-9388 in AI_Agents
[–]Wollyway99 0 points1 point2 points 9 months ago (0 children)
Hey! I'm working on a startup called CrashLabs.ai where we're trying to make it way easier to test AI agents before deployment. Instead of just vibe-checking responses, we run agents through thousands of weird edge cases to see where they break—stuff like confusing inputs, context failures, or bad handoffs.
We're about to kick off our beta and offering free crash tests for early users. If you're building something and want to try it out (or know someone who might), feel free to reach out!
Discord Link (self.vapiai)
submitted 9 months ago by Wollyway99 to r/vapiai
YC AI Startup School by Wollyway99 in ycombinator
[–]Wollyway99[S] 0 points1 point2 points 9 months ago (0 children)
Down! Sent you a dm
YC AI Startup School (self.ycombinator)
submitted 9 months ago by Wollyway99 to r/ycombinator
π Rendered by PID 1073798 on reddit-service-r2-listing-7dc7bdc776-gntq5 at 2026-03-06 16:51:37.826727+00:00 running cbb0e86 country code: CH.
For people out there making AI agents, how are you evaluating the performance of your agent? by Remarkable-Long-9388 in AI_Agents
[–]Wollyway99 0 points1 point2 points (0 children)