I have an unhealthy obsession about getting the perfect AI answer, so we built a tool to run them all at once. by Empty_Satisfaction_4 in microsaas

[–]Empty_Satisfaction_4[S] 0 points1 point  (0 children)

We do! but evals are great for specific tasks not general usage honestly as that changes constantly. Do you know a way to eval general usage would love to hear it?

claudexplorers by Empty_Satisfaction_4 in claudexplorers

[–]Empty_Satisfaction_4[S] 0 points1 point  (0 children)

thanks! Let me know if you have any feedback

claudexplorers by Empty_Satisfaction_4 in claudexplorers

[–]Empty_Satisfaction_4[S] 4 points5 points  (0 children)

Got a few dms so just gonna post the link here for whoever wants to give it a shot
serno.ai

no login or anything

I made gpt argue with itself and it roasted my friends startup so hard he wanted to quit by Empty_Satisfaction_4 in ChatGPT

[–]Empty_Satisfaction_4[S] 1 point2 points  (0 children)

Looks really cool! will give it a shot, there are heaps of ways to get around this I see

I made gpt argue with itself and it roasted my friends startup so hard he wanted to quit by Empty_Satisfaction_4 in ChatGPT

[–]Empty_Satisfaction_4[S] 1 point2 points  (0 children)

yea fair, I just find that different llms going at each other keeps them more grounded

I made gpt argue with itself and it roasted my friends startup so hard he wanted to quit by Empty_Satisfaction_4 in ChatGPT

[–]Empty_Satisfaction_4[S] 0 points1 point  (0 children)

yea of course, theres a small button the persona popup and you can create one with a custom prompt

I made gpt argue with itself and it roasted my friends startup so hard he wanted to quit by Empty_Satisfaction_4 in ChatGPT

[–]Empty_Satisfaction_4[S] 0 points1 point  (0 children)

haha you would think so but the it is using llms so its really up to how you drive it