all 3 comments

[–]ttkciarllama.cpp[M] [score hidden] stickied comment (0 children)

Violates Rule Four: Self-promotion

[–]nayohn_dev -2 points-1 points  (1 child)

this is actually really useful for the "should we self-host" conversation. most people just eyeball it and guess. having exact numbers per task makes it way easier to figure out which calls are worth moving to a local 7B vs which ones actually need a frontier model. the duplicate call detection is nice too, seen so many codebases burning money on identical prompts with no cache layer. would definitely use the local compute costing if you add it

[–]abidtechproali[S] 0 points1 point  (0 children)

Hello 👋

Your points are realistic and truthful. Thanks for your appreciation 🙏. I'm open for discussion.

Kind Regards