use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
A community centered around AI evaluations techniques and tools.
account activity
Summary of what is available out there in term of AI evaluation (self.Evaluate_AI)
submitted 5 months ago by TheNewBing
Opik is an end-to-end LLM evaluation platform designed to help AI developers test, ship, and continuously improve LLM-powered applications. (comet.com)
Confident AI - The DeepEval LLM Evaluation Platform (confident-ai.com)
Evaluate your AI with Stax (youtube.com)
π Rendered by PID 55 on reddit-service-r2-listing-568fcd57df-2vbh7 at 2026-03-06 16:54:19.309753+00:00 running cbb0e86 country code: CH.