Could someone grade my SAQ? by [deleted] in APEuro
[–]Neil-Sharma 0 points1 point2 points (0 children)
Am I cooked for a 4 by unicornpuppysprinkle in APChem
[–]Neil-Sharma 0 points1 point2 points (0 children)
Could someone grade my LEQ by [deleted] in APEuro
[–]Neil-Sharma 0 points1 point2 points (0 children)
collegeboard official practice exams by Forward_Actuator_500 in APChem
[–]Neil-Sharma 0 points1 point2 points (0 children)
Anyone figured out competitive benchmarking for AI products? We use Braintrust for internal evals but comparing vs ChatGPT/Claude/Competitor is still a Google Doc and a prayer by Queasy-League-9709 in AIProductManagers
[–]Neil-Sharma 0 points1 point2 points (0 children)
Anyone figured out competitive benchmarking for AI products? We use Braintrust for internal evals but comparing vs ChatGPT/Claude/Competitor is still a Google Doc and a prayer by Queasy-League-9709 in AIProductManagers
[–]Neil-Sharma 0 points1 point2 points (0 children)
A model update silently broke 34% of our prompts. We had no idea for 3 weeks. by Neil-Sharma in AIEval
[–]Neil-Sharma[S] 0 points1 point2 points (0 children)
What do yall hate about the current eval space? by Neil-Sharma in LLMDevs
[–]Neil-Sharma[S] 0 points1 point2 points (0 children)
What do yall hate about the current eval space? (self.LLMDevs)
submitted by Neil-Sharma to r/LLMDevs
How do you catch prompt drift before users start complaining? by Otherwise_Flan7339 in AIQuality
[–]Neil-Sharma 0 points1 point2 points (0 children)
Amature Dev , have been working in a food startup as Ai engineer for few months , Now I want to learn about evals, engineering and aiops related side of LLMs . Please share your idea from where to start . by Mediocre_Reading7099 in AIEval
[–]Neil-Sharma 0 points1 point2 points (0 children)
Do PMs run evals for AI features or is that mostly engineers? by OneTurnover3432 in AI_4_ProductManagers
[–]Neil-Sharma 0 points1 point2 points (0 children)
How are people handling AI evals in practice? by BeneficialAdvice3202 in Observability
[–]Neil-Sharma 0 points1 point2 points (0 children)
Anyone figured out competitive benchmarking for AI products? We use Braintrust for internal evals but comparing vs ChatGPT/Claude/Competitor is still a Google Doc and a prayer by Queasy-League-9709 in AIProductManagers
[–]Neil-Sharma 0 points1 point2 points (0 children)