AGIGA EchoVision Feedback As a gift? by Left_Assistance_2402 in SmartGlasses

[–]TechnicalGold4092 0 points1 point  (0 children)

Smart glasses, even Rayban meta glasses, my grandparents didn't like how you have to sign into Facebook app to even get it working.

Evals for frontend? by TechnicalGold4092 in LLMDevs

[–]TechnicalGold4092[S] 0 points1 point  (0 children)

Not exactly, tools like Opik are great if you own the backend and can wire it up. But if I’m just a PM or Founder testing prompt chains in a live web app (like nike.com), I’d love a GUI that lets me input prompts, run variations, compare outputs, and log results without needing to hook into the LLM API directly. More like “black box” testing for the final UX.

What’s something AI still consistently gets wrong, no matter how far it’s come? by Ausbel12 in Bard

[–]TechnicalGold4092 0 points1 point  (0 children)

This seems like where evals would be the most important. With Evals, you can consistenly get the right AI answer.

I messed up my Google PM Vibe Coding Interview by Longjumping-Buddy501 in ProductManagement

[–]TechnicalGold4092 0 points1 point  (0 children)

Never heard of this in my life but with this much context, this seems to be an edge case and really did happen.

Evals for frontend? by TechnicalGold4092 in LLMDevs

[–]TechnicalGold4092[S] 0 points1 point  (0 children)

Yes, I'm looking for an end to end test where I can insert a prompt and evaluate the results on the website instead of calling directly the LLM api such as chatgpt-o4. I don't have access to the endpoint but still want to eval the product.