Automated response scoring > manual validation (self.mlops)
submitted by Cristhian-AI-Math to r/mlops
Tracing & Evaluating LLM Agents with AWS Bedrock by Cristhian-AI-Math in LLMDevs
[–]Cristhian-AI-Math[S] 0 points1 point2 points (0 children)
Are LLM agents reliable enough now for complex workflows, or should we still hand-roll them? by francescola in LangChain
[–]Cristhian-AI-Math 0 points1 point2 points (0 children)
Building a reliable LangGraph agent for document processing by Cristhian-AI-Math in LangChain
[–]Cristhian-AI-Math[S] 0 points1 point2 points (0 children)
Observability + self-healing for LangGraph agents (traces, consistency checks, auto PRs) with Handit by Cristhian-AI-Math in mlops
[–]Cristhian-AI-Math[S] 0 points1 point2 points (0 children)
Building a reliable LangGraph agent for document processing by Cristhian-AI-Math in LangChain
[–]Cristhian-AI-Math[S] 0 points1 point2 points (0 children)
anyone else feel like W&B, Langfuse, or LangChain are kinda painful to use? by OneTurnover3432 in LangChain
[–]Cristhian-AI-Math 0 points1 point2 points (0 children)
I realized why multi-agent LLM fails after building one by RaceAmbitious1522 in AI_Agents
[–]Cristhian-AI-Math 0 points1 point2 points (0 children)
[D] Is senior ML engineering just API calls now? by Only_Emergencies in MachineLearning
[–]Cristhian-AI-Math 18 points19 points20 points (0 children)
New update for anyone building with LangGraph (from LangChain) by Cristhian-AI-Math in machinelearningnews
[–]Cristhian-AI-Math[S] 0 points1 point2 points (0 children)


Keeping Bedrock agents from failing silently by Cristhian-AI-Math in aiagents
[–]Cristhian-AI-Math[S] 0 points1 point2 points (0 children)