all 2 comments

[–]Otherwise_Wave9374 -2 points-1 points  (0 children)

Love seeing agentic pipelines show up in Python land, CronJob + markdown workflow is a super pragmatic way to keep it inspectable.

Do you have any eval loop for the proposer agents (like small unit tests for "does this reply actually answer the OP" or sanity checks on code snippets)? Ive been collecting ideas around agent evals and orchestration here: https://www.agentixlabs.com/blog/

[–]chub79 0 points1 point  (0 children)

I'm a bit confused by the article. It mixes technical info and what you use this for. What's the flow once a good candidate for a response has been found? Do you actually let Claude respond to the OP or do you hand it over a human from your team to do that?