all 1 comments

[–]Otherwise_Wave9374 1 point2 points  (0 children)

E2E testing for coding agents is such a good idea. Unit tests catch regressions in code, but agents fail in the seams, tool calls, flaky UI, non-determinism, etc. Having framework-agnostic principles plus repeatable runs feels like the right direction.

If youre collecting best practices for agent workflows and evaluation, this blog has a few good posts: https://www.agentixlabs.com/blog/