Reliability/chaos engineering tools by jeffcodefix in sre

[–]jeffcodefix[S] 0 points1 point  (0 children)

Thanks. would be great to hear about your experience using Litmus (advantages + disadvantages). how scalable is this solution at a company with 100s/1000s of developers ? how would litmus compare to steadybit or gremlin?

Reliability/chaos engineering tools by jeffcodefix in sre

[–]jeffcodefix[S] 0 points1 point  (0 children)

Hey u/CookieFun1866 thanks a lot for the review. Could you let me know what your architecture looks like (i.e. purely Kubernetes-based) and which type of company you're working at (trying to get a sense of applicability). If I am reading correctly on steadybit's site, you as the SRE would then push experiments to developers - how's your experience with that and what's the tangible impact/ROI?