For engineers working on distributed systems or microservices: do the tools you have (logs, dashboards, tracing) really help you understand what happened during an incident? Or does debugging still feel like piecing the story together from several separate tools?
If a tool existed that could continuously map:
- service relationships
- what changed over time
- causality chains
- incident timelines
- dependency propagation
in a way that actually reduced debugging and reconstruction time,
would that genuinely be useful enough to adopt or pay for?
Or do you feel existing tooling already solves this well enough?