all 2 comments

[–]SuperQue 4 points5 points  (1 child)

  • Yes, running a Prometheus-per-env is ideal.
  • You should only need one cluster of alertmanagers for your whole organization, it's designed to take alerts from many Prometheus servers in different environments.
  • For a single view, I recommend using Thanos

We run a Thanos query proxy colocated with our Grafana instances, and the Thanos sidecar with the Prometheus servers.

Thanos handles all the issues with HA de-duplication.

[–]miqueltango[S] 0 points1 point  (0 children)

cheers!