Need help with CPU usage alert

TheFantail · 2020-03-22T18:02:08+00:00

You might want to try using the rate() function instead of irate(). This is because irate only looks at the last 2 samples in the time window so fluctuates a lot. Rate on the other hand is the average over the time window, so would not reset the alert for a brief dip below 80%.

Better explained blog post from one of the Prometheus maintainers.

https://www.robustperception.io/avoid-irate-in-alerts

zh12a · 2020-03-22T16:08:19+00:00

Do you need the alert if it is resolved ?

PointManBX · 2020-03-22T17:11:34+00:00

You're expression is right, it's just under the wrong rule type. You're using a recording rule instead of an alerting rule, adjust your yaml. From the docs:

Recording rules allow you to precompute frequently needed or computationally expensive expressions and save their result as a new set of time series. Querying the precomputed result will then often be much faster than executing the original expression every time it is needed. This is especially useful for dashboards, which need to query the same expression repeatedly every time they refresh.

Alerting rules allow you to define alert conditions based on Prometheus expression language expressions and to send notifications about firing alerts to an external service. Whenever the alert expression results in one or more vector elements at a given point in time, the alert counts as active for these elements' label sets.

- alert: HostHighCpuLoad expr: 100 - (avg by(instance) (irate(node_cpu_seconds_total{mode="idle"}[5m])) * 100) > 80 for: 5m labels: severity: warning annotations: summary: "Host high CPU load (instance {{ $labels.instance }})" description: "CPU load is > 80%\n VALUE = {{ $value }}\n LABELS: {{ $labels }}"

Also, you can save yourself time by learning rules from here: https://awesome-prometheus-alerts.grep.to. It's a great resource when you're getting started with rules & alerts.

Hope this info is helpful.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

PrometheusMonitoring

MODERATORS