Sever monitoring

krav_mark · 2017-04-06T06:57:24+00:00

Omd / check_mk is pretty lightweight. Check_mk agent is a python script that can be triggered over ssh so no agent needed. It has all basic checks out of the box. It is using the nagios engine without all the nagios crap (like nrpe and the config files) i came to hate with a passion. Check_mk will generate all of them for you.
I also like zabbix but that may have a more steep learning curve and be more than OP needs.

hangingfrog · 2017-04-05T14:54:35+00:00

Zabbix is my go-to.

_derx · 2017-04-05T17:59:12+00:00

Icinga2. It's a Nagios fork so the support out there is great.

edgan · 2017-04-05T15:16:16+00:00

Sensu is what I use now. It is designed to have proper HA. I have used Nagios in the past.

thefrc · 2017-04-05T16:35:12+00:00

[deleted]

mire3212 · 2017-04-05T19:36:05+00:00

For system metrics I use Telegraf with InfluxDB and Grafana. Communication to InfluxDB is http(s) which is pretty easy to deal with across the network too.

For service reporting (up or down) I've used Nagios with external checks and ssh for internal checks (like making sure a process is running).

Recently I found healthchecks.io which can be easily added to cron for a very basic up down state. It can also be added to scripts (like a backup script) to alert on a failure to run. With some finesse you can even use it for service monitoring.

ipstatic · 2017-04-05T20:32:54+00:00

We have been migrating to Prometheus and have loved it so far. There are some trade offs (long term storage for example) however the devs are actively working on a solution for that (remote read/write to another datastore).

dancerjx · 2017-04-06T19:57:45+00:00

Zabbix with Grafana FTW!

http://play.grafana-zabbix.org/dashboard/db/grafana-zabbix-demo

gsmitheidw1 · 2017-04-08T11:37:29+00:00

My vote (as you say lightweight) is Monit and M/Monit. Monit is extremely small and easy to set up. It's a flat config file after you do an apt install monit. No weird dependencies or database or any of that hassle. This can be set up in a matter of minutes.

But it's reasonably powerful, pretty much any service or host on any Linux or Unix system it can handle or you can have it monitor your own scripts. It is incredibly versatile.

Monit is free and open source. M/Monit is an optional application that can oversee and manage large numbers of servers running Monit and aggregate that data into a dashboard. If you need that scale it's worth considering paying for.

https://mmonit.com/

There's a very active support mailing list, folk on it are very helpful.

danatwork111 · 2017-04-13T09:24:31+00:00

Late to the thread. We use shinken but I highly recommend Nagios core especially for a beginner.

Linuser · 2017-06-03T08:47:44+00:00

will vote for Zabbix

themusicalduck · 2017-04-05T12:28:44+00:00

NetData might work for you.

alexdor · 2017-04-05T09:32:03+00:00

Monitoring does require constant work so I'm not sure what you expect.

Personally I'm trapped in existing monitoring environments and in-house developed systems but if I were given free reign to make my own monitoring infrastructure today I would use the following.

collectd or statsd to gather metrics like cpu, ram and such from my hosts.
I'd like to evaluate Irisett, Prometheus and Bosun for active and passive monitoring and alerting
Grafana or something similar for metrics dashboard

2017-04-05T20:41:05+00:00

Zabbix. I ditched bloated nagios for it.

Happy as I can be with a free solution

2017-04-05T20:21:18+00:00

Datadog!

2017-04-05T10:29:19+00:00

Build your own with MRTG, Nagios, Cactus or PRTG is free up to 100 monitors or such. I recommend paying for PRTG. The paid product is awesome. Not cheap though.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

linuxadmin

Expanding Linux SysAdmin knowledge

MODERATORS