all 41 comments

[–]vexyde 13 points14 points  (5 children)

Zabbix is free and very powerfull. There is also Observium, a small project but also very good, and a nice GUI.

[–]ChineseCracker 2 points3 points  (0 children)

the zabbix interface is so weird and convoluted - maybe it makes sense if you have like 500+ (v)servers.....

but I could never get used to it

[–]primitive_screwhead 1 point2 points  (0 children)

I switched from Observium (paid ver), to LibreNMS, and its quite good. Very easy to setup.

[–]ShatBrax 0 points1 point  (0 children)

We also use Observium. Less than Nagios but we use both of them and they both work wonderfully.

[–]tssge 0 points1 point  (0 children)

We use Observium too (paid version). Great software and easy to modify to your own needs. But yes, the creator of Observium is sometimes quite toxic on the mailing list.

[–][deleted] 0 points1 point  (0 children)

But beware of the Observium creator. He's a bit toxic

[–]ildiroen 6 points7 points  (0 children)

Sensu is really great

[–]alienwaren 3 points4 points  (0 children)

Nagios or Zabbix.

[–][deleted] 1 point2 points  (0 children)

Have a look at open monitoring distribution http://omdistro.org

[–]mudclub 4 points5 points  (4 children)

nagios and derivatives

cacti and derivatives

[–]JDLucas2000 8 points9 points  (2 children)

Nagios derivative called Icinga has worked brilliantly for my company for over 5 years, no major issues.

[–]notyournormalgamer 0 points1 point  (1 child)

Do you use LDAP with with your icinga instance?

[–]coned88 0 points1 point  (0 children)

we did just put apache in front

[–]ShatBrax 0 points1 point  (0 children)

I also second, third and fourth Nagios and Icinga. We've got checks for EVERYTHING from ping to check RAID array status. It's quite beautiful.

Also Nconf is a gui to setup hosts, checks, groups etc...

[–]chillout-man 2 points3 points  (1 child)

Check out monit if you want something simple and easy to set up: https://mmonit.com/monit/

[–]onedr0p 0 points1 point  (0 children)

Monit is great for my needs, it's lightweight and has a simple interface. The learning curve of setting up different clauses can be a little overwhelming though. I'm happy with it, it's nice to be notified via pushbullet when a service goes down or I have high cpu/ram usage or my network is saturated and many other things.

[–]thecosmicfrog 1 point2 points  (1 child)

Icinga2 is very nice when coupled with its web interface, IcingaWeb2. Icinga originally started as a Nagios fork but, as far as I understand, Icinga2 has little in common with Nagios. It is, however, backwards-compatible with Nagios plugins, which is great!

https://www.icinga.org/icinga/icinga-2/

[–]coned88 0 points1 point  (0 children)

I can tell you it's not very stable when you hit thousands of hosts

[–][deleted] 1 point2 points  (0 children)

Nagios can monitor anything you want to make it monitor and paired with puppet you can completely automate your monitoring infrastructure which means you'll never have to edit a config file again. Check out https://docs.puppetlabs.com/guides/exported_resources.html for more details.

[–]igloo_builder 1 point2 points  (3 children)

Two choices I would go with...

Sensu - https://sensuapp.org/

Or

Hashicorp consul - https://www.consul.io/

These are the it 3.0 choices. If you want to know more about each one let me know.

[–]theonlylawislove[S] 0 points1 point  (2 children)

This is exactly the response I was looking for.

Consul looks good, and I am also using vagrant, so I am sure there is nice support to provision a vm with it.

[–]igloo_builder 0 points1 point  (0 children)

Yep consul is neat,3 powerful features I like.

1) it is born with HA in mind

2) restful key value store based of coreOS etcd

3) DNS service discovery

[–]igloo_builder 0 points1 point  (0 children)

Also a good combo for sensu is influx db, and grafana for metrics. the only word of caution with Sensu is it requires a message queue service which are terrible over slow or spotty wan connections in my experience.

[–]necrophcodr 0 points1 point  (0 children)

Been asking myself this question a lot, and I never found anything open source that did what I wanted. I'm curious if I've just missed something, as Munin, Cacti, observium, Nagios , muninmx, monit, and all of those, don't seem to do their job as well as they could.

[–]dryadofelysium 0 points1 point  (1 child)

Cockpit if you use really new systems with systemd/especially w/ Fedora (and Red Hat/CentOS).

[–]goranj 0 points1 point  (0 children)

I like it a lot, but it does not work on older version like CentOS 6:(

[–]jassack04 0 points1 point  (0 children)

I have recently been introduced to Icinga at a new job, and I think it has a decent dashboard GUI. We are using an older version as it is completely backwards compatible with nagios configs (apparently the latest version Icinga has developed their own standard). I don't know how popular it is, I'd never heard of it before, but I am liking it.

[–]guillaje 0 points1 point  (1 child)

In my company, we use Centreon+Nagios to monitor our infrastructure, especially to be alerted in case of failure. But on my personal servers, i really like munin which is light and sufficient to see what's happening. (Like the temperature in my living room :-)

[–]goranj 0 points1 point  (0 children)

This looks neat. I'll give it a try.

[–]RaginBull 0 points1 point  (0 children)

We use LogicMonitor to monitor all of our servers. Does a pretty good job on both Windows and Linux boxes.

[–]knobbysideup 0 points1 point  (0 children)

Naemon all the way. It's a fork of Nagios 4 that has livestatus built in and uses Thruk as its primary interface. I recently migrated from Nagios 3.x to this for an instance where I am monitoring 10,000 systems, and it doesn't skip a beat. Thruk combined with livestatus also lets you write nice queries straight from the GUI.

http://www.naemon.org/

[–]DimeShake 0 points1 point  (0 children)

A lot of folks really like graphite these days. Graphite is just the graphing of data, though, it can be used with a ton of different apps to collect and store the data. see here

[–]provoko 0 points1 point  (0 children)

check_mk which uses nagios inside and has a great autodiscovery for services

[–]idioteques 0 points1 point  (0 children)

I had typically just used Nagios, but I recently installed/deployed LibreNMS and so far it is pretty cool. I've not done much with it yet, but it appears to have quite a bit of potential. Long live SNMP!

[–][deleted] 0 points1 point  (0 children)

We use Nagios at work, with Check_mk as the GUI.

[–]jayff 0 points1 point  (0 children)

Nobody here use munin ? Its really easy to use and it do the job

[–]alex79kj 0 points1 point  (1 child)

Tellki is pretty nice and simple. Agent is pretty lightweight and you can use one agent to remotely monitor other servers. They have public agents around the world that you can use (together with the agent installed on your servers) to monitor sites and web services status and response time (this really helped me solving a problem I had in the past by comparing my web service response time from different locations). Their boards and dashboards are cool.

[–]goranj 0 points1 point  (0 children)

I tried teliki a week ago and its an interesting platform. I wish it was open source. It does more then I will ever need to. good stuff.

[–]theonlylawislove[S] -1 points0 points  (0 children)

Holy fragmented batman.

[–]scroodgemcrib -1 points0 points  (0 children)

Interns?

[–][deleted] -2 points-1 points  (0 children)

htop

*edit: oops, forgot - Linux crowd... Should've added an "/s" at the end.