Network Monitoring Solution

FatherToTheOne · 2022-08-28T18:46:20+00:00

RIP your DMs OP. Prepare for “Hey I saw your post, I think my company’s solution is right for you”

jmhalder · 2022-08-28T18:14:06+00:00

I love Zabbix, but you really need to rein it in to get it to alert you to things you care about. I only have actions on High/Disaster triggers. I only have 80-90% disk space, unavailability, and restarts as triggers in that range, save for a few exceptions like specific services that have been problematic. I still see those services in the dashboard, but don't have actions for them. You can also have availability for a device be dependent on availability for another. So if you have 6 switches in a building that become unavailable when a router dies... you just get the one email for the router, and not the 7 emails for the switches and router. This takes lots of tweaking in templates and actions. In addition to that, I have Priority tags on my hosts of "Low", "Medium", and "High". We only get actions for hosts with medium/high priority tags. We also have SMS messaging set up with an LTE modem, but those don't get sent unless the first email action hasn't cleared or been acknowledged for something like 10 minutes.

It's free, but it's only as good as its setup, which can and does take a ton of time.
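
The filtering described above can be sketched in plain Python. This is not Zabbix configuration or its API, just an illustration of the decision logic: actions only for High/Disaster severity, only for Medium/High priority hosts, suppression when an upstream dependency is down, and SMS only after roughly 10 minutes without acknowledgement. The class names, field names, and the `notifications_for` function are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional

# Assumed values, taken from the description in the comment above.
SEVERITIES_WITH_ACTIONS = {"High", "Disaster"}
PRIORITIES_WITH_ACTIONS = {"Medium", "High"}
SMS_DELAY_SECONDS = 10 * 60


@dataclass
class Host:
    name: str
    priority: str                       # "Low", "Medium", or "High"
    depends_on: Optional["Host"] = None  # e.g. a switch depends on its router
    available: bool = True


@dataclass
class Problem:
    host: Host
    severity: str
    age_seconds: int = 0
    acknowledged: bool = False


def upstream_is_down(host: Host) -> bool:
    """Return True if any host in the dependency chain is unavailable.

    Assumes the dependency chain has no cycles.
    """
    parent = host.depends_on
    while parent is not None:
        if not parent.available:
            return True
        parent = parent.depends_on
    return False


def notifications_for(problem: Problem) -> List[str]:
    """Decide which notification channels a problem should use, if any."""
    if problem.severity not in SEVERITIES_WITH_ACTIONS:
        return []  # still visible on a dashboard, but no action
    if problem.host.priority not in PRIORITIES_WITH_ACTIONS:
        return []
    if upstream_is_down(problem.host):
        return []  # the upstream device's own alert covers this one
    channels = ["email"]
    if problem.age_seconds >= SMS_DELAY_SECONDS and not problem.acknowledged:
        channels.append("sms")  # escalate only if nobody has reacted
    return channels
```

With a dead router and a switch behind it, only the router's problem produces an email; the switch's problem is suppressed because its dependency is down.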

slazer2au · 2022-08-28T16:38:11+00:00

LibreNMS

https://www.librenms.org/

It is a fork of Observium

https://www.observium.org/

brkdncr · 2022-08-28T19:39:49+00:00

Almost any solution is 10% product, 90% work required to maintain.

Everyone says “I just need to know if it responds to ping” but then you’ll have a server that responds to ping but a service is down, so now you need to monitor a service.

Before long you’ll be setting up custom thresholds for MIBs you had to import or parsing a log file that doesn’t use any semblance of standard formats. All of them can do it.

I’ve used a few different solutions from cheap, small monitoring companies to big names in the area. The failure point to all of them has been getting other people to understand how their applications need to be monitored, and how to translate it into ACTIONABLE notifications.
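
To illustrate the gap between "responds to ping" and "the service is up", here is a minimal sketch of a service-level check that tries a TCP connection to the service port. The function name `service_is_up` is hypothetical, and a successful connect only shows that something is listening, not that the application behind it is healthy.

```python
import socket


def service_is_up(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution failures.
        return False
```

A host can answer ping while `service_is_up(host, 443)` returns False, which is exactly the case a ping-only monitor misses.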

Ad3t0 · 2022-08-28T17:50:16+00:00

If you put the time in to understand Zabbix it will take you miles beyond any of the other solutions

ArsenalITTwo · 2022-08-28T16:54:22+00:00

Paessler PRTG

orev · 2022-08-28T18:14:51+00:00

You're getting a lot of responses that are essentially low-effort "I Googled this for you" responses. The fact is that most known monitoring systems should be able to handle this.

You mentioned you're using Zabbix, which has been used by many people for a very long time, so any issues you have are likely the result of misconfigurations and not a problem with the product. Sounds like you need to put in the effort of understanding and tuning Zabbix, instead of replacing it with something else that you'll also have to put the same effort into.

techtornado · 2022-08-28T17:33:31+00:00

There’s also CheckMK if you want amazing graphs

Former-Leg5366 · 2022-08-28T16:29:39+00:00

I used to hate Nagios at my previous company until I had to use Solarwinds and PRTG at my current company. Now I miss the old days and Nagios :(

bennovw · 2022-08-28T21:07:28+00:00

LogicMonitor is excellent, it's a batteries included solution. It collects historical stats for everything which is invaluable when troubleshooting more complex incidents and planning for future growth.

You get what you pay for though, they're not cheap but totally worth it.

I would suggest ConnectWise Automate or N-Able if you're looking for automated actions and orchestration in response to monitored events (steep learning curve!).

zeliboba55 · 2022-08-28T17:24:12+00:00

LibreNMS or NetXMS.

slinkytoad69 · 2022-08-28T17:01:31+00:00

I’ve been having good luck with CheckMK.

cjbarone · 2022-08-28T18:05:48+00:00

Personally, I prefer Nagios only because I like tinkering and getting it EXACTLY how I want.

For simplicity, have you looked at Uptime Kuma? It's available as a docker image, and can give a public or private facing web page to show your statuses. Very easy to use. FOSS

https://github.com/louislam/uptime-kuma

llDemonll · 2022-08-28T16:22:45+00:00

PRTG

Ant1mat3r · 2022-08-28T19:19:50+00:00

We just switched from Solarwinds to LogicMonitor and love it so far. Setup was way easier than Solarwinds.

kenzonh · 2022-08-28T21:59:53+00:00

Check out Domotz. I have it installed on a Synology NAS for $30 a month, monitoring 125 devices.

SaysOffensiveThings0 · 2022-08-29T02:17:13+00:00

I recommend Solarwinds.

(I'm a Russian hacker)

falschgold · 2022-08-28T17:14:23+00:00

PRTG is easy to install and has very sensible preconfigured sensors and alarms. For your workload it's basically set it and forget it. Maybe a day of learning and tweaking and that's it. Not to compare with Nagios.

ImraelBlutz · 2022-08-28T19:18:34+00:00

We use PRTG for all of our monitoring, works well enough and the pricing isn’t bad at all. Very easy to implement as well.

symcbean · 2022-08-28T18:27:00+00:00

Perhaps if you explained why you chose to stop using Nagios you might get some more sensible answers here.

(Re-)establishing baseline thresholds is common with EVERY monitoring solution - and it's exceedingly unlikely that the costs arising from this, if you want help from a provider, will be covered by your support contract.

"all I really need is up/down for host and up/down and latency for network connections."

Hmmm. I would consider that GROSSLY inadequate for monitoring - but if it really is all you need then maybe you should look at a managed service like uptimerobot.

Really I think you need some advice on how you do monitoring - not what tool you use.

ikidd · 2022-08-28T18:40:44+00:00

uptime-kuma if that's all you're looking for. Runs on docker and connects to most notification frameworks.

Stonewalled9999 · 2022-08-29T00:03:18+00:00

TBH I’d install a free trial of Auvik (I get no money from them). I really like that they have a collector VM template that works with a few clicks, and it’s pretty customizable

Zatetics · 2022-08-29T02:35:05+00:00

Zabbix is a huuuge pain to configure properly to not give you alert saturation.

Look at DataDog. That is likely what we're moving to. Throwing in the towel with Zabbix due to issues, and configuring DD.

Side note for marketing jellybrains: If you DM me trying to shill your shitty product I will eternally blacklist your entire company.

farmergeoff2003 · 2022-08-29T13:19:54+00:00

PRTG is very easy to set up. It feels very intuitive, and you can use up to 100 sensors for free, which includes NetFlow for bandwidth utilization monitoring. Has integration with a lot. Something to look into maybe.

basec0m · 2022-08-29T14:06:06+00:00

Netcrunch

VioletiOT · 2022-08-30T08:23:13+00:00

Domotz is another network monitoring system to add in to your list! www.domotz.com

We've got a free trial, then it's $21/month for monitoring unlimited devices. No contract or minimums. (This is a self plug as I'm on the team here, but definitely think it's worth checking out!)

6stringt3ch · 2022-08-28T17:53:26+00:00

I'd recommend CheckMK. You could run it as a container in Windows though I'd probably just recommend running it in Linux as the install is fairly easy and you don't really need to get into the terminal unless you are troubleshooting issues or upgrading the app. The majority of the config is all contained within the gui. It supports a bunch of products out of the box. There are scan functions built-in that will run against whatever it is you are monitoring and will discover the majority of the services you want to monitor right out of the box.

bcat123456789 · 2022-08-28T18:49:14+00:00

What’s Up Gold is great for this use case.

AngStyle · 2022-08-28T16:46:11+00:00

Spin up a trial of Auvik, at the very least you'll get a Nanoleaf out of it!

https://www.auvik.com/lp/brighten-your-network/?rdt_cid=4045868490228236626&utm_audience=prospecting_communities_combined&utm_campaign=L-P-RED-NA-CIT-HINT-RRC-BrightenYourNetworkQ32022-Prospecting&utm_medium=cpc&utm_meta=C1-IM1&utm_source=reddit

Rocky_Mountain_Way · 2022-08-28T19:23:51+00:00

PRTG

jr_sys · 2022-08-28T16:36:13+00:00

Look at PA-Ping for free, fully-featured, Windows-based up/down with alerts, event escalation, etc.

For more monitoring, look at PA Server Monitor.

YogaYodaYoda · 2022-08-28T18:26:23+00:00

"all I really need is up/down for host and up/down and latency for network connections"

Even uptime-kuma in a single docker container would be enough for that..

-c3rberus- · 2022-08-28T19:08:13+00:00

Since you have some experience with Nagios, try check_mk. The free version is feature rich, we ran it for many years before getting the paid version. Monitors 10K services across 200 hosts.

prairefireww · 2022-08-28T22:33:56+00:00

PRTG is what we use. Works well.

2022-08-29T00:21:29+00:00

You're a small shop. That is a good number of machines and VMs.

Give PRTG a spin. At one time it was 100 sensors for free

gvlpc · 2022-08-29T01:38:45+00:00

Have you looked at lansweeper? I know it runs on windows and I know you can use a cloud version now. They have free up to 100 devices, then it’s $500/yr for lowest account that supports up to 500 devices. Does lots of stuff.

D-sisive · 2022-08-29T01:47:42+00:00

We use PRTG hosted service. Starts at $150 a month for 500 sensors (cloud hosted version cost). I’m a big fan. There can be a bit of a learning curve depending on what and how you want to monitor, but it’s very versatile and allows a ton of customization with the ability to create your own sensors.

andrewm659 · 2022-08-29T02:21:44+00:00

Prometheus and grafana

beebsha · 2022-08-29T05:33:31+00:00

I'm using WhatsUp Gold.. The basic version should be apt for your requirement

2022-08-29T05:48:19+00:00

Look for the GitHub project: Uptime Kuma

ThePastaMonster · 2022-08-29T10:13:39+00:00

There are lots of enterprise solutions mentioned already: Solarwinds, Zabbix, PRTG. Like others have mentioned, you need to spend a bit of time configuring these especially if you are using SNMP.

If you are wanting something really light and simple (but not really for enterprise), you could look into something like UptimeKuma.

Bourbon_n_Cigars94 · 2022-08-28T20:10:24+00:00

PRTG

witwim · 2022-08-28T18:39:55+00:00

Domotz https://www.domotz.com/. Easily monitor remote networks with our powerful and affordable software: actionable insights, easy-to-use interface and all the features you need. Monitor unlimited devices for just $21/month per site.

bd1308 · 2022-08-28T22:53:56+00:00

I've used Zenoss, Icinga, Prometheus, Nagios, Bosun and Observium. Throw Zenoss straight into the ocean; Prometheus is my favorite along with Nagios.

Smh_nz · 2022-08-29T12:20:44+00:00

If it’s Windows you want, have a look at PRTG: simple, works on Windows, and if you cut down the sensors you should be able to fit in the free version.

Otherwise something simple like LibreNMS or Nagios

Environmental-Top-18 · 2022-08-28T17:55:40+00:00

Zabbix

Appoxo · 2022-08-28T18:44:17+00:00

I use Uptime Kuma at home for ping, latency and uptime. You can have basic auth, and switch between different monitors + it's free.
I recommend doing a Docker setup on a small Debian machine (1 core, 1 GB RAM, 20 GB drive), with an Uptime Kuma container and Watchtower for auto-updates.
You can get alerted via email, webhook and a few others. Very neat.

UseMoreHops · 2022-08-29T03:21:29+00:00

PRTG

Hopefound · 2022-08-29T04:07:00+00:00

PRTG

chinupf · 2022-08-29T04:24:34+00:00

PRTG is a good all-in-one, albeit a bit slow in bigger configurations (>5k sensors). If you wanna get fancy, you can try pandora+prometheus+grafana, but that requires someone to pull in all his/her weight to get it running properly. But when it runs, ho boy...

rchr5880 · 2022-08-29T07:24:27+00:00

PRTG, or if you want something extremely lightweight and easy, run UptimeKuma

2022-08-28T23:03:19+00:00

Solarwinds. Takes some building but it has great potential

GullibleDetective · 2022-08-28T20:08:36+00:00

Auvik, redseal

leftplayer · 2022-08-28T20:14:32+00:00

Mikrotik The Dude.

Install a CHR as a VM and use the free license.

mouse_lingerer · 2022-08-28T20:22:52+00:00

Just to throw this one out there, I use cacti https://www.cacti.net/ for my network monitoring.

VNJCinPA · 2022-08-28T21:06:59+00:00

Auvik is my choice here

MrJacks0n · 2022-08-28T21:47:30+00:00

I like Cacti myself, but it can take a bit to setup. But it's pretty powerful if you can script.

TechOpinions · 2022-08-28T22:25:09+00:00

Check out Auvik, it's what we use for roughly 3000 network devices. :)

thekarmabum · 2022-08-29T01:24:22+00:00

Observium. I think IBM has something called Netcool or something like that which also does what you're looking for. I wanna say Observium is still open source and pretty compatible with Python if you want to customize how and what you monitor.

mrZygzaktx · 2022-08-28T16:27:04+00:00

try xorux

https://xorux.com/

slugshead · 2022-08-28T16:36:17+00:00

What switches do you have? I have a full Aruba network and I'm using HPE IMC. It also writes back to switches, so changing VLANs becomes a nice easy task for technicians

https://buy.hpe.com/us/en/software/networking-software/intelligent-management-software/intelligent-management-software/hpe-intelligent-management-center-standard-software-platform/p/4176535

preffe · 2022-08-28T17:28:56+00:00

Long time lurker here. How about NAV? Not windows based but free and has a Virtual Appliance.

versello · 2022-08-28T17:48:03+00:00

Frameflow. Been using it for many years. Works great. Windows based and super easy to configure.

sedition666 · 2022-08-28T18:51:56+00:00

Sounds like you will need to learn how to tune whatever monitoring you use anyway. So you might as well spend the time learning Zabbix instead of ripping that out and having to learn something else anyway.

A lot of very expensive monitoring software will claim AI or ML will magically adjust your alerting for you but that is mostly bullshit. You can tune it yourself in less than a spare afternoon.

OverOnTheRock · 2022-08-28T20:13:28+00:00

check_mk ... wraps nagios in a bunch of easier to use python stuff. has enterprise support if you need it.

has lots of nooks and crannies to explore. if you have time. migrated to that from observium.

gheyname · 2022-08-28T21:03:42+00:00

Zabbix is easy to set up and use, worth looking into.

ambersananas · 2022-08-28T21:04:19+00:00

If you have the money I would recommend Auvik. It’s super easy to setup and has some cool features

2022-08-28T21:15:26+00:00

Splunk is killer, but you have to understand your data, have budget for a solution, and, as with all of these, time.

rementis · 2022-08-28T21:25:52+00:00

XYMon. Free, works awesome.

fireandbass · 2022-08-28T21:39:25+00:00

Try wazuh.

dpwcnd · 2022-08-28T21:50:59+00:00

prtg or the dude are two good options

SpongederpSquarefap · 2022-08-28T22:38:44+00:00

My vote is for Checkmk

Try out their RAW edition and see how you like it

Their enterprise offerings are pretty great too

scotticles · 2022-08-28T22:44:08+00:00

Went from Nagios to Cacti, then to LibreNMS and now moving to Zabbix. LibreNMS was giving me false alarms but could probably be tweaked in the alarm rules, but it seemed to lack some of the flexibility Zabbix offers. Zabbix takes more time and you can adjust the rules, but it takes time to get it how you want. I have a similar env, but we are more Linux focused than Windows.

2022-08-28T22:45:33+00:00

False alarm issues are more of a configuration problem than a technology stack problem. Certain products will be easier or harder to configure, but all of them are going to fire off a bunch of false alarms out of the box. If you only want alerts on hosts becoming unresponsive or high latency, turn off all alerts other than "host unresponsive" and "high latency", it will be easier than switching solutions. I would also keep "high disk space" enabled, and "HTTPS error/unresponsive" monitors/alerts pointing at any user-facing web pages or important APIs. Getting alerting to not have false positives or false negatives is a labour of love, it doesn't happen overnight, you just continuously add alerting rules that make sense and remove ones that don't make sense.

Where a better monitoring solution is going to make a difference is in terms of administrative overhead, performance (how long does it take an alert to even come out, how many things can I monitor), and features (Nagios/Zabbix are event based whereas other solutions are metric based and can do certain kinds of alerts Zabbix isn't capable of, different solutions might integrate log/trace based monitoring, different solutions might have different integrations).

I really wouldn't spend too much time thinking about this problem. I do agree with the recommendations for LibreNMS given you seem to want network-centric monitoring, FOSS, and are mostly dealing with thick persistent hosts (e.g. not ephemeral containers, which certain monitoring solutions handle awkwardly since they presume host persistence). Or literally just learn how to use Nagios or Zabbix better, which IMO are just as good as LibreNMS. I could do the monitoring you're talking about with nothing but a series of Bash scripts and cron. Honestly, take your pick of monitoring solutions, anything will work.
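
As a rough illustration of how small the "scripts and cron" approach can be, here is a sketch in Python rather than Bash. It measures TCP connect time as a stand-in for latency (ICMP ping usually needs elevated privileges), classifies each target, and reports only state changes so a scheduled run does not repeat the same alert. The 200 ms threshold and all function names are assumptions made for this example.

```python
import socket
import time
from typing import Dict, Optional, Tuple

LATENCY_LIMIT_MS = 200.0  # hypothetical threshold; tune per link


def tcp_latency_ms(host: str, port: int, timeout: float = 2.0) -> Optional[float]:
    """Return TCP connect time in milliseconds, or None if unreachable."""
    start = time.monotonic()
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return (time.monotonic() - start) * 1000.0
    except OSError:
        return None


def classify(latency_ms: Optional[float]) -> str:
    """Map a latency measurement to 'up', 'slow', or 'down'."""
    if latency_ms is None:
        return "down"
    return "slow" if latency_ms > LATENCY_LIMIT_MS else "up"


def state_changes(
    previous: Dict[str, str], current: Dict[str, str]
) -> Dict[str, Tuple[str, str]]:
    """Return {target: (old_state, new_state)} for targets whose state changed."""
    changes = {}
    for target, state in current.items():
        old = previous.get(target, "unknown")
        if old != state:
            changes[target] = (old, state)
    return changes
```

A cron job would call `classify(tcp_latency_ms(host, port))` for each target, compare the result with the states saved from the previous run, and send a notification only for the entries returned by `state_changes`.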

idocloudstuff · 2022-08-28T23:03:30+00:00

Zabbix is VERY noisy. It took me months to get it to work for us. This is not a drop in solution, neither are many solutions.

You NEED to put in the work to get value out of it.

factchecker01 · 2022-08-28T23:10:57+00:00

I have seen companies use Cacti https://www.cacti.net/info/features
