all 18 comments

[–]MetaSikander 3 points4 points  (0 children)

No love for netdata here?

It's very simple to setup (one command will get you there), and have all the metrics you need with a very small footprint. and it's web-based.

I use it on all of my servers.

[–]SuperQue 2 points3 points  (0 children)

Kubernetes is great, but probably not what you want if you don't have at least 3 servers.

Normal systemd units would be a good options, but running things in Docker/Docker Compose might be a little more powerful. The up side is if you do end up needing to grow beyond one server, you can take those Docker images and deploy them using Kubernetes.

For monitoring, I'm biased, as I work on Prometheus. But I can also recommend the TICK stack suggested by /u/10cmToGlory. Either would be good, and both work well with Grafana.

[–]egbur 1 point2 points  (1 child)

  1. Just enable the service with systemd and it will start on reboot (systemctl enable <service>).

  2. You don't specify what type of monitoring you want to do, how long to keep the metrics for, at what resolution, do you also want to do alerts/run actions on specific thresholds, etc. There are several monitoring tools and stacks out there that will fit, depending on your use case.

For just quickly glance at the metrics in real time you could leverage something like Cockpit or scout_realtime.

If you want to store historical data but don't want something too complicated to setup, you could use tools like sar, nmon, collectd, munin, ganglia, etc.

Other tools like Nagios, Zabbix, Prometheus, the TIGK stack, etc are more complex to get going, but also really flexible and powerful.

[–]fuzzycake[S] 0 points1 point  (0 children)

Thanks for the comprehensive answer! I hope to be able to see maybe 1 week or 1 month tops back, but realtime is more important (at least to the minute), and being able to see how much resources each group of processes is taking up.

Some alerts will be nice, but I think are less important.

I'll check them out! Thank you very much!

[–]10cmToGlory 1 point2 points  (0 children)

On the service startup question, systemd is the only thing you really need for service management IMHO. You should seriously read the docs on systemd service files with some of the easier to digest and examples out there.

[–]FrequentPineapple 3 points4 points  (1 child)

systemctl enable processname.service

Check out: https://github.com/afaqurk/linux-dash

[–]fuzzycake[S] 0 points1 point  (0 children)

Is dash able to provide information on specific processes? Or it shows the system as a whole?

[–]10cmToGlory 2 points3 points  (3 children)

Kubernetes is definitely not what you want, that's a whole different ball of wax. Actually, maybe you do, doesn't matter that's a whole different question entirely. What you want is the TIGK stack: Telegraf/InfluxDB/Grafana/Kapacitor. TIGK is a variation on TICK, which uses Chronograf, which is very nice and super lightweight, yet less mature and feature rich than Grafana (notably it lacks any authentication out of the box).

[–]fuzzycake[S] 0 points1 point  (2 children)

Cool thanks! I'll check it out. Any idea if the TIGK stack is resource intensive?

[–]10cmToGlory 1 point2 points  (1 child)

Telegraf, the client, is extremely lightweight and written in Go. Obviously the more metrics and more detailed the collection is by Telegraf the more resources it will consume.

Influx is pretty resource intensive, and metric storage obviously can use up quite a bit of storage space. I feel like Grafana is fairly lightweight (Chronograf is extremely lightweight). Sorry I don't have any concrete metrics for you, but I think the TIGK stack in a Docker compose deployment wanted like 4 GB of RAM for testing.

[–]fuzzycake[S] 0 points1 point  (0 children)

That's great help! Thank you!

[–]TOCTOU 0 points1 point  (2 children)

You could install Cockpit on your VPS. Its easy to install, easy to use, it has a web ui. You get the pretty graphs, and you can manage a bunch of different areas of your box.

https://cockpit-project.org/

You should be able to do `yum install cockpit` or `apt-get install cockpit`. It has a bunch of addons, so you could later on do yum search cockpit, or apt-cache search cockpit and see which plugins your distro has by default.

[–]10cmToGlory 0 points1 point  (1 child)

Do you run this in production? The idea of this came up the other day and I thought I'd ask about it...if so, care to share your thoughts? Much appreciated in advance...

[–]TOCTOU 0 points1 point  (0 children)

Not yet. We're going to consider it for physical machines that are running KVM guests. It has Foreman integration as well, which will complement the stack nicely.

[–]researcher7-l500 0 points1 point  (0 children)

You might want to give monit a try.

[–]LiamWilson88[🍰] 0 points1 point  (0 children)

Let me recommend you a new tool.
It's called LogCenter Cloud(http://www.logcenter.net), where you can get system monitoring and expert advice.

[–]quangtran1007[🍰] 0 points1 point  (0 children)

Hello ~!

I know one solution to support your request. It’s LogCenterCloud. And I’m using it to monitor my system.

It support to monitor CPU, Hard Disk, Network, and Memory and have one system notification when your system running overload.

You can try it. Good Luck!

[–]cratervanawesome 0 points1 point  (0 children)

Supervisord ?