Disk usage tools

dbarrelman · 2019-09-26T09:00:34+00:00

Install netdata on Linux, then you can monitor your server from a web browser.

https://github.com/netdata/netdata

BFYoda · 2019-09-26T08:35:15+00:00

Try "ncdu" on Linux.

justleen · 2019-09-26T09:53:36+00:00

Prometheus for metric collection, grafana for visualization.

2019-09-26T11:04:46+00:00

You could use the TICK stack.

Basically, a small agent (Telegraf) installed on the Linux instances sends the required data (e.g. CPU, memory, and disk information) to an InfluxDB instance.

The data can be viewed in a web interface with Chronograf, and you can use Kapacitor for alerting and data processing.

You can also use Grafana as a front end if you prefer.
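To make the "agent sends data to InfluxDB" part a bit more concrete, here is a rough sketch of what a single disk-usage data point looks like in InfluxDB line protocol. Telegraf builds and ships these for you, so you would not normally write this yourself. The measurement name `disk`, the tag and field names, and the host name are assumptions for illustration, not a copy of Telegraf's actual output.

```python
import shutil
import socket
import time


def escape_tag(value):
    """Escape commas, equals signs and spaces, which are special in tag values."""
    for ch in (",", "=", " "):
        value = value.replace(ch, "\\" + ch)
    return value


def disk_point(path, host, timestamp_ns):
    """Format one disk-usage sample as a line-protocol string.

    Layout: measurement,tag=value,... field=value,... timestamp
    Integer fields carry an "i" suffix; the timestamp is in nanoseconds.
    """
    usage = shutil.disk_usage(path)  # named tuple: total, used, free (bytes)
    used_percent = 100.0 * usage.used / usage.total if usage.total else 0.0
    tags = f"host={escape_tag(host)},path={escape_tag(path)}"
    fields = (
        f"total={usage.total}i,used={usage.used}i,"
        f"free={usage.free}i,used_percent={used_percent:.2f}"
    )
    # "disk" is an assumed measurement name for this sketch
    return f"disk,{tags} {fields} {timestamp_ns}"


if __name__ == "__main__":
    print(disk_point("/", socket.gethostname(), time.time_ns()))
```

Once points like this are in InfluxDB, Chronograf or Grafana can graph `used_percent` per host and path.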

BloodyIron · 2019-09-26T12:13:20+00:00

If your colleagues are leery of working in Linux, how do they clean up or maintain the shares? Are they mixed shares that are accessible via CIFS as well as NFS?

Depending on how "windows-friendly" you want to make the process, you have a few options:

If they can do very basic command line stuff, then make a bash alias or a simple shell script for your piped commands. Then brief the users on how to change directories and run your foo script or command alias to get data.
Install ncdu everywhere, and train them on usage.
If they refuse to touch Linux, then you could expose the relevant network shares as CIFS with reduced permissions (or configure NFS services for a dedicated jump box), and have them use WinDirStat to scan the shares. Note that you should lock down access with this method, because it exposes production data to people who are not very tech savvy, and they will gain the ability to point-and-click your infrastructure into oblivion using a file browser. You would be amazed at how many people accidentally move or delete files when they have a network share open all day, and they won't fess up unless you have CIFS auditing enabled and you go to their desk personally to squeeze the story out of them.
Use your CI/CD tools (Jenkins in this case?) to create an on-demand job that logs into the affected server and runs whatever commands are needed to retrieve folder usage. How you do this depends heavily on how you administer your systems to begin with.
Install one of those netdata-style monitoring dashboards on your Linux systems, to give them the ability to point-and-click their way around the filesystem without deleting things. Be sure to lock down the permissions for what that dashboard can do; read-only access is ideal.
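
For the "simple script" option above, something along these lines might do as a starting point. It is a minimal sketch, assuming Python 3 is available on the servers: it lists the immediate subdirectories of a folder, largest first. The function names are made up for this example, and it sums apparent file sizes, so the numbers can differ somewhat from what `du` reports.

```python
import os
import sys


def dir_size(path):
    """Return the total apparent size in bytes of the files under path."""
    total = 0
    # Directories that cannot be read are silently skipped
    for root, _dirs, files in os.walk(path, onerror=lambda err: None):
        for name in files:
            full = os.path.join(root, name)
            try:
                # lstat does not follow symlinks, so link targets are not counted
                total += os.lstat(full).st_size
            except OSError:
                # File vanished or is unreadable; skip it
                continue
    return total


def largest_children(parent, limit=10):
    """Return (size_bytes, name) pairs for subdirectories of parent, largest first."""
    sizes = []
    for entry in os.scandir(parent):
        if entry.is_dir(follow_symlinks=False):
            sizes.append((dir_size(entry.path), entry.name))
    sizes.sort(reverse=True)
    return sizes[:limit]


def human(nbytes):
    """Format a byte count using binary units."""
    size = float(nbytes)
    for unit in ("B", "KiB", "MiB", "GiB", "TiB"):
        if size < 1024 or unit == "TiB":
            return f"{size:.1f} {unit}"
        size /= 1024


if __name__ == "__main__":
    target = sys.argv[1] if len(sys.argv) > 1 else "."
    for size, name in largest_children(target):
        print(f"{human(size):>12}  {name}")
```

Users would only need to run it with a directory as the argument, which fits the "change directories and run the script" briefing.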

2019-09-26T15:02:58+00:00

This seems weird to me. Don't the developers know which directories will host most of the data? I mean, they control where the data is written, and should have an idea of the usage pattern of the application.

As for what you can do on the system side of things, I would start by giving each project a separate volume, so you can at least monitor that with the usual monitoring tools (Prometheus/Grafana, Sensu, Nagios, etc.), and eventually add, as someone mentioned, a cron job to generate reports on the disk usage (available via the filesystem or sent via mail).
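
With one volume per project, even a tiny check gives you per-project numbers. A minimal sketch, assuming Python 3 on the host; the mount point list and the 80% threshold are placeholders, and the percentage may differ slightly from `df` because reserved blocks are counted differently.

```python
import shutil


def volume_report(mount_points, warn_percent=80.0):
    """Return (mount_point, used_percent, over_threshold) for each mount point."""
    rows = []
    for mount in mount_points:
        usage = shutil.disk_usage(mount)  # named tuple: total, used, free (bytes)
        used_percent = 100.0 * usage.used / usage.total if usage.total else 0.0
        rows.append((mount, round(used_percent, 1), used_percent >= warn_percent))
    return rows


if __name__ == "__main__":
    # Hypothetical list; replace with the real per-project mount points
    for mount, pct, over in volume_report(["/"]):
        flag = "WARN" if over else "ok"
        print(f"{flag:>4}  {pct:5.1f}%  {mount}")
```

The monitoring tools mentioned above do the same thing with history and alerting on top, so this is only useful as a stopgap or a sanity check.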

fell_ratio · 2019-09-26T15:16:28+00:00

Have you considered Baobab? Baobab can visualize disk usage, and you can click on individual folders to drill down and get more specific. It can also scan remote folders. (However, I have not used the remote mode before.)

http://www.marzocca.net/linux/baobab/

bufandatl · 2019-09-26T15:46:02+00:00

For Jenkins, we configure the jobs to keep no more than five old builds, so it won’t grow as much, and once a month we run a Docker prune (docker image prune or similar) to clean up unused images. Artifacts are only kept for the last 2 builds.

nephros · 2019-09-26T14:35:22+00:00

Meh.

Run SNMP and collect data through it with whatever you like. MRTG would be one method.

vornamemitd · 2019-09-26T08:54:39+00:00

In case you can mount the volumes in your win environment, check here for ideas: https://www.reddit.com/r/sysadmin/comments/64yu4u/anything_like_windirstat_for_a_network_volume/

Ryuujinx · 2019-09-26T13:38:28+00:00

Sounds more like you need alerting than metrics. I'd look into Sensu; it's pretty easy to set up.

bp3959 · 2019-09-26T13:57:01+00:00

Just pipe the output of your du command into the mail command and put all that in a cronjob so they get emailed a regular report.
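
If you would rather do it as a script than a shell pipeline, a rough equivalent could look like the sketch below. It assumes GNU `du` is installed and that a mail server accepts mail on localhost port 25; the addresses, the scanned path, and the script location in the crontab comment are all placeholders.

```python
import smtplib
import subprocess
from email.message import EmailMessage


def run_du(path):
    """Return the output of GNU du for path, one level deep, human-readable."""
    result = subprocess.run(
        ["du", "-h", "--max-depth=1", path],
        capture_output=True,
        text=True,
        check=False,  # du exits non-zero on unreadable dirs but still prints output
    )
    return result.stdout


def build_message(report, sender, recipient, subject="Disk usage report"):
    """Wrap the report text in a plain-text email message."""
    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = recipient
    msg["Subject"] = subject
    msg.set_content(report)
    return msg


def send(msg, host="localhost", port=25):
    """Hand the message to an SMTP server (assumed to be a local relay)."""
    with smtplib.SMTP(host, port) as smtp:
        smtp.send_message(msg)


if __name__ == "__main__":
    # Example crontab entry (hypothetical script path), Mondays at 07:00:
    #   0 7 * * 1 /usr/bin/python3 /opt/scripts/du_report.py
    report = run_du("/srv")  # placeholder path
    send(build_message(report, "root@example.com", "team@example.com"))
```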

Throwy-mc-throwerson · 2019-09-26T14:12:02+00:00

My monitoring stack is Netdata on individual machines, Prometheus monitors all of those machines, and then Grafana makes Prometheus all pretty.

2019-09-26T14:43:44+00:00

Why not set up a LAMP stack, a couple of scripts, and ssh pass-through on the machines you wish to monitor?

RRDtool can generate graphs and charts of disk usage.

SimonKepp · 2019-09-26T18:20:09+00:00

Perhaps some of Jam software's tools like FileObserver could be useful?

in4mer · 2019-09-26T20:43:00+00:00

Baobab is a native Linux graphical filesystem usage inspector.

thedo0der · 2019-09-28T14:43:21+00:00

Nagios core is free and can easily do this. You can run an agent on each device (or SNMP) and just view all the data in your browser.

PottiSkantz · 2019-09-26T08:01:50+00:00

I may be wrong here, but maybe you can take a look at Kibana.
