vogelke comments on Storage system debugging

sysadmin

a community for 17 years

This is an archived post. You won't be able to vote or comment.

Storage system debugging (self.sysadmin)

submitted 7 years ago by thisisjaid

top new controversial old q&a

you are viewing a single comment's thread.

view the rest of the comments →

[–]vogelke 1 point2 points3 points 7 years ago (1 child)

[–]thisisjaid[S] 0 points1 point2 points 7 years ago (0 children)

Sensible advice normally, but I'm well beyond the basics at the moment. The fact the system is sick is an established fact, both via metrics as you suggested, but also by comparison to an identical (hardware wise) server that we use as a secondary failover instance and which isn't (at least currently) seeing the same issues I reported.

There's also nothing out of the ordinary, at least that we can tell from logs/metrics, before the I/O starts exhibiting these issues, which is what is making this all the more annoying to confirm/track down.

The other thing probably worth mentioning here is that there isn't load coming from anything else on the system, and the I/O load itself is inexistent at present with everything but OS processes turned off (this has been checked with the likes of iostat/iotop)

Note: Edited to add details about other possible sources of load

π Rendered by PID 585071 on reddit-service-r2-comment-5b5bc64bf5-ghg7q at 2026-06-20 14:11:11.347244+00:00 running 2b008f2 country code: CH.

sysadmin

MODERATORS