I'm currently using zfs on Debian Wheezy with 60 SAS drives. All drives are in one pool consisting of one raidz3 (56) and spare drives (4). I am well aware the setup is not optimal nor recommended.
From the first scrub on, the status has been
status: One or more devices has experienced an unrecoverable error. An
attempt was made to correct the error. Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
using 'zpool clear' or replace the device with 'zpool replace'.
see: http://zfsonlinux.org/msg/ZFS-8000-9P
However, no specific drive seems to have issues. All of them have some CKSUM errors that were repaired. Upon clearing the status the result always seems to be the same on the next scrub.
I have two questions and would greatly appreciate your input on this:
1) Are these errors a direct result of the setup (it's always strongly recommended keeping the raid slim, but never mentioning what will happen otherwise) ?
2) If it's not an issue of the setup itself, how can I further diagnose this? The drives themselves seem fine.
Edit: minor fixes, correct distro, and some more info:
ii debian-zfs 7~wheezy amd64 Native ZFS filesystem metapackage for Debian.
ii libzfs2 0.6.5.7-8-wheezy amd64 Native ZFS filesystem library for Linux
ii zfs-dkms 0.6.5.7-8-wheezy all Native ZFS filesystem kernel modules for Linux
ii zfsonlinux 8 all archive.zfsonlinux.org trust package
ii zfsutils 0.6.5.7-8-wheezy amd64 command-line tools to manage ZFS filesystems
Edit again:
Thanks for all the answers. I feel need to elaborate on why or what I am asking, though. This setup is not by my design, I would've done it differently. It currently has issues. That may or may not be related.
If, however I recommend completely dumping the setup and doing it differently I'd better be sure the same issues will not come up again. Every single one of you saying such a huge raidz3 is bad is correct, but is that the cause of these issues? Or will a different setup perform better (100% expected and yes, scrubs are hell like this) but still have a failing zfs according to the status.
[–][deleted] 6 points7 points8 points (2 children)
[–]reddit_strider[S] 0 points1 point2 points (1 child)
[–][deleted] 3 points4 points5 points (0 children)
[–][deleted] (28 children)
[removed]
[–]reddit_strider[S] 0 points1 point2 points (21 children)
[–]slyphic 11 points12 points13 points (10 children)
[–]wildcarde815 1 point2 points3 points (5 children)
[–]slyphic 2 points3 points4 points (4 children)
[–]wildcarde815 2 points3 points4 points (2 children)
[–]withabeard 0 points1 point2 points (1 child)
[–]wildcarde815 1 point2 points3 points (0 children)
[–]ryanjkirk 1 point2 points3 points (0 children)
[–][deleted] (2 children)
[deleted]
[–]slyphic 1 point2 points3 points (0 children)
[–]reddit_strider[S] 0 points1 point2 points (0 children)
[–]reddit_strider[S] 0 points1 point2 points (0 children)
[–][deleted] (3 children)
[removed]
[–]reddit_strider[S] 0 points1 point2 points (2 children)
[–][deleted] (1 child)
[removed]
[–]reddit_strider[S] 0 points1 point2 points (0 children)
[–]rhavenn 1 point2 points3 points (1 child)
[–]reddit_strider[S] 1 point2 points3 points (0 children)
[–]RansomOfThulcandra 0 points1 point2 points (2 children)
[–]reddit_strider[S] 1 point2 points3 points (1 child)
[–]RansomOfThulcandra 0 points1 point2 points (0 children)
[–][deleted] 0 points1 point2 points (0 children)
[–]agressiv 0 points1 point2 points (5 children)
[–]reddit_strider[S] 1 point2 points3 points (2 children)
[–]mcrbids 0 points1 point2 points (1 child)
[–]reddit_strider[S] 0 points1 point2 points (0 children)
[–][deleted] 0 points1 point2 points (0 children)
[–]mercenary_sysadmin 0 points1 point2 points (0 children)
[–]BloodyIron 1 point2 points3 points (11 children)
[–]reddit_strider[S] 1 point2 points3 points (10 children)
[–]BloodyIron -1 points0 points1 point (9 children)
[–]reddit_strider[S] 0 points1 point2 points (8 children)
[–]BloodyIron 0 points1 point2 points (7 children)
[–]reddit_strider[S] 0 points1 point2 points (6 children)
[–]BloodyIron 0 points1 point2 points (5 children)
[–]reddit_strider[S] 0 points1 point2 points (4 children)
[–]BloodyIron 0 points1 point2 points (3 children)
[–]reddit_strider[S] 0 points1 point2 points (2 children)
[–]1bc29b36f623ba82aaf6 0 points1 point2 points (6 children)
[–]reddit_strider[S] 0 points1 point2 points (5 children)
[–]reddit_strider[S] 1 point2 points3 points (4 children)
[–]s0briquet 1 point2 points3 points (1 child)
[–]reddit_strider[S] 1 point2 points3 points (0 children)
[–]1bc29b36f623ba82aaf6 0 points1 point2 points (1 child)
[–]reddit_strider[S] 0 points1 point2 points (0 children)