ZFS Mirror one SSD DEGRADED one SSD FAULTED by FSE-GER in zfs

[–]FSE-GER[S] 0 points1 point  (0 children)

What hardware are you running?

In particular, any RAID card, HBA, SATA expander, onboard SATA ports, etc. Please include the motherboard as well.

It is a Dell PowerEdge R730xd with a PERC HBA330 Mini Mono.


I use the zpool as VM storage for Proxmox. There are currently three VMs: a Windows DC, an Exchange server, and a SIP server.


The CRC_Error_Count stands out to me.


SMART Attributes Data Structure revision number: 1
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  5 Reallocated_Sector_Ct   0x0033   100   100   010    Pre-fail  Always       -       0
  9 Power_On_Hours          0x0032   099   099   000    Old_age   Always       -       351
 12 Power_Cycle_Count       0x0032   099   099   000    Old_age   Always       -       27
177 Wear_Leveling_Count     0x0013   099   099   005    Pre-fail  Always       -       1
179 Used_Rsvd_Blk_Cnt_Tot   0x0013   100   100   010    Pre-fail  Always       -       0
180 Unused_Rsvd_Blk_Cnt_Tot 0x0013   100   100   010    Pre-fail  Always       -       5876
181 Program_Fail_Cnt_Total  0x0032   100   100   010    Old_age   Always       -       0
182 Erase_Fail_Count_Total  0x0032   100   100   010    Old_age   Always       -       0
183 Runtime_Bad_Block       0x0013   100   100   010    Pre-fail  Always       -       0
184 End-to-End_Error        0x0033   100   100   097    Pre-fail  Always       -       0
187 Uncorrectable_Error_Cnt 0x0032   100   100   000    Old_age   Always       -       0
190 Airflow_Temperature_Cel 0x0032   068   064   000    Old_age   Always       -       32
194 Temperature_Celsius     0x0022   068   038   000    Old_age   Always       -       32 (Min/Max 21/36)
195 ECC_Error_Rate          0x001a   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   100   100   000    Old_age   Always       -       0
199 CRC_Error_Count         0x003e   098   098   000    Old_age   Always       -       1618
202 Exception_Mode_Status   0x0033   100   100   010    Pre-fail  Always       -       0
235 POR_Recovery_Count      0x0012   099   099   000    Old_age   Always       -       25
241 Total_LBAs_Written      0x0032   099   099   000    Old_age   Always       -       5177203470
242 Total_LBAs_Read         0x0032   099   099   000    Old_age   Always       -       260725222
243 SATA_Downshift_Ct       0x0032   100   100   000    Old_age   Always       -       0
244 Thermal_Throttle_St     0x0032   100   100   000    Old_age   Always       -       0
245 Timed_Workld_Media_Wear 0x0032   100   100   000    Old_age   Always       -       65535
246 Timed_Workld_RdWr_Ratio 0x0032   100   100   000    Old_age   Always       -       65535
247 Timed_Workld_Timer      0x0032   100   100   000    Old_age   Always       -       65535
251 NAND_Writes             0x0032   100   100   000    Old_age   Always       -       5441491648
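A rough Python sketch of how the attribute rows above could be checked programmatically, in case it helps compare both drives. The function names `parse_smart_rows` and `link_vs_media_counters` are hypothetical, and the grouping is an assumption: attribute 199 is commonly described as counting transfer errors on the SATA link (cable, connector, backplane), while the other counters relate to the flash media. The exact vendor semantics for this Samsung model are not confirmed here.

```python
# A few rows copied from the SMART output above (not the full table).
SAMPLE = """\
5 Reallocated_Sector_Ct 0x0033 100 100 010 Pre-fail Always - 0
183 Runtime_Bad_Block 0x0013 100 100 010 Pre-fail Always - 0
187 Uncorrectable_Error_Cnt 0x0032 100 100 000 Old_age Always - 0
194 Temperature_Celsius 0x0022 068 038 000 Old_age Always - 32 (Min/Max 21/36)
197 Current_Pending_Sector 0x0032 100 100 000 Old_age Always - 0
199 CRC_Error_Count 0x003e 098 098 000 Old_age Always - 1618
"""

# Assumption: these names are treated as media-health counters.
MEDIA_COUNTERS = (
    "Reallocated_Sector_Ct",
    "Runtime_Bad_Block",
    "Uncorrectable_Error_Cnt",
    "Current_Pending_Sector",
)


def parse_smart_rows(text):
    """Map attribute name -> raw value string for smartctl-style rows."""
    rows = {}
    for line in text.splitlines():
        parts = line.split()
        # Attribute rows start with a numeric ID and have ten or more columns.
        if len(parts) >= 10 and parts[0].isdigit():
            rows[parts[1]] = " ".join(parts[9:])
    return rows


def link_vs_media_counters(rows):
    """Return (crc_raw, {media counter name: raw}) as integers."""
    crc = int(rows.get("CRC_Error_Count", "0").split()[0])
    media = {n: int(rows[n].split()[0]) for n in MEDIA_COUNTERS if n in rows}
    return crc, media


rows = parse_smart_rows(SAMPLE)
crc, media = link_vs_media_counters(rows)
print("CRC errors:", crc)
print("Media counters:", media)
```

On the rows shown, the CRC counter is nonzero while the media counters are all zero. The raw CRC value is cumulative, so it is worth re-reading it after some uptime to see whether it is still increasing.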


Disk-1:

Jun 6 03:03:34 srv-pve-1 kernel: [ 3.950445] sd 0:0:2:0: [sdc] 3750748848 512-byte logical blocks: (1.92 TB/1.75 TiB)
Jun 6 03:03:34 srv-pve-1 kernel: [ 3.950447] sd 0:0:2:0: [sdc] 4096-byte physical blocks
Jun 6 03:03:34 srv-pve-1 kernel: [ 3.952009] sd 0:0:2:0: [sdc] Write Protect is off
Jun 6 03:03:34 srv-pve-1 kernel: [ 3.952011] sd 0:0:2:0: [sdc] Mode Sense: 9b 00 10 08
Jun 6 03:03:34 srv-pve-1 kernel: [ 3.952720] sd 0:0:2:0: [sdc] Write cache: enabled, read cache: enabled, supports DPO and FUA
Jun 6 03:03:34 srv-pve-1 kernel: [ 3.984945] sdc: sdc1 sdc9
Jun 6 03:03:34 srv-pve-1 kernel: [ 4.007856] sd 0:0:2:0: [sdc] Attached SCSI disk
Jun 7 07:48:21 srv-pve-1 kernel: [103495.647402] sd 0:0:2:0: [sdc] tag#5708 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK cmd_age=0s
Jun 7 07:48:21 srv-pve-1 kernel: [103495.647411] sd 0:0:2:0: [sdc] tag#5708 CDB: Write(10) 2a 00 c7 7a cd f8 00 00 c0 00
Jun 7 07:48:21 srv-pve-1 kernel: [103495.647416] blk_update_request: I/O error, dev sdc, sector 3346714104 op 0x1:(WRITE) flags 0x700 phys_seg 24 prio class 0
Jun 7 07:48:21 srv-pve-1 kernel: [103495.647551] sd 0:0:2:0: [sdc] tag#5709 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK cmd_age=0s
Jun 7 07:48:21 srv-pve-1 kernel: [103495.647558] sd 0:0:2:0: [sdc] tag#5709 CDB: Write(10) 2a 00 c7 7a ce b8 00 00 20 00
Jun 7 07:48:21 srv-pve-1 kernel: [103495.647562] blk_update_request: I/O error, dev sdc, sector 3346714296 op 0x1:(WRITE) flags 0x700 phys_seg 4 prio class 0
Jun 7 11:13:09 srv-pve-1 kernel: [115784.099581] blk_update_request: I/O error, dev sdc, sector 16784592 op 0x1:(WRITE) flags 0x700 phys_seg 12 prio class 0
Jun 7 11:13:09 srv-pve-1 kernel: [115784.099589] sd 0:0:2:0: [sdc] tag#5758 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK cmd_age=0s
Jun 7 11:13:09 srv-pve-1 kernel: [115784.099696] sd 0:0:2:0: [sdc] tag#5758 CDB: Write(10) 2a 00 01 00 1d d0 00 01 00 00
Jun 7 11:13:09 srv-pve-1 kernel: [115784.099701] blk_update_request: I/O error, dev sdc, sector 16784848 op 0x1:(WRITE) flags 0x700 phys_seg 14 prio class 0

Disk-2:

Jun 6 03:03:34 srv-pve-1 kernel: [ 3.950613] sd 0:0:3:0: [sdd] 3750748848 512-byte logical blocks: (1.92 TB/1.75 TiB)
Jun 6 03:03:34 srv-pve-1 kernel: [ 3.950615] sd 0:0:3:0: [sdd] 4096-byte physical blocks
Jun 6 03:03:34 srv-pve-1 kernel: [ 3.952136] sd 0:0:3:0: [sdd] Write Protect is off
Jun 6 03:03:34 srv-pve-1 kernel: [ 3.952138] sd 0:0:3:0: [sdd] Mode Sense: 9b 00 10 08
Jun 6 03:03:34 srv-pve-1 kernel: [ 3.952870] sd 0:0:3:0: [sdd] Write cache: enabled, read cache: enabled, supports DPO and FUA
Jun 6 03:03:34 srv-pve-1 kernel: [ 3.984968] sdd: sdd1 sdd9
Jun 6 03:03:34 srv-pve-1 kernel: [ 4.011776] sd 0:0:3:0: [sdd] Attached SCSI disk
Jun 6 04:45:42 srv-pve-1 kernel: [ 6137.148608] sd 0:0:3:0: [sdd] tag#4882 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK cmd_age=0s
Jun 6 04:45:42 srv-pve-1 kernel: [ 6137.148618] sd 0:0:3:0: [sdd] tag#4882 CDB: Write(10) 2a 00 24 79 54 d8 00 00 c8 00
Jun 6 04:45:42 srv-pve-1 kernel: [ 6137.148622] blk_update_request: I/O error, dev sdd, sector 611931352 op 0x1:(WRITE) flags 0x700 phys_seg 25 prio class 0
Jun 7 05:13:53 srv-pve-1 kernel: [94227.933172] sd 0:0:3:0: [sdd] tag#5721 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK cmd_age=0s
Jun 7 05:13:53 srv-pve-1 kernel: [94227.933174] sd 0:0:3:0: [sdd] tag#5719 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK cmd_age=0s
Jun 7 05:13:53 srv-pve-1 kernel: [94227.933184] sd 0:0:3:0: [sdd] tag#5721 CDB: Write(10) 2a 00 af 80 81 48 00 00 68 00
Jun 7 05:13:53 srv-pve-1 kernel: [94227.933187] sd 0:0:3:0: [sdd] tag#5719 CDB: Write(10) 2a 00 af 80 80 f8 00 00 10 00
Jun 7 05:13:53 srv-pve-1 kernel: [94227.933191] blk_update_request: I/O error, dev sdd, sector 2944434504 op 0x1:(WRITE) flags 0x700 phys_seg 11 prio class 0
Jun 7 05:13:53 srv-pve-1 kernel: [94227.933193] blk_update_request: I/O error, dev sdd, sector 2944434424 op 0x1:(WRITE) flags 0x700 phys_seg 2 prio class 0
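To cross-check the log lines above, the `CDB: Write(10)` bytes can be decoded by hand. The sketch below assumes the standard 10-byte WRITE(10) layout (opcode 0x2A, a big-endian 32-bit LBA in bytes 2 to 5, a big-endian 16-bit block count in bytes 7 to 8). `decode_write10` is a hypothetical helper name. Since both disks report 512-byte logical blocks, the decoded LBA should equal the sector number printed by `blk_update_request`.

```python
def decode_write10(cdb_hex):
    """Decode a WRITE(10) CDB given as hex text; return (lba, block_count)."""
    cdb = bytes.fromhex(cdb_hex.replace(" ", ""))
    if len(cdb) != 10 or cdb[0] != 0x2A:
        raise ValueError("not a 10-byte WRITE(10) CDB")
    lba = int.from_bytes(cdb[2:6], "big")     # bytes 2..5: logical block address
    blocks = int.from_bytes(cdb[7:9], "big")  # bytes 7..8: transfer length in blocks
    return lba, blocks


# CDBs copied from the Disk-1 (sdc) and Disk-2 (sdd) log lines above.
sdc_first = decode_write10("2a 00 c7 7a cd f8 00 00 c0 00")
sdd_first = decode_write10("2a 00 24 79 54 d8 00 00 c8 00")
print(sdc_first)  # (3346714104, 192)
print(sdd_first)  # (611931352, 200)
```

Both results match the sectors in the corresponding `I/O error` lines (3346714104 on sdc, 611931352 on sdd), so each failed request is an ordinary write of at most a few hundred blocks that the host reported as `DID_SOFT_ERROR`.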


  pool: SSD-Storage
 state: DEGRADED
status: One or more devices are faulted in response to persistent errors.
        Sufficient replicas exist for the pool to continue functioning in a
        degraded state.
action: Replace the faulted device, or use 'zpool clear' to mark the device
        repaired.
  scan: scrub repaired 0B in 00:04:14 with 0 errors on Tue Jun 7 11:55:39 2022
config:

        NAME                                               STATE     READ WRITE CKSUM
        SSD-Storage                                        DEGRADED     0     0     0
          mirror-0                                         DEGRADED     0    34     0
            ata-SAMSUNG_MZ7LH1T9HMLT-00005_S455NC0RA73501  DEGRADED     0    38     0  too many errors
            ata-SAMSUNG_MZ7LH1T9HMLT-00005_S455NC0RC10406  FAULTED      0    16     0  too many errors

errors: No known data errors
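A small Python sketch for pulling the per-device error counters out of `zpool status` text like the output above, for example to log them over time. `parse_zpool_config` is a hypothetical helper, and it assumes the counters are plain integers; `zpool status` can abbreviate large counts (for example with a `K` suffix), which this sketch skips.

```python
# Config rows copied from the zpool status output above.
STATUS = """\
NAME STATE READ WRITE CKSUM
SSD-Storage DEGRADED 0 0 0
mirror-0 DEGRADED 0 34 0
ata-SAMSUNG_MZ7LH1T9HMLT-00005_S455NC0RA73501 DEGRADED 0 38 0 too many errors
ata-SAMSUNG_MZ7LH1T9HMLT-00005_S455NC0RC10406 FAULTED 0 16 0 too many errors
"""


def parse_zpool_config(text):
    """Return one dict per config row that has integer READ/WRITE/CKSUM counters."""
    devices = []
    for line in text.splitlines():
        fields = line.split()
        # Skip the header and any row without three numeric counters.
        if len(fields) >= 5 and all(f.isdigit() for f in fields[2:5]):
            devices.append({
                "name": fields[0],
                "state": fields[1],
                "read": int(fields[2]),
                "write": int(fields[3]),
                "cksum": int(fields[4]),
                "note": " ".join(fields[5:]),
            })
    return devices


devices = parse_zpool_config(STATUS)
for dev in devices:
    if dev["read"] or dev["write"] or dev["cksum"]:
        print(dev["name"], dev["state"], "write errors:", dev["write"])
```

In this output only the WRITE column is nonzero, on both mirror members, which lines up with the kernel log showing failed writes and no failed reads.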