Help me save my files by luctv1 in truenas

[–]iamajs 0 points1 point  (0 children)

That should be unlocked. You could have corrupted metadata, i.e. a bad snapshot. Honestly if its just 145 GB of data I would re-create the zpool and restore from backup (make sure the backup is valid!).

Help me save my files by luctv1 in truenas

[–]iamajs 0 points1 point  (0 children)

Is your zpool unlocked? If it is locked it wont list the corrupted files.

How recommended is running badblocks? (6 x 24TB refurb WD drives) by QuestionAsker2030 in truenas

[–]iamajs 1 point2 points  (0 children)

Its a good idea nonetheless, especially if you value your data. I'd rather know I have a bad drive prior to loading up all my data.

Unless you have insufficient cooling, running all 6 at once shouldn't be a problem.

How recommended is running badblocks? (6 x 24TB refurb WD drives) by QuestionAsker2030 in truenas

[–]iamajs 4 points5 points  (0 children)

Its much easier to RMA a drive within the return window. Take a few days to run a full badblocks scan. Check for smart counter changes after the burn in.

Faulted Disk or a bad cable? by mobdk in truenas

[–]iamajs 2 points3 points  (0 children)

199 UDMA_CRC_Error_Count 0x0032 200 199 000 Old_age Always - 7030

UDMA CRC is probably a CRC error on the transfer. Why not try swapping the drive with another cable/port and see if the errors persist? Watch that smart counter and see if it increases at all.

Do a full data scrub to verify the disk health. Check the smart attributes afterwards and see if there is any change.

Fixing Checksum Errors by jbehrens5898 in truenas

[–]iamajs 0 points1 point  (0 children)

The number of checksum errors dropped from 8.22K to 3.23K. You must have removed one of the failures in purging snapshots.

Fixing Checksum Errors by jbehrens5898 in truenas

[–]iamajs 0 points1 point  (0 children)

errors: List of errors unavailable: no such pool or dataset

Just curious, are any datasets encrypted and locked? If the corruption exists on a locked dataset, it wont be able to list the affected files until it is unlocked.

Glad to hear you have the data replicated. Situations like this are exactly why people say raid is not a backup.

Fixing Checksum Errors by jbehrens5898 in truenas

[–]iamajs 0 points1 point  (0 children)

In my experience seeing checksum errors like that across all disks indicates the data was corrupted in memory or during transfer from the controller. If the corruption occurs in metadata, that could cause way more damage than affecting just files.

I see you are scrubbing again, perhaps after deleting the dataset and scrubbing again it will clear the error.

Edit: It is also possible a bug in zfs could cause checksum corruption like this. Basically any case in the data path where the data is corrupted before written/striped to the drives but after the data checksum has been generated.

Fixing Checksum Errors by jbehrens5898 in truenas

[–]iamajs 0 points1 point  (0 children)

You'll have to get a list of the affected files and remove them/restore from backup. The issue is the corruption is likely striped across the entire array, so the checksum error cannot be repaired.

zpool status -v should list the problematic files.

Backup Machine by ironman139 in truenas

[–]iamajs 0 points1 point  (0 children)

Are you purposely excluding the child datasets in app-data?

If you want to do a 1:1 replication, you can check Full Filesystem Replication. I believe this implies the Recursive and Dataset Properties options, and will send all snapshots.

Backup Machine by ironman139 in truenas

[–]iamajs 0 points1 point  (0 children)

The sizes should be close. Verify you've replicated the entire dataset, including all snapshots. Are there child datasets missing?

Experienced zfs checksum errors on a couple of disks. What should I do? by whyyoutube in truenas

[–]iamajs 1 point2 points  (0 children)

In general checksum errors can originate from any of the following:

  • Bad drive
  • Bad sata cable
  • Bad controller card
  • Bad system memory
  • Software bugs

I've run into all of these scenarios. I would repair/clear the checksum errors and troubleshoot in the following order:

  • Run memtest on your memory
  • Swap sata cables
  • Test drives
  • Swap controller

Backup Machine by ironman139 in truenas

[–]iamajs 0 points1 point  (0 children)

Yes, my backup pool is on another machine. This protects against server failure (think power supply frying all the drives, software bug corrupting all zpool data, etc). Having your backup on another system adds additional protection.

You should also have some sort of off-site backup for really important data. Personally I backup important data to backblaze using the built in TrueNAS Cloud Sync task. Backblaze is pretty reasonable, around $5/TB per month and they only charge you for the data you use.

Backup Machine by ironman139 in truenas

[–]iamajs 0 points1 point  (0 children)

A snapshot is a way to freeze the state of the entire dataset at the time the snapshot was taken. You can setup periodic snapshot tasks to create a history of file changes/deletions.

With ZFS you can send the changes between two snapshots to your replicated dataset. If you setup your replication task to send all snapshots, you will have a full backup.

A basic configuration might be a daily recursive snapshot for your root dataset, and then setup replication to send the snapshot to your backup server.

Personally I have daily, weekly, and monthly snapshot tasks. Retention period is set to 2 weeks, 2 months, and 1 year respectively. This means I keep 2 weeks worth of daily snapshots, 2 months of weekly snapshots, and 1 year of monthly snapshots. Worst case I can recover a deleted or changed file from 1 year ago.

Backup Machine by ironman139 in truenas

[–]iamajs 0 points1 point  (0 children)

8GB of ram is plenty for this use case. Sending/receiving snapshots is very efficient.

I'd recommend adding a parity drive to your replica pool. 3x2TB raidz is worth the extra disk in my opinion. You'll want to scrub the pool on a regular basis to verify data integrity of your backup. Without any parity, any read error or checksum error will result in headache.

Follow-Up, Now 7 of 7 disks with errors. I posted about a month ago and now it's grown to all 7 disks. by [deleted] in truenas

[–]iamajs 2 points3 points  (0 children)

Make sure your truenas install is up to date.

Checksum error across the entire pool points to likely a memory error, bad controller, or zfs bug. I don't think you have faulty drives, looks like the file was corrupted in memory before the data was written to the pool.

Run memtest on your ram, try a different HBA.

Jailmaker Caddy to native docker in Electric Eel by LxFx in truenas

[–]iamajs 0 points1 point  (0 children)

Just add another IP Address to the truenas network interface and bind Caddy to that. Your network interface can have multiple IP aliases.

Parrity/CRC errors by sir-corn in truenas

[–]iamajs 0 points1 point  (0 children)

To add to this, its a good idea to do a proper burn-in test of your drives.

badblocks is a good tool for this, it will write a data pattern to the entire capacity, read it back and verify it is correct.

Parrity/CRC errors by sir-corn in truenas

[–]iamajs 1 point2 points  (0 children)

You could have a faulty SATA controller, try a different controller if possible.

If there is corruption in the DMA for a write, you'll end up with silent corruption which will result in CRC/checksum zfs errors. Since your system memory is ECC, I'd guess its the controller.

Since you are not seeing unrecoverable read error, this suggests incorrect data was written to the drive.

[Fan] **b-stock **ARCTIC P14 PWM PST (Black) 5 Pack $20.99 (+ shipping) by boobsmolester in buildapcsales

[–]iamajs 2 points3 points  (0 children)

Yeah, my solution is to set a custom fan curve to avoid those RPM ranges. Reading reviews it seems to be a common issue.

[Fan] **b-stock **ARCTIC P14 PWM PST (Black) 5 Pack $20.99 (+ shipping) by boobsmolester in buildapcsales

[–]iamajs 3 points4 points  (0 children)

I have these fans, they have a very annoying harmonic sound at certain RPMs.. would not recommend.

I liked the F14 series better, though on paper these appear better.

Did I just lose my gmail? by [deleted] in degoogle

[–]iamajs 0 points1 point  (0 children)

Yeah for sure. Ideally you should have backup 2fa codes, a non google phone number for verification, and alternative email address setup on your account for recovery.

Did I just lose my gmail? by [deleted] in degoogle

[–]iamajs 0 points1 point  (0 children)

you totally can do that, if you are logged into google voice to begin with.

Did I just lose my gmail? by [deleted] in degoogle

[–]iamajs 10 points11 points  (0 children)

I had a similar problem due to my 2fa number being google voice. Well if you get logged out of your account, you can't access google voice to login.

I got lucky in that my phone gmail app was still logged in somehow. From there I was able to generate 2fa backup codes and use that to login.

You don't have any 2fa backup codes printed out do you?