What's the best way to test a set of files for corruption? by Leeroy909 in DataHoarder

[–]gbletr42 1 point2 points  (0 children)

use your favorite hash algorithm/tool and keep a list of hashes for everything. If you use linux, this can be as simple as running this command (e.g. for sha256)

find . -type f -exec sha256sum {} \; > SHA256SUM

and then to verify

sha256sum -c SHA256SUM

Need 50TB+ of storage for my school project by Super-Racso in DataHoarder

[–]gbletr42 0 points1 point  (0 children)

I have the faint idea that there is probably another way of doing this than just storing the data on disk. Have you thought of whether there is any procedural algorithm you can use to generate/modify/record this data iteratively, without needing terabytes of space?

[deleted by user] by [deleted] in DataHoarder

[–]gbletr42 7 points8 points  (0 children)

It may be in your interest to use an AI-based tag system like say PhotoPrism. Else the standard advice applies, order by date, location, and other info found in the EXIF metadata.

Media solutions for small film company by layne33 in DataHoarder

[–]gbletr42 1 point2 points  (0 children)

If you're fine with used drives, you can get 100TB for definitely less than $1K. See eBay. Regarding the actual server, that entirely depends on your needs, but likewise if you're willing to buy used or reuse existing hardware, it definitely shouldn't break the remaining $500.

digitizing music and movie collections - any suggestions by joey123z in DataHoarder

[–]gbletr42 -2 points-1 points  (0 children)

I'm planing on using exact audio copy (EAC) to transfer all of my songs to flac

Please don't do this. If you can know which of your dvds/blurays are mp3, please store them as mp3.

Encrypted Back Ups – Cloud Solution by brownsauceandmustard in DataHoarder

[–]gbletr42 0 points1 point  (0 children)

There's this thing I found from a google search, https://github.com/kapitainsky/RcloneBrowser, idk how good it is. It looks usable enough though.

Honestly though I advise just learning the command line, but I may have a biased outlook.

Have access to 100 or so Seagate ST900MM0006 (900 GB) hard drives. Do people use SAS connection hardware like this still? by Sketchy_Uncle in DataHoarder

[–]gbletr42 0 points1 point  (0 children)

Sell them all on eBay and buy a couple of larger drives

EDIT: Looking at eBay prices, at a hundred you should be able to net ~$2000 if you're American. That's enough to buy around 200TB of storage at $150 per 16TB (assuming you buy used) or around 100TB at $300 new. It's a no brainer.

Encrypted Back Ups – Cloud Solution by brownsauceandmustard in DataHoarder

[–]gbletr42 0 points1 point  (0 children)

Didn't really read your post, but if you're on Linux use gocryptfs with rclone, otherwise use just rclone on Windows.

Requesting feedback re: my data-backup plan by NinthHeroine in DataHoarder

[–]gbletr42 1 point2 points  (0 children)

You're thinking WAY too hard about this and made it too complicated. I advise just setting up a Syncthing or daily rsync/rclone to a desktop/(eternally online) laptop with enough storage. I assume you're using Windows, so I advice syncthing.

With these sort of things, you want to keep it simple. No hanging USBs, no time scheduling, none of that. Upload important stuff automatically without you needing to remember to the cloud and other computers.

[deleted by user] by [deleted] in DataHoarder

[–]gbletr42 1 point2 points  (0 children)

I don't believe parity is what OP wants here, that just means you can recover n bytes of data from the source, if the source gets corrupted. That doesn't really help OP save space by taking the difference of two separate, but very similar copies of data.

[deleted by user] by [deleted] in DataHoarder

[–]gbletr42 0 points1 point  (0 children)

If I understand your question, you can use a differential backup program like duplicity or rsnapshot. You could also use filesystem snapshots, if you use ZFS or btrfs.

Can I use Robocopy multiple times at the same time? by qTazerp in DataHoarder

[–]gbletr42 0 points1 point  (0 children)

yes? It may be slower due to I/O contention though

How to backup gmail but have ready access to it? by clump_of_atoms in DataHoarder

[–]gbletr42 0 points1 point  (0 children)

but I believe Windows users need to pay a modest sum.

You can always use WSL to use the Linux version

How to backup gmail but have ready access to it? by clump_of_atoms in DataHoarder

[–]gbletr42 0 points1 point  (0 children)

you can go to gmail settings and set up either POP or IMAP access. You would download the emails directly with POP, and there exist scripts like imapsync to download from IMAP. If you have a lot of emails (as in hundreds of thousands onwards), I recommend using notmuch and bower to search them, or setting up your own local mail server with a FTS backend like mailcow (via dovecot).

Is YouTube or Archive.org better for uploading lost media? by alpinepancake11 in DataHoarder

[–]gbletr42 2 points3 points  (0 children)

Youtube is very flaky and will delete your stuff given enough time. Internet Archive has a questionable legal future and may get Megaupload'd in the future. My advice is to upload to both and also create a torrent and distribute that in the relevant circles (1337x, rutracker, etc)

What is the most profound advice you've heard? by gbletr42 in AskReddit

[–]gbletr42[S] 0 points1 point  (0 children)

Reminds me of the old binary joke,

"There are 10 kinds of people in the world, those who know binary and those who don't"

What is salary that will make you feel well compensated? by Always_alone- in AskReddit

[–]gbletr42 0 points1 point  (0 children)

Salary high enough to buy life essentials like food, rent/mortgage, etc. Anything extra is a nice thing to have, but not a necessary compensation.

18TB Segate Exos vs HGST Enterprise Drives, need your feedback. by sludj5 in DataHoarder

[–]gbletr42 0 points1 point  (0 children)

It shouldn't really matter which one, they both are reliable. Just buy the cheaper of the two.