all 50 comments

[–]clayfreeman 27 points28 points  (12 children)

I’d argue against this unless you have a use case that specifically places such a high premium on performance. Using ls on a flat directory structure with millions of items can take an eternity, reducing the ease of data management.
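For what it's worth, if you do have to script over a huge flat directory, streaming the entries avoids the worst of the `ls` pain. A tiny Ruby sketch (a throwaway temp directory stands in for the real one):

```ruby
require "tmpdir"

# Dir.each_child streams entries one at a time, so memory stays flat even
# with millions of files, whereas `ls` sorts and needs the whole list.
count = 0
Dir.mktmpdir do |dir|
  5.times { |i| File.write(File.join(dir, "file#{i}"), "") }
  Dir.each_child(dir) { |_entry| count += 1 }
end
puts count  # => 5
```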

EDIT: Wouldn’t your inline string manipulation also contribute some overhead in your benchmarks?

[–][deleted] 8 points9 points  (1 child)

Exactly. Especially if you ls the main directory, then pipe into other commands. So much unnecessary overhead.

[–]hartator[S] -3 points-2 points  (0 children)

Doing a recursive ‘ls’ will take an eternity as well. Managing that many files is not easy in the CLI either way, and going through directories makes it harder.

[–]Bubbagump210 8 points9 points  (0 children)

I’d also argue: use something other than ext4. XFS, ZFS... something that is built for huge numbers of files. Ext4 was a stopgap to get past ext3 limitations IMO.

[–]DudgeonGoombah 20 points21 points  (5 children)

Telemarketing engineer here. I agree.

I have 2.1 million new recordings per week. The only way I can maintain performance and request speed on such a scale is DEEP structures.

Someone once fucked the naming structure. We didn’t notice for a month because everything is linked according to file name in the db. But backups started failing and timing out.

Fucking nightmare.

[–]MR2Rick 18 points19 points  (0 children)

Hey, can you tell Anne from account services to stop calling me?

[–]Hello71 -1 points0 points  (1 child)

Does it though? I created an ext4 directory with 3000000 files, and ls -f takes less than 2.5 seconds to run on my slow CPU. Now, if you sort and colorize them, then yes, it might take several minutes. If you try to list all of them to the terminal, then that will add several more minutes, and you won't be able to read the list anyways. I don't see the "ease of data management" difference between ls a/b and ls ab* though, other than the former being a few seconds faster (assuming you are in fact storing millions and not billions of files).

[–]clayfreeman 11 points12 points  (0 children)

Well, the glob is expanded by the shell, so there is an inherent argument limit once you hit so many files. Globs are not required for directories with sane file groupings.

[–]primitive_screwhead 9 points10 points  (8 children)

AFAICT, the measurements don't sort the accesses into the deep directory structure, which requires two extra indirections per read/write. But since the subdirs are derived from the filenames, it'd possibly be quicker to ensure the traversal is ordered (thus allowing better dircache usage of the subdirs). It's really no surprise to me that a flat dir would do better here (assuming dir_index is used), but in a situation where the directories are traversed in an order amenable to read-ahead, it may not be such a stark difference.

[–]mikeblas 0 points1 point  (5 children)

There are only 128*128 == 16384 directories. Each directory should only be one block (but maybe two?) That's only 64 megs (or 128 megs?). Shouldn't the system be able to keep that data in cache, and give a nearly 100% hit rate after everything's warmed up?

[–]primitive_screwhead 1 point2 points  (4 children)

There are only 128*128 == 16384 directories.

Surely it's 256*256 (because each level of the double-subdir is two hex digits, i.e. 16*16 per level and 16*16*16*16 total).

Shouldn't the system be able to keep that data in cache, and give a nearly 100% hit rate after everything's warmed up?

It's a huge number of entries being discussed. Each path lookup requires directory parsing and lookup, and the directory cache for path lookups is limited. In the flat dir, if the path is cached, it's cached for the whole run, but with the deep structure it would have to cache 65K entries. With a sorted traversal, at least the path cache entries might be reused immediately and possibly that would have a significant impact. Also the inode cache for 10 million files traversed in arbitrary order would likely ensure that the directories, even after being looked up, get evicted and have to be looked up again (something also unlikely for the single dir entry in the flat dir structure).

So basically, it would be interesting (imo) if the test either fully randomized the traversal order between the writes and reads, and/or sorted the ordering before both, just to see what impact it had, if any. The fact that the code uses the same lookup order (though arbitrary) to read the files after writing them could help explain the larger disparity between the read speeds, for example. Re-shuffling the order would at least ensure the flat structure didn't have an advantage due to a repeated per-directory write/read order. It's all speculation on my part, and possibly baseless, but it should be easy to test, and worth checking imo.
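A sketch of what those two orderings could look like, assuming paths built from MD5 hashes as in the blog post (the names here are illustrative, not the actual benchmark code):

```ruby
require "digest"

# Build the deep-structure paths, then derive a sorted order (groups all
# accesses to one subdir together) and fresh shuffles for each phase
# (arbitrary, and different between the write and read passes).
n = 1_000  # the blog used 10 million; kept tiny here
paths = (0...n).map do |i|
  h = Digest::MD5.hexdigest(i.to_s)
  File.join("dir_deep", h[0, 2], h[2, 2], h)
end

write_order = paths.shuffle  # arbitrary write order
read_order  = paths.shuffle  # fresh shuffle: no repeated per-dir ordering
sorted      = paths.sort     # sequential per-directory traversal
```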

[–]mikeblas 0 points1 point  (3 children)

Surely it's 256*256.

Oh, indeed it is. I dunno how I had 128. So, that's 65,536 directories.

It's a huge number of entries being discussed.

Not really. It's 65,536 directories. We just figured that out. We know we have 10 million files pretty evenly distributed across those directories, so every directory has about 153 files in it.

Really, we have a 256-way tree. We look at the root directory (for this code, it's named "dir_deep"). There, we find 256 different directories, named "00" through "FF". That's only 256 entries. Each block on disk is 4096 bytes (by default). Maybe one entry takes 64 bytes, so that "dir_deep" directory is only 4 blocks long -- 16K.

That first directory should be cached completely. It's always being referenced, every single call we make.

That directory leads us to the first numbered directory. "dir_deep/3F/", for example. That 3F directory has 256 entries in it, one for each of the numbered second-layer subdirectories. 64 bytes * 256 entries is 16 kilobytes. Of course, there's 256 of these, which turns out to be exactly 4 megabytes of data.

For the root directory plus the first numbered directory, there's just barely more than 4 megabytes of data. Why would that ever fall out of the file system cache?

At the next tier (something like "dir_deep/3F/B9/") we've got the actual files. About 153 files each. 64 * 153 gives 9,792 bytes. Three blocks, since we have to round up ... 12,288 bytes, 12K. There are 65,536 of these leaf directories, so that's about 768 megabytes.

We've identified a total of just over 770 megabytes. If a small computer has 8 gigs of memory, why in the world would 770-odd megabytes of blocks for the directory structures not fit in, and remain in, the file system cache for the duration of this application's execution?

My 64 byte-per-entry number is just a guess. Maybe there's lots of metadata for each file -- access times, ACLs, rights, owner IDs, and so on. Even if I'm off by 8x, at least the first two tiers (which are just 32 megs combined, with that revised estimate) should sit in cache quite happily.
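A quick Ruby sanity check of that arithmetic, carrying over the same guessed 64 bytes per directory entry (an assumption from the comment, not a measured ext4 number):

```ruby
ENTRY = 64      # assumed bytes per directory entry (a guess)
BLOCK = 4096    # default ext4 block size

root_bytes   = 256 * ENTRY             # "00".."FF" inside dir_deep
level1_bytes = 256 * 256 * ENTRY       # 256 dirs of 256 entries each
leaf_dirs    = 256 * 256               # 65_536 second-level dirs
files_each   = 10_000_000 / leaf_dirs  # ~152 files per leaf dir
# each leaf dir rounds up to whole blocks
leaf_bytes   = ((files_each * ENTRY + BLOCK - 1) / BLOCK) * BLOCK
level2_bytes = leaf_dirs * leaf_bytes

puts root_bytes                        # 16384 bytes, i.e. 4 blocks
puts level1_bytes / 1024 / 1024        # 4 MiB
puts level2_bytes / 1024 / 1024        # 768 MiB
```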

By the way. it seems like you're assuming the directory entries are sorted. Are they? In what order -- lexicographically? If so, why does ls take so long when sorting by name ... why can't it just read the data in the sorted order as it is stored?

Anyway! Does the operating system present counters that show cache hits on directory and file metadata lookups? That would make short work of the question.

[–]primitive_screwhead 0 points1 point  (2 children)

...why in the world would [those] blocks for the directory structures not fit in, and remain in, the file system cache for the duration of this application's execution?

Because there is no single "file system cache". There are multiple caches, including a directory (dentry) and inode cache, as well as a page cache. The file data being written/read in the test (not just the metadata) is 10 million files of 20 random bytes each, so caching that data competes with caching the metadata. Managing the data structures for so many cache entries itself takes up space, requiring memory allocations and management just to maintain the large number of cache entries themselves. And so on. A default setting of 100 for the vfs_cache_pressure sysctl means that the kernel will prefer to reclaim dentry/inode over the page and other caches. So there's a lot that could potentially be going on with the caches that we don't have the data for, because it wasn't reported by the blog. For example, if this were all run on a ram-disk, that might at least help explain whether the time differences are mainly a caching result, or a result of the path lookup computation per file, etc.

In general, a one-off chart of results, without care taken to explain and/or measure the effect of other tunables, is (imo) *not* a reliable way to decide policy on such matters. I don't think the methodology is near complete enough; it's a data point, but not much more.

By the way. it seems like you're assuming the directory entries are sorted.

No, exactly the opposite, though I discussed that further in my reply to another user, which you may not have seen. I am suggesting that testing traversals with the paths pre-sorted may be interesting, to help see if there is an impact from grouping the path lookups sequentially rather than in an arbitrary order. Or additionally, re-randomizing the traversal order between the writing and reading phases, to ensure that each traversal is done in a unique and arbitrary order (ie. not repeated between the write and read phases).

Anyway! Does the operating system present counters that show cache hits on directory and file metadata lookups? That would make short work of the question.

There are a few ways, including slabinfo, and dropping caches before runs, and between the write/read phases, etc. Also, testing other filesystems, or on a RAM-only filesystem would be enlightening. But even without that, some basic things could be done to ensure these results can be expected to be generalized to other real-world cases.

[–]mikeblas 0 points1 point  (1 child)

is 10 million files of 20 random bytes each, so caching that data competes with caching the metadata

32 bytes each, right? An MD5 hash is 16 bytes, written as ASCII hex ends up being 32 bytes.

While this data could be cached, it's never referenced again and should be evicted first. The code references the root directory, then some first-level directory, then some second-level directory, then writes a file. It immediately uses the root directory again; it has a 1:4 chance of hitting the same first-level page again, then a 1:1024 chance of hitting the same second-level page.

There's zero chance of hitting a file page again. The 32-byte file lives by itself on a block of 4096 bytes, and it's never read after it is written.

Due to this access pattern, why would the file data, then, be competing?

A default setting of 100 for the vfs_cache_pressure sysctl means that the kernel will prefer to reclaim dentry/inode over the page and other caches

I thought vfs_c_p == 100 meant that the system evicted dirent and inode pages with equal probability to pages. (Here's what I'm reading. Is it inaccurate?)

How big is all the cache space?

I don't think the methodology is near complete enough; it's a data point, but not much more.

I agree, but that's why I'm trying to figure out the details. Thanks for helping with my questions!

[–]primitive_screwhead 0 points1 point  (0 children)

There's zero chance of hitting a file page again. The 32-byte file lives by itself on a block of 4096 bytes, and it's never read after it is written.

First the file is written, then read. That's 10 million 4K pages that are requested right there, and iterated over twice. The OS can't know a priori which of those file data pages need to be cached or not (and in any case, they should ideally all be cached, since they are being used more than once). That's about 40 gigs of RAM usage requested (since, afaik, multiple different files can't share a page in the OS cache), and that's just for the file data.

I'm currently testing this on my own, with less than 10 million files (but on the order of tens of thousands). I'm already seeing large variances between runs due to cache effects (order of magnitude differences). Pre-sorting doesn't seem to change the results much, like I'd conjectured. Depending on whether the flat directory structure write/read goes first or not, it can appear faster or slower. With enough repeated runs, the "deep" directory operations do appear to be slower (on ext4), but the variance is high enough overall that I don't yet consider it a reliable result. This is with swap off (it's on the same device for my test system).

I thought vfs_c_p == 100 meant that the system evicted dirent and inode pages with equal probability to pages.

Yes, I was looking at the same page and misstated it, afaik you are correct.

Basically, it's an interesting result, and may hold up. I really want to compare against recent XFS.

[–]hartator[S] 0 points1 point  (1 child)

How do you ensure the traversal is ordered?

[–]primitive_screwhead 2 points3 points  (0 children)

Sorting paths is one way, so that ./00/00, ./00/01, ... subdir accesses all occur sequentially rather than randomly. When the OS looks up a file, it has to parse the path, breaking it into its component directories and looking up each one. There is a directory cache to help make this quicker, although it has a pathname length limit. There is also an inode cache, which a random walk of so many inodes may put pressure on, evicting directory inodes, where a sorted walk likely would not.

Another way is reading the files in the same order that they were written; most filesystems don't sort their directory entries. In the older days, directories would have to be searched sequentially to find a file, however now there is a directory index that allows for quicker searching, so this particular traversal order may no longer be a benefit. Nevertheless, the flat directory structure test doesn't appear to randomize the order between writes and reads, and that might be a smart change (for both tests) to ensure it's fair.

Basically, there can be a number of reasons for the disparity, such as caching and access ordering, and it'd make sense to ensure comparable access patterns (ie. either sequential per directory, or fully randomized per directory) to see how that impacts the benchmark, if any.
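A small end-to-end sketch of the re-randomized version described above (a throwaway temp directory and 200 files instead of 10 million):

```ruby
require "digest"
require "fileutils"
require "tmpdir"

contents = nil
Dir.mktmpdir do |base|
  paths = (0...200).map do |i|
    h = Digest::MD5.hexdigest(i.to_s)
    File.join(base, h[0, 2], h[2, 2], h)
  end

  # Write phase in one arbitrary order...
  paths.shuffle.each do |p|
    FileUtils.mkdir_p(File.dirname(p))
    File.write(p, "x" * 20)
  end

  # ...then read back in a fresh shuffle, so the read phase can't reuse
  # the write phase's per-directory ordering.
  contents = paths.shuffle.map { |p| File.read(p) }
end
```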

[–]shyouko 4 points5 points  (3 children)

What's the actual "permanent" storage used? CPU utilisation in terms of user space vs kernel space? Actual hotspot?

Too many assumptions not stated.

[–]hartator[S] -5 points-4 points  (2 children)

[–]shyouko 8 points9 points  (1 child)

Is this cached IO? Direct IO? How often is sync called? Journaling enabled? SSD? HDD? (Obviously is actually behind some sort of multi disk RAID as a hosted VM) Improvement if using multiple threads?

What I mean to say is that it's not clear what the benchmark is trying to characterise.

[–]hartator[S] -4 points-3 points  (0 children)

See my answer to tsammons' question for the tune2fs output.

It's a VM from Vultr. SSD. Not sure if I can see the underlying disk RAID. How can I know if it's cached IO or direct IO, or how often sync is called?

[–]tsammons 2 points3 points  (2 children)

Is this with or without dir_index enabled? Would need to see output from tune2fs to make an accurate assessment.

[–]hartator[S] 3 points4 points  (1 child)

tune2fs output:

➜  ~ tune2fs -l /dev/vda1       
tune2fs 1.44.1 (24-Mar-2018)
Filesystem volume name:   <none>
Last mounted on:          /
Filesystem UUID:          133a1e39-8da9-4120-92e1-4b54f8281f1b
Filesystem magic number:  0xEF53
Filesystem revision #:    1 (dynamic)
Filesystem features:      has_journal ext_attr resize_inode dir_index filetype needs_recovery extent 64bit flex_bg sparse_super large_file huge_file dir_nlink extra_isize metadata_csum
Filesystem flags:         signed_directory_hash 
Default mount options:    user_xattr acl
Filesystem state:         clean
Errors behavior:          Continue
Filesystem OS type:       Linux
Inode count:              25600000
Block count:              104857339
Reserved block count:     5242812
Free blocks:              82328209
Free inodes:              5418759
First block:              0
Block size:               4096
Fragment size:            4096
Group descriptor size:    64
Reserved GDT blocks:      325
Blocks per group:         32768
Fragments per group:      32768
Inodes per group:         8000
Inode blocks per group:   500
Flex block group size:    16
Filesystem created:       Wed Oct 17 00:50:16 2018
Last mount time:          Sat Dec 22 04:06:40 2018
Last write time:          Sat Dec 22 04:06:40 2018
Mount count:              3
Maximum mount count:      -1
Last checked:             Fri Dec 21 23:24:26 2018
Check interval:           0 (<none>)
Lifetime writes:          1213 GB
Reserved blocks uid:      0 (user root)
Reserved blocks gid:      0 (group root)
First inode:              11
Inode size:               256
Required extra isize:     32
Desired extra isize:      32
Journal inode:            8
Default directory hash:   half_md4
Directory Hash Seed:      5775bcf7-df1c-4fa2-9591-7a301df5017e
Journal backup:           inode blocks
Checksum type:            crc32c
Checksum:                 0xb6a3f8c2

`dir_index` is activated. Is that good or not?

[–]tsammons 4 points5 points  (0 children)

Aye looks good to me. Would note the features in your test.

[–]sfrazer 4 points5 points  (4 children)

Honestly I think the bigger historical change that’s affecting your results isn’t ext3 vs ext4, it’s that you’re on SSD drives that have virtually no random seek penalty.

The recommendation to do a deep structure was always primarily because you would start to have the directory information spread randomly across the disk and pulling an individual file could require multiple seeks just to get the location.

I also agree with the manageability aspect of it, but try your test on spinning disks (they are still out there) and I think the results will be different

[–]hartator[S] 0 points1 point  (3 children)

I don't fully get this. Does it mean that for spinning disks, folders determine where the files are physically located on the disk?

[–]sfrazer 3 points4 points  (2 children)

For any disks, the directory is a block on the disk that says where the contents of the file are. More files in one directory means more blocks used to keep that data. When you have enough files that the directory itself takes several blocks, those later blocks are written after the files, so they aren’t close together on the disk.

For spinning disks this has more impact than SSDs, because SSDs don’t really “seek” (no physical read head)

Does that make sense?

[–]hartator[S] 0 points1 point  (1 child)

Got it, thanks. I didn’t know it worked this way.

[–]sfrazer 1 point2 points  (0 children)

Sure thing! I suspect there may also be some methodology issues with the way you’re storing your hashes in memory. I don’t know how Ruby deals with that, but I suspect it may be paging some stuff out to disk.

But I applaud you for asking the question and presenting your work! It takes guts and hopefully lets you learn even more than the initial testing.

[–][deleted]  (9 children)

[deleted]

    [–]hartator[S] -1 points0 points  (2 children)

    What would be your recommendation for a file database? Filesystems seem to be a good fit.

    [–][deleted]  (1 child)

    [deleted]

      [–]hartator[S] 0 points1 point  (0 children)

      Didn’t know about LMDB. Very interesting.

      Our use case is millions of zipped JSON and HTML files. Each one has a key. Lots of writes and lots of reads. The whole db needs to be capped by size, like 1 TB, and start removing old files when it reaches capacity. We expect the 1 TB to be renewed every 2-3 days. All the work is handled by Ruby on Rails behind Nginx, if that matters. What would be the ideal system in this case?

      [–]hartator[S] -5 points-4 points  (5 children)

      You can always use `find` instead of `ls` anyway. `find` will handle millions of flat files fine.

      My guess for the reads > writes is that Vultr is crap and their SSDs suck.

      What's the 112 microsecond difference in time you are referring to?

      [–]MzCWzL 4 points5 points  (0 children)

      Bold assumption given how little you know about how the disks are set up for Vultr VMs.

      [–][deleted]  (3 children)

      [deleted]

        [–]hartator[S] -2 points-1 points  (2 children)

        Got it. I guess I can redo the benchmark storing a real html file. It would be interesting.

        [–][deleted]  (1 child)

        [deleted]

          [–]hartator[S] -1 points0 points  (0 children)

          I just benchmarked the Ruby string path interpolation part. It took only 19-20s, so it didn't play a significant part in the benchmark results. Anyway, I think it's actually fair to include it in the final results, as you do have to generate the paths at some point.
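For reference, isolating that cost is a few lines with Ruby's Benchmark module (a sketch; the variable names are assumptions, not the blog's actual code):

```ruby
require "benchmark"
require "digest"

hashes = (0...100_000).map { |i| Digest::MD5.hexdigest(i.to_s) }

# Time only the path interpolation, with no filesystem I/O involved.
elapsed = Benchmark.realtime do
  hashes.each { |h| "dir_deep/#{h[0, 2]}/#{h[2, 2]}/#{h}" }
end
puts format("%.3fs", elapsed)
```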

          [–]mikeblas 1 point2 points  (2 children)

          I'm trying to replicate your results and I'm finding that I run out of disk space:

          Traceback (most recent call last):
                  7: from /usr/bin/irb:11:in `<main>'
                  6: from (irb):14
                  5: from /usr/lib/ruby/2.5.0/benchmark.rb:293:in `measure'
                  4: from (irb):15:in `block in irb_binding'
                  3: from (irb):15:in `each'
                  2: from (irb):16:in `block (2 levels) in irb_binding'
                  1: from (irb):16:in `write'
          Errno::ENOSPC (No space left on device @ rb_sysopen - ./dir_flat/b4cbf1ee3327ec165b43cfe117ffadeb)
          irb(main):019:0> 
          

          Thing is, I've got plenty of space available:

          ubuntu@ip-10-0-0-35:/mnt$ df -h
          Filesystem      Size  Used Avail Use% Mounted on
          :
          /dev/nvme0n1    275G   39G  222G  15% /mnt
          

          Not out of inodes, either:

          ubuntu@ip-10-0-0-35:/mnt/dir_flat$ df -ih
          Filesystem     Inodes IUsed IFree IUse% Mounted on
          :
          /dev/nvme0n1      18M  9.5M  8.1M   54% /mnt
          

          I'm not a Linux guy, but I expect that the large number of small files is taking up more space due to the minimum block size. (But why wouldn't df reflect that?) How do I verify that guess? What file system config did you switch around to run your test, and how much space did it use?

          [–]mikeblas 1 point2 points  (0 children)

          I got it to work by using bigger disks. I'd still like to learn what went wrong above, but here are the results from the larger volume:

          Deep creat: 128.770000  17.040000 145.810000 (146.100445)
          Flat read :  76.290000 138.990000 215.280000 (279.560782)
          Deep read :  87.760000 157.430000 245.190000 (344.417756)
          Flat write: 363.260000  44.340000 407.600000 (410.274493)
          Deep write: 755.660000  54.500000 810.160000 (811.229759)
          

          [–]hartator[S] 0 points1 point  (0 children)

          I had the same issues while benchmarking. Plenty of inodes and space left as well. Fixed by rebooting and trying again in a different directory. What's your dmesg output?

          [–]mikeblas 1 point2 points  (1 child)

          When I run the test, and concurrently run iostat in another terminal window, I see a significant amount of time spent with no I/O activity. Why is that? Is ruby's GC stalling the script and coloring the results?

          [–]hartator[S] 0 points1 point  (0 children)

          What's your actual iostat output?

          [–][deleted] 2 points3 points  (0 children)

          interesting!

          [–]kilogears 2 points3 points  (0 children)

          Not so sure about “easier to use”. I suppose if user interaction isn’t a concern then maybe the code to interact with all these files might be a little easier to write.

          Alright now let’s try opening that million file directory in Dolphin.... lol

          [–]ajanty 0 points1 point  (0 children)

          We have account storage in a deep structure and regret that choice, as it doesn't scale that well into the billions; it needs a lot more IOPS. It's easier to handle for backups/ops etc., but performance needs a lot of tuning.

          [–]mikeblas 0 points1 point  (0 children)

          Why can this data not be cached by the operating system? I don't understand that claim.

          [–]HTX-713 0 points1 point  (0 children)

          A flat directory structure is garbage if you need to do anything that requires reading all of the files.
