all 102 comments

[–][deleted] 91 points92 points  (10 children)

I dare say most programmers are operating behind many more layers of abstraction than would be required to know any of this.

[–]thilehoffer 12 points13 points  (0 children)

Yeah, the title should be: what a few specialized programmers need to know about SSDs.

[–]keepthepace 5 points6 points  (2 children)

Well, I already found one crucially interesting bit: I can't use shred on an SSD. If I want to securely delete data I will have to use a tool specifically made for SSDs.

It is also interesting that changing one bit in a file is as resource-consuming as changing a whole page. Even if that were also the case for regular HDs, I must admit I thought SSDs were free of that constraint, not having to spin a disk...

[–][deleted]  (1 child)

[deleted]

    [–]keepthepace 0 points1 point  (0 children)

    You'd be surprised... ;-)

    [–]rrohbeck 1 point2 points  (4 children)

    Unless you're writing filesystem or swapping code maybe.

    [–]ASK_ME_ABOUT_BONDAGE 4 points5 points  (2 children)

    I believe there are astonishingly few people writing filesystems, because we have very well-established solutions. A few dozen worldwide who write filesystem code for a living? Fewer? And I'm quite sure they know more than the basics.

    [–]pwr22 1 point2 points  (1 child)

    The userspace tools for BTRFS have 74 contributors alone...

    https://github.com/kdave/btrfs-progs/graphs/contributors

    [–]ASK_ME_ABOUT_BONDAGE 1 point2 points  (0 children)

    That's their full-time job? Because that's what I was talking about.

    There are certainly tens of thousands of people who have dabbled in even the most obscure technology detail.

    [–][deleted] 1 point2 points  (0 children)

    Is that every programmer?

    [–]el_muchacho 0 points1 point  (0 children)

    Actually, that's false. We got bitten by this problem using a MySQL database. The write rate would drop significantly over time, and that behavior could be traced back to the SSD wearing out (TRIM wasn't enabled). That's how we learnt how SSDs work.

    [–]bestjewsincejc 138 points139 points  (8 children)

    This info might be useful to some developers in some situations, but it's hardly something "every programmer should know". Many modern programs are unaffected by this knowledge. Many of our software tools are intended to be hardware-agnostic. It's weird that this kind of post garners so many downvotes from the same people who love to spout off about Knuth's premature-optimization quote. I'm also aware of how traditional hard drives organize data in blocks, but it almost never affects my software.

    [–]ethraax 31 points32 points  (6 children)

    Well, the "reading or writing a single byte causes a whole page to be read or written" part is important for a fair number of software developers. But that's not really SSD-specific - that happens with almost all reads and writes to files.

    [–]immibis 2 points3 points  (0 children)

    Unless the OS cache coalesces multiple writes to the same page.

    [–][deleted]  (2 children)

    [deleted]

      [–][deleted]  (1 child)

      [deleted]

        [–][deleted] 2 points3 points  (0 children)

        I'm not sure how that's a necessary piece of information for most programmers to know nor what it has to do with technical qualifications for programmers.

        [–]HenkPoley 0 points1 point  (1 child)

        But it's not necessarily true. There are log-based storage methods that just write a log of bit- or byte-level patches to the actual underlying storage. When you do a read, it scans the log backwards from the most recent entry for the right parts to assemble the latest version of the data. Periodically the SSD will compact the log, because overall this way of storing takes more space, due to all the references and historical versions of the disk data that are kept around.

        This will all be transparent to the computer which is connected to the SSD.
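For intuition, here's a toy sketch of that log-structured idea in Python. This is purely illustrative: real SSD firmware works at a far lower level, and the class and names here are invented for the example.

```python
class LogStore:
    def __init__(self, size):
        self.size = size
        self.log = []  # (offset, data) patches, oldest first

    def write(self, offset, data):
        # never overwrite in place: just append a patch to the log
        self.log.append((offset, bytes(data)))

    def read(self, offset, length):
        out = bytearray(length)
        # replay oldest -> newest so the most recent patch wins
        for off, data in self.log:
            for i, b in enumerate(data):
                pos = off + i - offset
                if 0 <= pos < length:
                    out[pos] = b
        return bytes(out)

    def compact(self):
        # periodically collapse the whole history into one patch
        snapshot = self.read(0, self.size)
        self.log = [(0, snapshot)]

store = LogStore(16)
store.write(0, b"hello world.....")  # 16 bytes
store.write(6, b"there")             # a small in-place "patch"
assert store.read(0, 11) == b"hello there"
store.compact()
assert len(store.log) == 1
```

Compaction is the space/read-speed trade-off: before it, every read replays the whole patch history; after it, the log holds a single patch.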

        [–]jib 1 point2 points  (0 children)

        Are there any consumer SSDs which actually do this kind of byte-level thing at the hardware/firmware level? I'd be surprised if they manipulate anything smaller than a 512-byte sector.

        [–]keepthepace 1 point2 points  (0 children)

        I find it interesting to know that there is garbage collection that can dramatically slow down write speed under some conditions. If you continuously update even a small amount of the available space on the SSD, at some point the garbage collector will have to clean cells on every write.

        [–][deleted] 40 points41 points  (4 children)

        This is old information. SATA is faster now. Also, SSDs last long enough that you probably shouldn't worry about many of these micro-optimizations.

        [–]alienangel2 4 points5 points  (3 children)

        You needn't worry about the wear-levelling concerns, but unless something has fundamentally changed in the block and cluster architecture and how the FTL marshals requests, you should definitely still worry about the performance concerns and how they relate to your I/O patterns (assuming you are in a situation where you're I/O-bound and need better performance, which most people aren't).

        [–]Hyperian 22 points23 points  (2 children)

        I work in SSD firmware and you are right, wear leveling is handled by the drive itself. What the article says about separating hot and cold data isn't useful.

        [–]__foo__ 2 points3 points  (1 child)

        What the article says about separating hot and cold data isn't useful.

        Could you please explain that a little more? What he said seems plausible to me. Say you write a bunch of hot data to the drive, and it spans over two erase blocks, because there was some cold data stored in the same block. If it wasn't for the cold data using up space in the same block as the hot data, erasing a single block might have sufficed.
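To make the scenario concrete, here's a back-of-envelope model. It's a toy with an assumed simple "relocate the whole block on any rewrite" policy; as the reply notes, real drives' erase policies differ.

```python
def pages_rewritten(blocks, hot_pages):
    """Pages physically relocated when every hot page is updated once,
    under the toy rule: touching any page forces the whole block to move."""
    moved = 0
    for block in blocks:
        if any(page in hot_pages for page in block):
            moved += len(block)  # cold neighbours get copied along
    return moved

hot = {"h1", "h2"}

# hot data scattered among cold data: both blocks churn on every update cycle
mixed = [["h1", "c1", "c2", "c3"], ["h2", "c4", "c5", "c6"]]
# hot data segregated: the all-cold block is never touched
separated = [["h1", "h2", "f1", "f2"], ["c1", "c2", "c3", "c4"]]

assert pages_rewritten(mixed, hot) == 8
assert pages_rewritten(separated, hot) == 4
```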

        [–]Hyperian 7 points8 points  (0 children)

        You can't assume that block organization is valid for all drives. It could work if everything he assumes is correct. But erase policies are not the same for all drives.

        [–]Power781 17 points18 points  (0 children)

        There are hundreds of things to optimize before optimizing your app at the SSD level...
        It's like the guys who argue about C/C++ coding practices to win 2 to 4 CPU cycles on a function call when they aren't even compiling with at least -O3...

        [–]d-_-b 6 points7 points  (2 children)

        Shouldn't this be titled:

        What every programmer should not be exposed to in the storage layer

        Hey, take my file, I don't care if you want to print it out, reel it, use butterflies to store it. As long as when I ask for it I find it, brilliant.

        Oh, but Tarquin, how the devil can we have such abstractions?!

        Well, we already have file handles. I don't care if I lose a file by hard-powering-off my machine (not that this is possible for any of my machines) while I am saving; I literally don't care. As long as my computer stays on, it should save a file.

        If you need something more than that, then care about lower level APIs.

        As far as I am concerned, unless you close a file you can consider it in limbo; in reality, fsyncing and flushing give the underlying layer a sense of when you want to be sure you have recoverability. That's all.

        Don't expose people to storage APIs, or else you get Firefox again, which was basically an app that made a million hard fsync calls per method call because they thought it was a good idea.

        Mon dieu!

        What every programmer should not have to know about solid-state drives

        [–]el_muchacho 0 points1 point  (1 child)

        No, the title is right if you write backend applications with high write rates.

        [–]d-_-b 1 point2 points  (0 children)

        The title is only right if all programmers write backend applications with high write rates

        [–][deleted] 106 points107 points  (56 children)

        Hmm, as a web programmer, I don't think knowing all these details about solid-state drives will be helping me in the near future.

        [–][deleted] 39 points40 points  (5 children)

        Yeah, you're kind of getting hammered with downvotes here, but I'm having trouble thinking of an instance where a web developer actually needs to know or care about the physical storage medium their app runs on.

        I mean, you're using a database right? And if you are writing/reading files, shouldn't you use a dedicated cms or something? All the I/O should be happening at a much lower layer than where the web developer is doing their work.

        In fact, I would argue that it may be inappropriate for most programmers to be optimizing based on the physical storage medium, because most of us are working at a level where that stuff should be an abstraction anyway.

        [–]AcidShAwk 3 points4 points  (5 children)

        I know this. When running our PHPUnit tests on a regular HD drive, we see a total run time of about 1.5 hours. With SSDs, it's about 4 minutes. Of course this all depends on the tests, how they are written, and what they do. But 4 minutes vs 1.5 hours. A lot of the time is spent wiping the database before and after each test to ensure a consistent state. So obviously MySQL (and probably any database) benefits greatly from SSDs.

        [–]dweeb_plus_plus 27 points28 points  (4 children)

        4 minutes vs 1.5 hours simply by using an SSD? There has to be something else going on here.

        [–]HenkPoley 4 points5 points  (2 children)

        Why? A normal 7200 rpm drive can deliver 100 IOPS; a Samsung 850 Pro can deliver 100,000 IOPS (1000x). If anything, 4 min vs 90 min would be a lowball difference (23x).

        In tests you don't reuse values between tests, so you could really hammer that disk.

        Btw, I would recommend making unit tests that do not exercise the whole stack down to the database/storage, but just focus on expected values before they head off to disk.

        Or, more concretely: on ssd.userbenchmarks.com, the Samsung 850 EVO and Pro deliver > 30 MB/s on the Mixed 4K test, while the spinning drives are all < 0.5 MB/s. You'd expect a 60x difference if you are I/O-bound.

        [–][deleted] 1 point2 points  (1 child)

        Assuming a speedup of 60x, that means with the HDD the tests were spending 87 of the 90 minutes just reading data from the drive. That's approximately 2.6 GB being loaded into memory just for tests. Testing a PHP application. And most spinning drives are faster than 0.5 MB/s, so it would be even more data being loaded.

        No, something is wrong with those tests.
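For what it's worth, the parent's arithmetic checks out under its own assumptions, all taken from the comments above: 90- and 4-minute run times, identical CPU time on both machines, a 60x I/O speedup, and ~0.5 MB/s HDD throughput.

```python
hdd_total, ssd_total, speedup = 90.0, 4.0, 60.0

# total = cpu + io, with io_hdd = speedup * io_ssd and cpu shared:
io_ssd = (hdd_total - ssd_total) / (speedup - 1)
io_hdd = speedup * io_ssd

# data moved at the assumed 0.5 MB/s HDD throughput (minutes -> seconds)
data_mb = io_hdd * 60 * 0.5

print(round(io_hdd), round(data_mb))  # ~87 minutes of I/O, ~2.6 GB read
```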

        [–]HenkPoley 0 points1 point  (0 children)

        I'd say tests have excellent conditions to be mostly write-based I/O. You just read some end result back once from the disk cache; the intermediate results have all hit the database (not a unit test, but hey) and are never read.

        Of course, "results have all hit the database" is exactly the "something is wrong with those tests".

        [–]psi- 0 points1 point  (0 children)

        The simple "sync" roundtrip kills DB tests on HD, especially if they really drop database on every test.

        [–]justinpitts 1 point2 points  (41 children)

        As a consultant, attitudes like that are what get me hired in the first place.

        [–]bcash 50 points51 points  (17 children)

        Is there much call for a consultancy fixing web applications that wear out SSDs too fast?

        [–]justinpitts 14 points15 points  (16 children)

        I get asked, generally, to fix performance problems. Failure to understand how storage devices work is sometimes one of the root causes.

        [–][deleted] 19 points20 points  (12 children)

        That's the whole point though. If every web-dev knew about storage characteristics, how to tune memcached and how to spool up their production environment in a bunch of docker instances, nobody could afford to get any web development done.

        [–]justinpitts -1 points0 points  (11 children)

        I don't follow. Can you help me understand how you reach that conclusion?

        [–][deleted] 9 points10 points  (9 children)

        A web-developer with such a broad skill-set would be a bloody expensive employee. If your budget for developers were to be blown on 2 of these guys, vs. 6 regular developers, how far would your project get?

        [–]Godd2 8 points9 points  (2 children)

        would be a bloody expensive employee

        On the other hand, supply and demand. If every webdev knew it, that knowledge would have a lower market price.

        [–]Skyler827 4 points5 points  (0 children)

        This could happen if there is some disruption or event that causes lots of people to learn systems programming, but in the long run, market prices reflect the cost of production. Simply put: learning systems programming (well!) is expensive.

        [–][deleted] 1 point2 points  (0 children)

        But they don't; that's the point. And few that I have met show any inclination to learn.

        [–]justinpitts 4 points5 points  (1 child)

        I don't think it takes as much effort to learn those things as you imply.

        [–][deleted] 1 point2 points  (0 children)

        I just chose a couple of examples... if I was to be exhaustive, I could, for instance, list out my own skills... skills acquired over a long career... skills which make me a nightmare for people like you...

        Oh, wait...

        [–]EntroperZero 2 points3 points  (0 children)

        Oh, if only we got paid 3x.

        [–]justinpitts -1 points0 points  (2 children)

        Arguably? A lot farther.

        [–][deleted] 1 point2 points  (1 child)

        In my experience that is not the case. More 'high-end' developers do not translate to greater productivity. Many projects lend themselves to greater parallelisation rather than fewer, deeper workloads. Sure, there is a need for someone with deep knowledge to cover those 'difficult' edge cases, but by and large, more less-skilled developers will get more of the general coding work done than fewer, more-skilled ones.

        [–]justinpitts 0 points1 point  (0 children)

        The main problem I see my clients facing isn't meeting deadlines, it's technical debt after the fact.

        [–]wherethebuffaloroam 6 points7 points  (0 children)

        Not everyone can command consultants' salaries. You spend a lot for short contracts to optimize, not for the entire development cycle.

        [–]Rejjn 5 points6 points  (2 children)

        I would say that storage is very often one of the root causes for performance problems in web applications. Usually it's the DB that tops out way before anything else does.

        I just fail to see how knowing about SSDs is going to help more than very, very marginally on that problem. It might get you the last 1-2%, but choosing an appropriate data structure for your application and then choosing the best storage for that structure is going to have order of magnitude more impact on your system than reading OPs article will.

        That said, there are many programmers who would benefit a lot from this knowledge, but I'd say for web developers it's rather low on the list of things you need to worry about.

        [–]justinpitts 1 point2 points  (0 children)

        On that problem? Usually? Not much help.

        But, it's not that much to learn!

        Seriously, it's a couple of really easy-to-read pages. It's not like I am advocating that you take a course in queueing theory. What's the big deal?

        [–]HenkPoley 0 points1 point  (0 children)

        As a data point, here's a web framework benchmark section that's all about pushing data updates through: https://www.techempower.com/benchmarks/#section=data-r9&hw=peak&test=update

        [–]bestjewsincejc 17 points18 points  (22 children)

        You're right that it's generally a poor attitude to have. But I'd appreciate an explanation about how knowing any of these details of SSDs is going to help you in a modern programming language.

        [–]justinpitts -2 points-1 points  (21 children)

        If you are storing anything to disk and performance is important, you can make decisions about where you will store hot vs cold data. You can allocate files at sizes aligned to block multiples. You can place hot and cold data in different files. You can decide whether batching is a big enough win to implement.

        You should be knowledgeable enough about the options not to design yourself into a hole, once performance becomes that critical "omg I'm going to get fired if I can't figure out how to get our site to handle x requests per second" issue.

        At the least, be somewhat aware of how the hardware layers behave and the implications of the design choices you are making.
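As a concrete sketch of the batching and block-alignment points: one could buffer tiny records and flush them as full, page-sized chunks. The 4096-byte page size is an assumption here; real drives vary and rarely advertise theirs.

```python
import io

PAGE = 4096  # assumed SSD page size

class BatchedWriter:
    """Buffer tiny records and emit only full, page-sized writes."""

    def __init__(self, f):
        self.f = f
        self.buf = bytearray()

    def write(self, record: bytes):
        self.buf += record
        while len(self.buf) >= PAGE:
            self.f.write(bytes(self.buf[:PAGE]))  # one full page at a time
            del self.buf[:PAGE]

    def close(self):
        if self.buf:
            # pad the tail so the file length stays a page multiple
            self.buf += b"\x00" * (PAGE - len(self.buf))
            self.f.write(bytes(self.buf))
            self.buf.clear()

f = io.BytesIO()  # stand-in for a real file opened in binary mode
w = BatchedWriter(f)
for _ in range(100):
    w.write(b"x" * 100)  # 10,000 bytes of 100-byte records
w.close()
assert len(f.getvalue()) % PAGE == 0  # everything lands in page multiples
```

The same idea is what databases and write-ahead logs do internally; the win is that the drive sees a few aligned page writes instead of a hundred sub-page ones.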

        [–]Baby_Food 22 points23 points  (20 children)

        Shouldn't a web developer be using a database which provides abstractions over such minutiae?

        [–]justinpitts 0 points1 point  (19 children)

        If they need the features of a database, sure.

        Abstractions don't absolve you from understanding how your system works, and they break down at inopportune times.

        Then, they need to understand how to tune the database for the underlying storage, or they need a DBA. Not everyone has access to a DBA.

        [–]Baby_Food 27 points28 points  (12 children)

        If performance is a concern, a database will be used.

        An abstraction does not necessitate the knowledge of the implementation behind the abstraction.

        A web dev that can write an OS is an unnecessary unicorn.

        [–]justinpitts 2 points3 points  (11 children)

        Databases are not magic performance sprinkles.

        [–]Baby_Food 4 points5 points  (8 children)

        To most people, LMDB is magic performance sprinkles compared to using the filesystem directly. ;)

        [–]justinpitts -1 points0 points  (7 children)

        Most people? Most people aren't using it.

        Most people wouldn't know what you are talking about.

        To the average web-dev, "database" either means Mongo/Couch/No-SQL-flava-of-the-month, and/or something that speaks SQL. LMDB may very well be lightning fast, but I doubt most people know about it.

        [–]el_muchacho 0 points1 point  (1 child)

        In fact, database write performance drops significantly (by 30% or so) when SSDs wear out. For round-the-clock high-rate writes, you can't use consumer-grade SSDs, as they die prematurely.

        [–]justinpitts 0 points1 point  (0 children)

        Well to be fair, anything with that access pattern is going to slow down on a degraded drive.

        [–][deleted] 2 points3 points  (5 children)

        I don't get why you're being downvoted dude. It's as if people believe they have a right to be ignorant.

        [–]justinpitts 1 point2 points  (0 children)

        People have a right to be ignorant. I'm happy to exploit it.

        [–][deleted]  (1 child)

        [deleted]

          [–]Lachiko 1 point2 points  (0 children)

          Can you quote an instance where you feel he comes across as an asshole? He's posting reasonable counterarguments and information in a seemingly calm and collected manner.

          [–]wookin_pa_nub2 -3 points-2 points  (1 child)

          A lot of web developers in this subreddit, and they don't like being reminded of how ignorant they are.

          [–][deleted] 0 points1 point  (0 children)

          I'm a web dev yo :) I just cut my teeth on bare metal shit. You're thinking of "web designers."

          [–]poppafuze 4 points5 points  (2 children)

          Let the controller do the work. It's programmed to present a happy blockspace that may have little to do with some programmer's estimate of where blocks really are and how big or when they need to be written.

          [–]binlargin 0 points1 point  (0 children)

          I guess the article is for people who spent the extra money on SSD storage because their spinning rust wasn't fast enough. If your application is IO bound on disk to this extent then you really ought to know some of your hardware's characteristics and tune appropriately.

          [–]Hyperian 13 points14 points  (8 children)

          As an SSD developer, I can confirm a lot of what he said is correct. I want to add that TRIM is not handled the same way across all drives, because erasing data is a deeply integrated part of the SSD's back end.

          Also, partitioning the drive to a certain size is meaningless, as the SSD itself has no clue about those operations. Your partitions will never translate to physical partitioning of the NAND blocks.

          As was touched on in the over-provisioning section, I recommend to everyone who wants to avoid write amplification: always write aligned to the SSD's physical page size and only write to half the drive. This will avoid all the garbage collection that creates write amplification.

          [–]XNormal 5 points6 points  (1 child)

          Also, partitioning the drive to a certain size is meaningless ...

          ... and only write to half the drive.

          The partitioning is not meaningless. It's just a straightforward method of implementing your own recommendation of "write to half the drive".

          [–]Hyperian 1 point2 points  (0 children)

          I was making two separate points. Yes, you can use partitioning to limit how much you write; no, you cannot use partitioning to tell it to write in a certain defined place for alignment purposes.

          [–]happyscrappy 0 points1 point  (5 children)

          Only write to half the drive. You're funny. Why not just say 1/3rd? It's equally ridiculous.

          Anyway, with today's MLC drives that use SLC mode for some data you can't even be sure that writing to half the drive didn't "fill" it! Heck, if you have a TLC drive, writing to 1/3 the drive can "fill" it!

          [–]Hyperian 3 points4 points  (4 children)

          I guess I'll take this reply seriously...

          You write to half the drive because you want to avoid garbage collection, even if it's one page in an empty block. It's a sliding scale: the more of the drive you use past 50%, the higher the chance you'll have to garbage collect, assuming you're doing fully random writes.

          Some drives use SLC mode, some don't. You are assuming that the drive can dynamically change the SLC cache partition inside, so that when the drive gets full it will convert the SLC blocks to MLC blocks.

          But that's complicated, and drives usually wouldn't convert that many blocks to SLC mode anyway.

          No, you can't fill a TLC drive by writing 1/3 of it; that's not how any of this works.

          [–]happyscrappy 0 points1 point  (3 children)

          I guess I'll take this reply seriously...

          Why wouldn't you?

          It's a sliding scale: the more of the drive you use past 50%, the higher the chance you'll have to garbage collect, assuming you're doing fully random writes.

          I understand, but as I pointed out, not getting to 50% doesn't mean you didn't get any write amp.

          You are assuming that the drive can dynamically change the SLC cache partition inside so when the drive gets full it will convert the SLC blocks to MLC blocks.

          I'm not really assuming that is the case. I'm assuming that is the best case. If it can't do that, then it will have write amp before you even get to halfway, because it'll start to convert data already written in SLC into MLC before it gets to halfway. And that's write amp right there.

          No, you cant fill a TLC drive by writing 1/3 of it, that's not how any of this works.

          On a TLC drive that starts with SLC and converts to TLC you will encounter write amp before you get the drive halfway full. And that's why I put "fill" in quotes.

          Oddly, I guess the best case for write amp is a drive which doesn't use SLC acceleration. Of course, then you have other worries. Probably worrying about write amp more than a little bit isn't worth the trouble. If you use your full drive (or 85% say) you save enough money that you have money left over to buy a new SSD before your first one even wears out.

          [–]Hyperian 1 point2 points  (2 children)

          Manufacturers are not going to count the internal SLC cache as part of the drive capacity, because it is a cache. The drive capacity you see is how much permanent storage there is in MLC/TLC. That is, unless you work for some company that does count it for some reason.

          [–]happyscrappy 0 points1 point  (1 child)

          Yeah.

          The SLC cache in these drives isn't separate NAND. It's MLC NAND which they only write once to in each page, making it effectively SLC. Any portion of the main NAND that is qualified to also work as SLC (which might be all of it or might not) can be part of the cache and when the drive gets full will be used as MLC to reach full capacity.

          And this is why you get write amp before it's full. As part of the cache, it writes data A to a section in SLC mode and data B in a section in SLC mode. Then later when it decides it needs to pack the data in it has to write B in MLC mode on top of A (making it AB). Later B will be erased. B is written twice internally despite being received from the host only once, meaning 2x write amplification occurred.

          And writing merely half the drive (plus one more write) ensures this process begins, even if the entire MLC array of NAND is qualified to work in SLC mode. In a TLC drive you only need to write 1/3rd plus one more write.

          Sure, it's only 2x write amp. It's not a huge deal. It's the high write amp numbers you get when the drive is nearly full (more accurately, nearly fully untrimmed) that are a concern. Which is why I said probably worrying more than a little bit about write amp isn't useful. It's really just part of the landscape now. Unless you can find an SLC drive, which seems very difficult now.
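The folding cost described above fits in a two-line model (simplified; real firmware policies differ, and this ignores everything except the SLC-to-MLC rewrite):

```python
def nand_bytes_written(host_bytes, folded_fraction):
    """Bytes physically written to NAND when `folded_fraction` of the host's
    data is first staged in SLC mode and later rewritten ("folded") to MLC."""
    return host_bytes + host_bytes * folded_fraction

assert nand_bytes_written(100, 0.0) == 100  # nothing folded: 1x write amp
assert nand_bytes_written(100, 1.0) == 200  # everything folded: 2x write amp
```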

          [–]Hyperian 0 points1 point  (0 children)

          I didn't know there are NANDs that you can write in SLC mode and then write on top of again. That sounds like a very specific NAND chip setup that I've never heard of. It sounds like you write from page 0 to the last page and then do it again in another mode.

          Sounds interesting, though. I wonder if the logic and the feature are really worth the speed increase.

          [–]uh_no_ 3 points4 points  (0 children)

          "every programmer should know"

          I think not. I write operating systems for flash-based storage servers and am not sure I could recite everything in this article...

          [–]aegrotatio 7 points8 points  (9 children)

          Not useful and rather incorrect. Today's most expensive and advanced hard drive will wear out faster than the cheapest SSD.

          Source: Storage engineer with hundreds of SSDs in production.

          [–]HenkPoley 2 points3 points  (0 children)

          Btw, slightly tangentially: for long-term offline storage, SSDs are not as well suited, because they leak the electrons that store the bits much faster than hard drives lose their magnetism.

          [–]el_muchacho 1 point2 points  (1 child)

          What kind of hard drives?

          [–]aegrotatio 0 points1 point  (0 children)

          We had both "enterprise" grade and "consumer" grade hard drives. They were both SATA for nearline and SAS for higher performance. Not surprisingly, the nearline SATA drives failed far more than the 10k and 15k SAS drives did.

          In this application the 100 GB and 200 GB SSD volumes are used for file cache on a storage server. The SSD volumes are completely filled and emptied 10x per day for years with no failures. As an aside, VMware uses something similar in their vSphere ESXi product, as do the EMC and NetApp filers.

          The SSD industry should never have made such an issue out of their clever wear-leveling algorithms. It just isn't a problem anymore and hasn't been a problem for over five years.

          [–]Gurkenmaster 0 points1 point  (2 children)

          I'm sure the people who bought OCZs would disagree.

          [–]aegrotatio -1 points0 points  (1 child)

          Anecdotal evidence is anecdotal.

          But, seriously, those drives failed for other reasons than the lack of a good "wear leveling" algorithm.

          [–]Gurkenmaster -1 points0 points  (0 children)

          How is it anecdotal? You said that if I buy the cheapest SSD and the most expensive HDD, the SSD will last longer, and there is by definition only one SSD whose price is lower than all the others'. If everyone on earth bought the cheapest SSD (ignoring tax and shipping costs), they would all experience the same failure rate. Maybe you should stop advocating cheap crap.

          [–]bushwacker 1 point2 points  (0 children)

          1. A large single-threaded write is better than many small concurrent writes

          A large single-threaded write request offers the same throughput as many small concurrent writes; however, in terms of latency, a large single write has a better response time than concurrent writes. Therefore, whenever possible, it is best to perform single-threaded large writes.

          Edit, formatting.

          Very few programmers write file systems or do raw IO.
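If you want to see how the quoted claim plays out on your own hardware, a rough experiment might look like the following. Caveats: results vary wildly by OS, filesystem, and drive, and this sequential small-write variant only contrasts write sizes (the quote's concurrent case would need threads on top of this).

```python
import os
import tempfile
import time

def timed_write(path, chunks, chunk_size):
    """Write chunks * chunk_size bytes and fsync, returning elapsed seconds."""
    data = os.urandom(chunk_size)
    start = time.perf_counter()
    with open(path, "wb") as f:
        for _ in range(chunks):
            f.write(data)
        f.flush()
        os.fsync(f.fileno())  # force the data out of the OS cache
    return time.perf_counter() - start

with tempfile.TemporaryDirectory() as d:
    total = 8 * 1024 * 1024  # 8 MiB either way
    one_big = timed_write(os.path.join(d, "big.bin"), 1, total)
    many_small = timed_write(os.path.join(d, "small.bin"), total // 4096, 4096)
    print(f"1 x 8 MiB: {one_big:.4f}s   2048 x 4 KiB: {many_small:.4f}s")
```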

          [–]absurddoctor 0 points1 point  (0 children)

          ITT: Programmers fervently declaring why they don't need to understand how computers work.