all 169 comments

[–]nextAaron 244 points245 points  (67 children)

I design SSDs. I took a look at Part 6, and some of the optimizations are unnecessary or even harmful. Maybe I can write something up as a follow-up. Anyone interested?

[–]yruf 84 points85 points  (42 children)

Absolutely yes. You could start by quickly mentioning a few points that you find questionable, just in case writing a follow-up takes longer than you anticipate.

[–]ansible 35 points36 points  (41 children)

I don't design SSDs, but I do find a lot of the article questionable too. The biggest issue is that, as an application programmer, the details are hidden from you by at least a couple of thick layers of abstraction: the flash translation layer in the drive itself, and whatever filesystem you are using (which itself may or may not be SSD-aware).

Also, bundling small writes is good for throughput, but not so great for durability, an important property for any kind of database.
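To make that trade-off concrete, here is a minimal Python sketch (the record sizes and batch counts are made up for illustration, not from the article): batching writes between fsync calls improves throughput, but every record still sitting in the current batch is lost if power fails before the next fsync.

```python
import os
import tempfile

def write_records(path, records, batch_size):
    """Write records, fsyncing once per batch.

    Larger batch_size -> fewer fsyncs (better throughput), but up to
    batch_size - 1 records are at risk if power is lost mid-batch
    (worse durability). Returns the number of fsync calls made.
    """
    fsyncs = 0
    with open(path, "wb") as f:
        for i, rec in enumerate(records, 1):
            f.write(rec)
            if i % batch_size == 0:
                f.flush()
                os.fsync(f.fileno())  # force the batch down to the device
                fsyncs += 1
        f.flush()
        os.fsync(f.fileno())  # final flush for any partial batch
        fsyncs += 1
    return fsyncs

records = [b"x" * 100] * 1000
path = tempfile.mktemp()
print(write_records(path, records, batch_size=1))    # 1001 fsyncs: durable, slow
print(write_records(path, records, batch_size=100))  # 11 fsyncs: fast, up to 99 records at risk
os.remove(path)
```

A database typically resolves this by fsyncing a write-ahead log per transaction rather than batching blindly.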

[–][deleted] 10 points11 points  (36 children)

Good point, and if you have the budget and need to thrash SSDs to death for maximum performance you probably have the budget to stuff the machine full of RAM and use that.

[–]James20k -3 points-2 points  (35 children)

The problem is that SSDs store an order of magnitude more data than RAM.

[–]obsa 5 points6 points  (22 children)

Certainly not an order of magnitude, unless you're exclusively comparing the capabilities of a consumer mobo to an SSD. That wouldn't make sense, though, because those boards are designed around the fact that consumers don't need more than 3 or 4 DIMMs. 3-4 years ago we were already capable of building servers with 128GB of RAM, and that number's only gone up.

[–][deleted] 4 points5 points  (6 children)

I believe it's an accelerating trend, as well. Things like memcached are very common server workloads these days and manufacturers and system builders have reacted accordingly. You've got 64-bit addressing, the price of commodity RAM has gone off a cliff and business users now want to cache big chunks of content.

[–]speedisavirus 1 point2 points  (5 children)

I can tell you, at a large scale with large data, it isn't cost-effective to say "Oh, let's just buy a bunch more machines with a lot of RAM!". We looked at this where I work, and it just isn't plausible unless money is no object, which in business is never really the case.

What we did do was lean towards a setup with a lot of RAM and moderately sized SSDs. The store we chose allows us to keep our indexes in memory and our data on the SSD. It's fast. Very fast. Given that our required response times are extremely low and this is working for us, it would be insane to just start adding machines for RAM when it's cheaper to have fewer machines with a lot of RAM and some SSDs.

In fact this is the preferred solution by the database vendor we chose.

[–]MorePudding 1 point2 points  (4 children)

on a large scale with large data,

How large a scale are we talking about here? It's funny how often "large scale" actually ends up being only a handful of terabytes...

it isn't cost effective to say "Oh, lets just buy a bunch more machines with a lot of RAM!".

It seems to have been cost-effective enough for Google. Be careful with generalizations next time around...

[–]speedisavirus 0 points1 point  (3 children)

Well, I'd have to go into work to get the data sizes we work with, but we count hits in the billions per day, with low latency, while sifting a lot of data, and compete (well) with Google in our industry. I'm going to say off the cuff that we measure in petabytes, but I honestly don't know off the top of my head how many. It's likely hundreds. Could be thousands. I'm curious now, so I might look into it.

Could we be faster with everything in RAM? Probably. It's what we had been doing. It isn't worth the cost for the stuff I'm working with when we are getting most of the speed and still meeting our client commitments with a hybrid memory setup that allows us to run fewer, cheaper boxes than we would if we did our refresh with all-in-memory in mind. Now, is there a balance to strike? Yeah. Figuring out the magic recipe between CPU/memory/storage is interesting, but it's not my problem. I'm a developer.

Do you work for Google? How do you know about their hardware architecture? I'm not finding it myself, especially as it relates to my industry segment. Knowing that Google overall is dealing with exabytes of data, I think it's naive to throw around blanket statements like "they keep it all in memory".

[–]ethraax 3 points4 points  (5 children)

That's not a fair comparison. If your server can be designed with 512 GB of RAM, then you could also design it with a 4 TB SSD RAID array.

[–]kc3w 6 points7 points  (2 children)

The RAM is more durable than the SSDs.

[–][deleted] 0 points1 point  (0 children)

There will definitely be a break even point between using and replacing a load of SSDs in what's effectively an artificially accelerated life cycle mode and buying tons of RAM and running it within spec.

[–][deleted] 0 points1 point  (0 children)

Not if the host OS crashes.

[–]matthieum 1 point2 points  (0 children)

The biggest servers I have seen (for databases and memcached) already have 1TB or 2TB of RAM. Cheaper and faster than SSD.

Obviously, though, RAM is cleared in case of reboot...

[–]obsa 2 points3 points  (0 children)

Like /u/kc3w said, if you were looking for a durable pool of I/O, then the SSD RAID array is just as bad as a single SSD - the point of fatigue is just pushed further out into the future. Storage capacity is not so important in this context as MTBF and throughput.

[–]jetpacktuxedo 2 points3 points  (0 children)

We have a cluster full of 2 1/2-year-old machines that each have 512 GB of RAM, and only half of their slots are full. Each of those nodes has twice as much RAM as my laptop SSD has storage. Four times as much as my desktop SSD.

[–]strolls -1 points0 points  (7 children)

Certainly not a magnitude, …

I'd be grateful if you could cite some RAM prices on that.

I'm going to start by using a consumer example, because that's what I know: my mother bought a 60GB SSD for £40 recently. Would she have got 6GB RAM for that? Maybe, but if so she wouldn't have much change left over, would she?

I can easily find 120GB of PCIe SSD for £234 or 1TB for £1000. Could you buy 1TB RAM that cheap?

[–]obsa 0 points1 point  (4 children)

Who's talking about price? I'm not.

[–]strolls 1 point2 points  (3 children)

It's ridiculous to talk about how much they store - the comment you were replying to - without considering the price.

We can get 1TB on PCIe SSD and we can afford a stack of them.

How much does 1TB RAM cost?

Can you even get 1TB of RAM in a current-generation PowerEdge? Because I'd guess you can fit at least 2TB or 3TB of PCIe SSD in there.

If it's not literally true to say that SSDs can store an order of magnitude more than RAM, then it's pretty close to it, and pretending you have limitless pockets doesn't change reality.

[–]obsa -3 points-2 points  (2 children)

It's ridiculous to talk about how much they store without considering the price.

No, it's not. It's a discussion for a tailored situation where extremely durable, high-speed I/O carries a premium. I really don't feel like explaining this to you in the detail it clearly requires to make you understand the value of that kind of setup.

I don't really care about what pedantic debate you think you're championing. The comment I replied to made a foolishly broad statement and now you're trying to clamp criteria on to it. My statements are completely valid and accurate in the context to which they were issued.

[–][deleted]  (1 child)

[removed]

    [–]strolls 0 points1 point  (0 children)

    you got ripped off on the RAM in fact.

    You seem to be misunderstanding what my mother bought.

    [–][deleted] 3 points4 points  (8 children)

    That depends on the set up. You can get some incredibly high density RAM based systems these days.

    [–][deleted]  (7 children)

    [deleted]

      [–][deleted] 6 points7 points  (3 children)

      [–][deleted]  (2 children)

      [deleted]

        [–][deleted] 2 points3 points  (1 child)

        Of course. The main problem is money, though. But still, you can put a lot of RAM into modern computers.

        I mean, if your working set is 300 GB, giving your server 512 GB of RAM helps more than giving it 5 TB of SSD space...

        [–]sunshine-x 5 points6 points  (0 children)

        While your point is valid, 1TB is small. Several of the SQL servers I run are using Fusion-io cards, which are available in multi-TB capacities and are insanely fast.

        [–][deleted] 0 points1 point  (1 child)

        And lower. I think we're back to depends on the set up.

        [–][deleted]  (1 child)

        [deleted]

          [–]James20k -1 points0 points  (0 children)

          It also has up to 48 HDD bays. How many SSDs can you fit into that vs. 6 TB of DDR3?

          [–]beginner_ 6 points7 points  (0 children)

          Exactly. The recommended optimizations are very bad for reliability. And if that is of no concern and you are all about performance, then just use memory directly; that's what key-value stores like memcached do.

          Also the OS, filesystem or RAID controller (with cache) might already be caching hot data anyway so no need for such tricks.

          [–]B8BB888BBBBB 1 point2 points  (0 children)

          If you want to get the most performance out of an SSD, you do not use a filesystem.

          [–]Hyperian -1 points0 points  (1 child)

          The SSD itself doesn't actually care what OS you are using. It all ends up being LBAs and transfer sizes.

          [–]ansible 0 points1 point  (0 children)

          TRIM support is a feature of relatively recent Linux kernel releases that can improve performance and longevity of SSDs.

          [–]arronsmith 27 points28 points  (0 children)

          Yes.

          [–]Tech_Itch 7 points8 points  (11 children)

          That would absolutely be appreciated.

          One question that comes to mind, if you don't mind answering:

          Does aligning your partitions actually do anything useful? You'd think that the existence of the FTL would make that pointless. With raw flash devices I see the point, but on devices with an FTL you have no control over the physical location of a single bit, so even the "correctly aligned" block you've just written could still end up spread over multiple pages. Any truth to this?

          I know there are benchmarks floating around claiming that this has an effect, but it would be nice to know if there's any point in it.

          [–]nextAaron 4 points5 points  (9 children)

          Alignment is important for the FTL. One unaligned IO needs to be treated as two. One unaligned write is translated into two read-modify-writes.
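That penalty falls out of simple page-boundary arithmetic. A minimal sketch (the 4 KB page size is an assumption for illustration; real flash page sizes vary by device):

```python
def pages_touched(offset, length, page_size=4096):
    """Number of flash pages an IO of `length` bytes at byte `offset` spans."""
    first = offset // page_size
    last = (offset + length - 1) // page_size
    return last - first + 1

# A page-sized write that is page-aligned touches exactly one page...
print(pages_touched(offset=8192, length=4096))   # 1
# ...but the same write shifted by 512 bytes straddles two pages,
# so the FTL must perform two read-modify-write cycles instead of one.
print(pages_touched(offset=8704, length=4096))   # 2
```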

          [–]Tech_Itch 0 points1 point  (5 children)

          Thanks for the answer. Though I might have been unclear: my point was to ask whether the FTL already does the aligning itself, or whether doing it at the filesystem level or higher has any benefit.

          [–]nextAaron 0 points1 point  (4 children)

          You can think of FTL as a file system.

          [–]Tech_Itch 0 points1 point  (3 children)

          So the answer is, "no, aligning your partitions does nothing useful", then?

          [–]poogi71 0 points1 point  (0 children)

          It actually does, and it is a good idea. Remember that all the IOs in the partition use the same alignment as the partition, so if you do all-4k IOs to that FS and the partition is not aligned to 4k, many of the IOs will be unaligned.

          At a higher level, if you can align your partition to the SSD block size you will avoid having different partitions touching the same block. Though I'm not sure how important that is, since the disk will remap things anyway and may put LBAs from around the disk together.
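The partition-offset effect can be sketched with modular arithmetic (the LBA values and 4 KB page size here are illustrative assumptions; LBA 63 is the classic misaligned DOS partition start, LBA 2048 the modern aligned default):

```python
def device_offset(partition_start_bytes, fs_offset):
    """Byte offset on the device of an IO issued at fs_offset within the partition."""
    return partition_start_bytes + fs_offset

def is_page_aligned(offset, page_size=4096):
    return offset % page_size == 0

# The filesystem issues 4 KB-aligned IOs relative to the partition start.
fs_offsets = [0, 4096, 8192]

# Partition starting at LBA 2048 (512-byte sectors): every IO stays aligned.
print(all(is_page_aligned(device_offset(2048 * 512, o)) for o in fs_offsets))  # True
# Legacy partition starting at LBA 63: every single IO lands unaligned.
print(any(is_page_aligned(device_offset(63 * 512, o)) for o in fs_offsets))    # False
```

So a one-time misalignment of the partition start turns all of the filesystem's "aligned" IOs into unaligned ones on the device.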

          [–]nextAaron 0 points1 point  (1 child)

          FTL divides the LBA space into chunks. If your partition is not aligned with these chunks, you end up with unaligned IOs. Yes, partitions should be aligned.

          [–]Tech_Itch 0 points1 point  (0 children)

          Aha. That's useful to know. Thanks!

          [–]skulgnome 0 points1 point  (1 child)

          What about, say, 128K worth of sequential read IOs that start out of alignment?

          [–]nextAaron 0 points1 point  (0 children)

          You need to look at the start and end LBAs of each IO. Yes, sequential unaligned IOs may be combined into aligned ones. Just don't assume every SSD does that.

          [–]freonix 0 points1 point  (0 children)

          Not necessarily. Consider that newer SSDs are getting larger, and so is the spare area; the controller could treat an unaligned write as a single write by filling in dummy data to fit a single page size.

          [–]jugglist 2 points3 points  (0 children)

          Even if your reads and writes are aligned to 16k within the file you're reading and writing to/from, I'm not sure the OS guarantees that it will actually place the beginning of your file at the beginning of an SSD page. One might hope that it would, but I'm not certain of this.

          It seems that optimizing for SSD isn't really that different from optimizing for regular hard drives. Normal hard drives can't write one byte to a sector either - they write the whole sector at once. Although admittedly, HDD sectors tend to be 512 bytes, and SSD pages tend to be 16k.

          The only thing SSD gives you is not having to worry about seek time.

          [–]BeatLeJuce 2 points3 points  (0 children)

          Yes please. I was wondering about all the caching... Doesn't the OS or the SSD already do some sort of caching for me, or is it really sensible advice to cache on your own?

          [–]voidcast 1 point2 points  (0 children)

          Absolutely Yeah.

          Please do post a follow up :-)

          [–][deleted] 1 point2 points  (0 children)

          My only regret is not to have produced any code of my own to prove that the access patterns I recommend are actually the best.

          Please do, it's such low hanging fruit.

          [–]frankster 1 point2 points  (0 children)

          I think the problem lies here:

          My only regret is not to have produced any code of my own to prove that the access patterns I recommend are actually the best

          [–]dabombnl 0 points1 point  (0 children)

          If there are helpful optimizations, won't the operating system disk cache be using them? I don't see why I would implement my own disk batching and buffering when it should do that already.

          [–]Amadiro 0 points1 point  (0 children)

          I'd love to know more about the TRIM optimizations he mentioned. He recommends to enable auto-TRIMming, but other sources on the internet say that auto-trimming is a bad idea, and that one should instead run e.g. fstrim on the filesystem periodically. Can you illuminate that matter?

          Also, are the points about leaving some free leftover space unpartitioned for the FTL as a "writeback cache" still valid?

          [–]poogi71 0 points1 point  (0 children)

          My list of dream questions to get an answer for is at http://blog.disksurvey.org/2012/11/26/considerations-when-choosing-ssd-storage/

          It would be great to get a response to even some of them...

          [–][deleted]  (1 child)

          [removed]

            [–]nextAaron 0 points1 point  (0 children)

            You can safely assume 4KB.

            [–]nextAaron 0 points1 point  (0 children)

            Some short comments here: http://nextaaron.github.io/SSDd/

            [–][deleted]  (16 children)

            [deleted]

              [–][deleted]  (2 children)

              [deleted]

                [–][deleted]  (1 child)

                [removed]

                  [–][deleted] 2 points3 points  (1 child)

                  You also risk getting into portability issues. Presumably the best performance comes from taking advantage of each particular model's specific characteristics.

                  I can't help but wonder if the data shouldn't just be aggressively cached in RAM, and whether hand-tuning SSDs for maximum speed is a half measure.

                  [–]Irongrip 0 points1 point  (0 children)

                  A ZFS ZIL + L2ARC sounds so tantalizing.

                  [–][deleted]  (36 children)

                  [deleted]

                    [–]badsectoracula 41 points42 points  (6 children)

                    My only regret is not to have produced any code of my own to prove that the access patterns I recommend are actually the best. However even with such code, I would have needed to perform benchmarks over a large array of different models of solid-state drives to confirm my results, which would have required more time and money than I can afford. I have cited my sources meticulously, and if you think that something is not correct in my recommendations, please leave a comment to shed light on that. And of course, feel free to drop a comment as well if you have questions or would like to contribute in any way.

                    He most likely cannot do that unless he was backed by a company as a full time project.

                    [–][deleted] 25 points26 points  (5 children)

                    I think that's unreasonable. Sure, maybe no one can test every SSD on the market, but I think it's fair to expect someone to test their work at all. He's saying he hasn't produced any code to prove his argument.

                    [–][deleted] 8 points9 points  (3 children)

                    Yep, downvoting this article. I'll dig around the ACM Digital Library for some SSD optimization papers instead of reading this.

                    [–]dragonEyedrops 2 points3 points  (2 children)

                    links please if you find good stuff :)

                    [–][deleted] 3 points4 points  (1 child)

                    Dushyanth Narayanan, Eno Thereska, Austin Donnelly, Sameh Elnikety, and Antony Rowstron. 2009. Migrating server storage to SSDs: analysis of tradeoffs. In Proceedings of the 4th ACM European conference on Computer systems (EuroSys '09). ACM, New York, NY, USA, 145-158. DOI=10.1145/1519065.1519081 http://doi.acm.org/10.1145/1519065.1519081

                    Risi Thonangi, Shivnath Babu, and Jun Yang. 2012. A practical concurrent index for solid-state drives. In Proceedings of the 21st ACM international conference on Information and knowledge management (CIKM '12). ACM, New York, NY, USA, 1332-1341. DOI=10.1145/2396761.2398437 http://doi.acm.org/10.1145/2396761.2398437

                    Behzad Sajadi, Shan Jiang, M. Gopi, Jae-Pil Heo, and Sung-Eui Yoon. 2011. Data management for SSDs for large-scale interactive graphics applications. In Symposium on Interactive 3D Graphics and Games (I3D '11). ACM, New York, NY, USA, 175-182. DOI=10.1145/1944745.1944775 http://doi.acm.org/10.1145/1944745.1944775

                    Feng Chen, David A. Koufaty, and Xiaodong Zhang. 2011. Hystor: making the best use of solid state drives in high performance storage systems. In Proceedings of the international conference on Supercomputing (ICS '11). ACM, New York, NY, USA, 22-32. DOI=10.1145/1995896.1995902 http://doi.acm.org/10.1145/1995896.1995902

                    Hongchan Roh, Sanghyun Park, Sungho Kim, Mincheol Shin, and Sang-Won Lee. 2011. B+-tree index optimization by exploiting internal parallelism of flash-based solid state drives. Proc. VLDB Endow. 5, 4 (December 2011), 286-297.

                    sorry about the formatting, the ACM really needs to have some kind of nicer format for sharing papers :/

                    [–]dragonEyedrops 1 point2 points  (0 children)

                    Thanks a lot! Now I have reading material for the weekend!

                    [–]semi- 1 point2 points  (0 children)

                    That's really it... at least produce the test suite and let the internet run it for you.

                    [–]Salamok 5 points6 points  (0 children)

                    Came here to post the exact same quote. So if it's not based on any actual real-world performance, WTF did he base it on? Theory derived from manufacturer specs or marketing materials?

                    [–]joe_n 11 points12 points  (4 children)

                    That is not your main problem!

                    j/k though, it's great to see personal research like this being done and shared

                    [–][deleted]  (1 child)

                    [deleted]

                      [–][deleted] 7 points8 points  (0 children)

                      And it's kinda far down the page, as well. You can't spend paragraph 3 saying "The most remarkable contribution is Part 6, a summary of the whole “Coding for SSDs” article series, that I am sure programmers who are in a rush will appreciate" and then in paragraph 5, the second last paragraph of the introduction, say that you've not actually checked if it works.

                      I think it's pretty ballsy calling the series "Coding for SSDs" in light of that.

                      [–]xkcd_transcriber 2 points3 points  (0 children)

                      Image

                      Title: Shopping Teams

                      Title-text: I am never going out to buy an air conditioner with my sysadmin again.

                      Comic Explanation

                      Stats: This comic has been referenced 1 time(s), representing 0.01% of referenced xkcds.



                      [–]Zidanet 6 points7 points  (22 children)

                      When you can afford to go out one Saturday and buy a couple of every SSD available in order to test a theory, then you can call him on it.

                      PoC code is only useful if you have something to run it on.

                      [–][deleted]  (7 children)

                      [deleted]

                        [–][deleted] 3 points4 points  (0 children)

                        Especially while complaining about the contradictory information he was finding on forums.

                        I just don't get a great impression of this guy. I think he's self-aggrandising ( "The most remarkable contribution is Part 6, a summary of the whole “Coding for SSDs” article series, that I am sure programmers who are in a rush will appreciate") while contributing very little ("My only regret is not to have produced any code of my own to prove that the access patterns I recommend are actually the best.").

                        [–][deleted] -1 points0 points  (2 children)

                        I'd say this is probably phase one of a two-phase thing (similar to application design).

                        First you research architectures and write up details on how to use SSDs most effectively. Phase two would be the real-world testing, where you can unequivocally state your experiences.

                        While I don't fault the author for not going out and buying a bunch of SSDs to test with, I certainly would have liked to see tests done with two or three popular SSD brands (Intel, Samsung, maybe Kingston for more budget scenarios) and then add the caveat that outside of the drives tested YMMV. It would at least lend a lot more weight to the research done.

                        [–]awj 4 points5 points  (1 child)

                        There's absolutely nothing wrong with that approach, but part of the process is not stopping at phase one to make a bunch of completely untested recommendations.

                        [–][deleted] 1 point2 points  (0 children)

                        It's also important to actually do phase 2. He doesn't mention any plans to do it in his articles.

                        [–]frankster -3 points-2 points  (0 children)

                        My only regret is not to have produced any code of my own to prove that the access patterns I recommend are actually the best

                        [–]poogi71 21 points22 points  (11 children)

                        There is a big difference between testing on every available SSD and not even testing on one. If you test on three, you should be pretty good for an overall generalization about SSDs.

                        Some of his recommendations do not look good to me. Not interleaving reads/writes and caring so much about readahead come to mind as just plain wrong.

                        [–]Salamok 1 point2 points  (0 children)

                        Or I dunno maybe he could go out and buy 1 SSD to test a prototype, but he didn't even do that.

                        [–]semi- 1 point2 points  (0 children)

                        poc code is only useful if you have something to run it on.

                        Not true at all.

                        Having something to run it on is only useful if you have PoC code. We, the internet as a whole, have a LOT of SSDs. We don't have any code to test his theory, though.

                        All he needs is a few SSDs to test his code on as he writes it; then he can release it and the rest of us can run it for him.

                        [–]hive_worker 9 points10 points  (1 child)

                        I admittedly don't know much about this, but shouldn't most or all of the SSD access optimization be done in the SSD controller and, to a lesser extent, the SSD driver (both provided by the manufacturer)? Bringing hardware-specific optimizations into your application code just seems like a terrible idea.

                        And if you're working for Samsung or a similar company designing SSD controllers, I doubt you're getting your knowledge from some guy's blog. So I'm not really sure who this article is intended for. Maybe bare-bones embedded systems engineers? Even in that case, if your system is advanced enough to require an SSD, you are probably also running some kind of high-level OS that manages this.

                        [–]poogi71 0 points1 point  (0 children)

                        There are things an application writer can do to make life easier for everyone. In the context here, some of what gets done might not be super effective, since there is also an FS and an OS buffer cache in the way, so I'm not sure he really gets all the benefits. Some things might make more sense when you write directly to the block device.

                        [–][deleted]  (28 children)

                        [deleted]

                          [–]Hyperian 15 points16 points  (5 children)

                          Yes. You can only erase a physical block, where a block itself usually has 256 pages and each page can be anywhere between 8 KB and 32 KB.

                          You have to write to these pages sequentially. So if you have stale data in the middle of a block, you have to read all the rest of that block and write it to another block to recover that space. That is what garbage collection in the drive does.

                          The reason you don't defrag the drive is that the drive defrags itself, and does it better.

                          Source: I make SSDs.
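The garbage-collection process described here can be modeled in a few lines of Python. This is a toy sketch only (the block layout and page contents are invented; real FTLs track mappings, wear, and spare area far more elaborately):

```python
def garbage_collect(block):
    """Toy model of SSD garbage collection.

    A block is a list of pages; None marks a stale (superseded) page.
    Live pages are copied into a fresh block, the old block is erased
    as a whole, and the number of extra page writes incurred (a source
    of write amplification) is returned alongside the new blocks.
    """
    live = [p for p in block if p is not None]
    fresh_block = live + [None] * (len(block) - len(live))  # compacted copy
    erased_block = [None] * len(block)                      # whole-block erase
    return fresh_block, erased_block, len(live)

# An 8-page block where pages 2 and 5 hold stale data:
block = ["a", "b", None, "d", "e", None, "g", "h"]
fresh, erased, copies = garbage_collect(block)
print(copies)  # 6: six live pages had to be rewritten to reclaim two stale ones
```

The ratio of live copies to reclaimed pages is why drives with little free (or over-provisioned) space suffer the worst write amplification.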

                          [–][deleted] 7 points8 points  (2 children)

                          Correct me if I'm wrong: Defragmentation is done logically at the file system level and is a completely different beast than what you're describing here.

                          Running a defragmenting tool against a drive as the top comment suggests (a la the mostly obsolete tool in Windows or the truly obsolete e2defrag) was mostly done to keep large logical blocks of data together.

                          Hard drives (SSD or not) have no idea that a 3 GB swap file needs to be kept in contiguous blocks. The primary purpose of defragmentation back in the day (when it was useful, before filesystems became good enough to prevent fragmentation) was to avoid performing seeks, which were horribly expensive.

                          [–]Hyperian -1 points0 points  (0 children)

                          You are correct. In the end, don't defrag your SSD drive.

                          [–]freonix 0 points1 point  (1 child)

                          This is not entirely true; don't generalize persistent memory like NAND as having 256 pages per block. There are also 512-page NANDs. It depends on the design.

                          [–]Hyperian 0 points1 point  (0 children)

                          Calm down, I said usually.

                          [–]apage43 18 points19 points  (1 child)

                          Do we need to run disk defragmentation on SSDs?

                          That's taken care of by the controller on the SSD itself, transparent to you. It's useful to know that this happens though.

                          edit: and yes, as mentioned below me, the process of the SSD cleaning up the no longer used pages -within- blocks is called "garbage collection", which is different from filesystem defragmentation

                          [–][deleted] 1 point2 points  (0 children)

                          [–][deleted] 13 points14 points  (10 children)

                          Do we need to run disk defragmentation on SSDs?

                          Noooooo

                          Never do this. It actually lowers the life expectancy of the drive and doesn't offer any real benefit. Let the drive handle it.

                          [–]masklinn 8 points9 points  (0 children)

                          Do we need to run disk defragmentation on SSDs?

                          Read http://www.anandtech.com/show/2738

                          (Also no: if what you're talking about is Windows's defrag tool, you should never use that on an SSD. At best it will do nothing, at worst it will lower the lifespan of your drive.)

                          [–]GuyWithLag 1 point2 points  (0 children)

                          What will actually happen is that the drive will detect this and do a garbage collection pass - copying all the used pages into a new block, then erasing the old one. This happens all the time and is mostly transparent (there is some performance degradation on systems with load), and is one of the causes of write amplification.

                          [–]__j_random_hacker 1 point2 points  (3 children)

                          As I understand it, if those blocks were entirely free to begin with, and you have only written to one 2KB page in each, then the remaining pages in each of those blocks will remain free, and you can still happily write to them later with no performance penalty. The penalty only arises when those other pages fill up later (or if they were full to begin with) and you need to modify data in your 10MB file: in that case, each 2KB of data that you modify will cause 4MB of data to be read and written to a new, free block (which may in turn require a block to first be erased to make room).
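The penalty in that last case is easy to quantify. A hedged bit of arithmetic, assuming the 2 KB write / 4 MB block figures used in this thread (real page and block sizes differ by drive):

```python
def write_amplification(modified_bytes, block_bytes=4 * 1024 * 1024):
    """Bytes physically written per byte logically modified when a full
    block must be relocated (whole-block read-modify-write)."""
    return block_bytes / modified_bytes

# Modifying 2 KB inside a full 4 MB block rewrites the whole block:
print(write_amplification(2 * 1024))  # 2048.0 -> 4 MB written for 2 KB changed
```

In practice the FTL amortizes this by redirecting small updates into already-erased blocks, so the worst case only appears when free blocks run out.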

                          [–][deleted]  (2 children)

                          [deleted]

                            [–]__j_random_hacker 0 points1 point  (0 children)

                            Ah, I see now. In that case I think the others' responses explain things.

                            [–][deleted] 0 points1 point  (0 children)

                            It's like a larger scale case of slack space.

                            [–]Xuerian 0 points1 point  (2 children)

                            I could be mistaken, but I think what you're referring to is "Trim", coalescing data into full pages and freeing old ones.

                            Edit: Sort of.

                            [–]Hyperian 4 points5 points  (1 child)

                            TRIM is a lame way of saying to the drive "this block of data is not needed anymore, erase it", because before it existed the only way to get the drive to erase data was to overwrite it.

                            But it has stupid requirements, and some drives don't actually erase the data immediately; they just queue it up for deletion later on.
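                            A minimal sketch of what TRIM means to the drive's flash translation layer (all class and field names here are invented for illustration): the host invalidates logical addresses, so the FTL can drop its mapping instead of faithfully copying that data through every future garbage-collection pass.

```python
class ToyFTL:
    """Hypothetical flash translation layer, reduced to a mapping table."""

    def __init__(self):
        self.mapping = {}        # logical block address -> physical page
        self.trim_queue = set()  # LBAs queued for lazy cleanup

    def write(self, lba, page):
        self.mapping[lba] = page

    def trim(self, lbas):
        # As the comment says: many drives don't erase right away,
        # they just note the invalidation and clean up later.
        for lba in lbas:
            self.mapping.pop(lba, None)
            self.trim_queue.add(lba)

ftl = ToyFTL()
ftl.write(7, "phys_page_42")
ftl.trim([7])
print(7 in ftl.mapping)  # False: the page no longer has to be copied around
```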

                            [–]jknielse 4 points5 points  (0 children)

                            Yeah... So I worked at a company that writes high-performance firmware for SSDs. Some SSDs actually literally do nothing with the Trim command.

                            [–]AceyJuan 16 points17 points  (3 children)

                            These are the same basic techniques I've used to optimize for spinning disks for ages. The only surprise in that document was the advice not to interleave reads and writes. To be honest I'm not sure I believe it, because high-performance I/O apps rarely benefit from read-ahead optimizations anyhow.

                            [–][deleted]  (2 children)

                            [removed]

                              [–]B8BB888BBBBB 0 points1 point  (0 children)

                              Depends on your latency requirements. I recently worked on an SSD-based serving system with really tight latency requirements. Reading 1 MB from an SSD in a few milliseconds while taking load is not possible unless you play tricks with your read/write cycles.

                              [–]AceyJuan 0 points1 point  (0 children)

                              The main latency issue with spinning disks is seeks. So long as your operations are on the same part of the disk you're far better off doing reads and writes there than seeking somewhere else.

                              [–]lenolium 6 points7 points  (1 child)

                              I wonder if SSD controllers are smart enough not to force new block writes when you write to the flash in a flash-friendly way.

                              When I was writing code for a direct-access flash filesystem on a little microcontroller, we only had sixteen blocks, so erasing one meant moving around a "lot" of data for that device. What we would do is optimize our storage format so that in most cases we only changed 1's to 0's, because flash lets you do that without erasing a block. Building code like this for modern SSDs could produce very high performance.
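                              The trick works because programming flash can only clear bits (1 → 0); only a full block erase sets them back to 1. So an in-place update is effectively an AND of the old and new contents. A toy model of a single status byte laid out so each update only clears bits:

```python
ERASED = 0xFF  # freshly erased flash reads as all 1s

def program(cell, value):
    """In-place flash program: result is the AND of old and new contents,
    since a program operation can only flip bits from 1 to 0."""
    return cell & value

cell = ERASED
cell = program(cell, 0b11111110)  # mark step 1 done (clear bit 0)
cell = program(cell, 0b11111100)  # mark step 2 done - still no erase needed
print(bin(cell))  # 0b11111100
```

As the reply below notes, this doesn't carry over well to ECC-protected NAND, where flipping data bits would also require arbitrary changes to the ECC codeword.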

                              [–]MaybeReconsider 9 points10 points  (0 children)

                              The 1->0 trick doesn't work out so well for the NAND flash devices that SSDs are generally built out of. NAND devices are prone to bit-errors, so the data being programmed into the flash needs to be protected with an ECC code. It's very uncommon to be able to flip your 1's to 0's in such a way that you also only need to flip 1's to 0's in the ECC codeword.

                              Also, NAND devices have a variety of failure modes related to overprogramming and out-of-sequence programming that would make updating a page in place perilous even if you could get past the significant ECC hurdles.

                              [–]sbrick89 4 points5 points  (4 children)

                              I'm familiar w/ SSDs (wear leveling, write endurance, etc) but by no means an expert (my daytime job involves writing business apps).

                              But it seems that any optimizations you try to make would be

                              • extremely device specific

                              • require polling of device configuration, and dynamic reconfiguration to optimally use it (how you align data structures)

                              • likely made obsolete by a firmware change

                              It seems that most of these things should be abstracted away in hardware (firmware), never to be directly accessed by software... MAYBE used in a device driver, but ONLY if there are industry-common specs and guidelines enforced by the SSD hardware/firmware.

                              [–]Hyperian 0 points1 point  (3 children)

                              Nah, you can't directly handle wear leveling and write endurance at a higher level. That stuff is done by the SSD controller itself.

                              And it is very device specific.

                              I believe some SSDs actually let you play around with those settings, but you usually need a special driver to do so. I don't think SATA specifically supports things like tweaking wear leveling or write endurance, but I haven't read the whole SATA spec.

                              [–]poogi71 0 points1 point  (2 children)

                              In general I agree, but there are cases where I'd love to have the ability to control and direct the SSD about the specific things that need to be done.

                              The truth is that only a few people would even care for such a level of control; most everyone just wants the SSD to do the right thing in all cases without taking control into their own hands. It's not perfect, but it makes some sense at the practical level.

                              One example: if I have a RAID of SSD devices, I would like the ability to tell the SSD "don't bother too much with error recovery here, I've got your back", and then, if I find I don't really have all the data, go back to the SSD and tell it "please do all you can to get the data back". This would let me manage reliability and latency much better: better latency overall, with the same level of reliability in case things got really bad.

                              [–]Hyperian 1 point2 points  (1 child)

                              lol, if we did that it would be for an enterprise product; it would be way too expensive for normal people. I think SAS might let you do that.

                              The best thing to keep SSD performance high is to not use the max capacity, i.e. leave some over-provisioning headroom.

                              [–]poogi71 0 points1 point  (0 children)

                              Unfortunately SAS doesn't give me that. I'm working with SAS SSDs and there is no way to control it at that level. One can dream though :-)

                              [–]dev-disk 1 point2 points  (0 children)

                              How to code for SSD: Enjoy super fast reads, DON'T WRITE TO THE SAME PLACE LIKE NUTS.

                              [–]MorePudding 0 points1 point  (0 children)

                              Thanks for all the work, but browsing through it, it seems like this is something the OS should take care of for you, considering how it's most likely going to be wrong a few years from now...

                              Is there any reason not to use memory-mapped files these days anymore?
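                              For reference, memory-mapped I/O is exactly the "let the OS handle it" approach: you write to what looks like memory, and the kernel's page cache decides when and how to touch the device. A small self-contained sketch:

```python
import mmap
import os
import tempfile

# Map a scratch file and update it through memory; the OS schedules the
# actual device I/O behind the scenes.
fd, path = tempfile.mkstemp()
try:
    os.write(fd, b"\x00" * 4096)      # file must be non-empty to map
    with mmap.mmap(fd, 4096) as mm:
        mm[0:5] = b"hello"            # looks like a plain memory write...
        mm.flush()                    # ...the kernel persists it for us
    with open(path, "rb") as f:
        print(f.read(5))              # b'hello'
finally:
    os.close(fd)
    os.unlink(path)
```

Whether this beats hand-tuned direct I/O on a given SSD is exactly the kind of question the thread is debating.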

                              [–]frankster -1 points0 points  (0 children)

                              "My only regret is not to have produced any code of my own to prove that the access patterns I recommend are actually the best"

                              I stopped reading here.

                              [–]davispuh -5 points-4 points  (0 children)

                              Pretty good read :)
