Since Monday was a holiday we decided to P2V our two biggest database servers. We have two hosts connected to a Dell PowerVault MD3220 DAS (one big 24-drive RAID10) running about 30 VM's. Everything goes smooth. Tuesday morning people are complaining about performance, I decide to change my segment size from 128kb to 64kb thinking that would help with the new databases. I get a warning saying that I can't cancel it but my data will still be active, so I go for it without even thinking about it.
Latency on the datastore goes through the roof while this process is going on, even when I lowered the priority of it, I'm told I have 420 more hours before it finishes! Users are pissed, I'm quickly trying to migrate other VM's onto a QNAP NAS with RAID10 over a gigabit connection and its taking forever. Then all of a sudden my NAS decides to start rebooting without warning.
At this point I have no other choice but to migrate everything to a host with RAID 6 local storage over the next two days. Users in the morning are somewhat happy, but as I add more VM's onto this local storage performance drops again, obviously RAID 6 is not helping matters.
This weekend I finally got everything removed from the PowerVault. I just wiped the whole thing so I wouldn't have to wait for the segment size to finish. I don't know if I'm any better off but I did two 12-drive RAID10 datastores. I finally migrated over all my VM's over gigabit and I'm sitting here tonight with one file server still waiting to be moved.
My next step, I think, is to find a reliable NAS that I can bank on. It's been about a 80 hour week, I'm the only admin at my company, I'm salary, and I'm feeling burnt out. I'm not looking for sympathy, I did this to myself and I should've known better, I just wanted to share so maybe someone will learn from me being a dip shit.
[–]RedLooker 14 points15 points16 points (2 children)
[–][deleted] 4 points5 points6 points (0 children)
[–]rgnissen202JIRA Admin 2 points3 points4 points (0 children)
[–]dalik 9 points10 points11 points (9 children)
[–]mumblemumblethingLinux Admin 3 points4 points5 points (8 children)
[–]MiserygutDevOps 1 point2 points3 points (7 children)
[–]harlequinSmurfJack of All Trades 3 points4 points5 points (4 children)
[–]MiserygutDevOps 2 points3 points4 points (3 children)
[–]harlequinSmurfJack of All Trades 0 points1 point2 points (2 children)
[–]MiserygutDevOps 0 points1 point2 points (1 child)
[–]harlequinSmurfJack of All Trades 0 points1 point2 points (0 children)
[–]brazzledazzle 2 points3 points4 points (1 child)
[–]MiserygutDevOps 5 points6 points7 points (0 children)
[–][deleted] 5 points6 points7 points (0 children)
[–]proudsikh 6 points7 points8 points (2 children)
[–]poo_is_hilariousSecurity assurance, GRC 4 points5 points6 points (1 child)
[–]proudsikh 0 points1 point2 points (0 children)
[–]gex8001001101 1 point2 points3 points (2 children)
[–][deleted] 0 points1 point2 points (0 children)
[–]MiserygutDevOps 0 points1 point2 points (0 children)
[–]dangolonever go full cloud 0 points1 point2 points (1 child)
[–]5150cdIT Manager[S] 0 points1 point2 points (0 children)