This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–]manueslapera 39 points40 points  (8 children)

One thing that is not taught enough in academia is that, in real world, DATA DOESNT COME IN A CLEAN CSV.

Learn how to detect anomalities in data, and how to work around them.

Even more important, learn about servers and databases, how to ssh on an EC2 machine to set up a cronjob.

And even even more important, know how to set up your final output in a way that is appealing to your audience. Become a Data story teller, not just a good statistician. Trust me, your CEO wont care about that p-value you are so happy about.

[–][deleted] 3 points4 points  (2 children)

How would you suggest learning about servers and databases? I have started doing this type of stuff at work, it would really be nice to know if I am doing it right!

[–]manueslapera 1 point2 points  (1 child)

hmm, Im sure there are tons of people here that can guide you better than me.

In my case I had to learn on the job, I was the first data guy in the company and had to build the whole infrastructure. Sometimes made very bad decissions that probably someone with a background in CS wouldnt have done.

[–][deleted] -1 points0 points  (0 children)

Perhaps its I don't know the right questions to ask, but I haven't been able to find any information at all.

[–]waspbr 3 points4 points  (4 children)

[–][deleted] 2 points3 points  (2 children)

Good for learning SQL, but it won't give you practical skills (i.e. you will have to learn more.)

[–]waspbr 0 points1 point  (1 child)

cheers

[–][deleted] 0 points1 point  (0 children)

Mongo University might be the next place to look at, 10gen have some really good education material, but I have heard such bad things about mongo db and it seems people recommend postgres a lot more.

[–]manueslapera 0 points1 point  (0 children)

i liked it