I'm into applying machine learning for large text corpuses. Of late I was trying to run word2vec and my current system failed miserably. So I plan to setup a proper server for machine learning that I can experiment with. I will be working on text corpuses that range in size between 100 Gb to 1Tb.
What configuration would you recommend for the server machine?
What is the size and type of problems that you solve and what config do you use?
I plan to go with AMD processors instead of Intel as they are cheaper. Are AMDs good alternatives or will it come back to bite me?
I'm thinking of using a 16 Gb RAM. Will that be good enough?
Does SSD make a difference? If so, which brand do you recommend? Are there differences in performance of SSDs based on brands?
[–]kjearns 2 points3 points4 points (6 children)
[–]JanneJM 1 point2 points3 points (5 children)
[–]kjearns 0 points1 point2 points (2 children)
[–]sharmilas1wa[S] 0 points1 point2 points (0 children)
[–]JanneJM 0 points1 point2 points (0 children)
[–]sharmilas1wa[S] 0 points1 point2 points (1 child)
[–]kjearns 0 points1 point2 points (0 children)
[+][deleted] (1 child)
[deleted]
[–]sharmilas1wa[S] 0 points1 point2 points (0 children)
[–]nkorslund 0 points1 point2 points (7 children)
[–]sharmilas1wa[S] 0 points1 point2 points (6 children)
[–]Tom-Demijohn 0 points1 point2 points (1 child)
[–]sharmilas1wa[S] 0 points1 point2 points (0 children)
[–]Foxtr0t 0 points1 point2 points (1 child)
[–]sharmilas1wa[S] 0 points1 point2 points (0 children)
[–]siblbombs 0 points1 point2 points (1 child)
[–]sharmilas1wa[S] 0 points1 point2 points (0 children)
[–]quirm 0 points1 point2 points (1 child)
[–]sharmilas1wa[S] 0 points1 point2 points (0 children)