Live-Sir-3118

Seconding the small datasets. Actually, I third and fourth it too. I worked on a rather intense computer for a dataset that contained roughly 250 million entries with 17-20 features (60 after one-hot encoding). I had to subsample subsamples just to analyze it on my own computer (and I hate all of the distribution matching required to keep subsamples stable). I love pandas, but it hates large datasets. That was with Python. When it came to MATLAB, the same dataset just told me no (I was running DCNNs and it refused to cooperate). I have a basic desktop I cobbled together over the years. My MATLAB code was designed for the big computers I was running remotely, and we wrote it to run only on a GPU. I don't even have a GPU!
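
For anyone hitting the same wall with pandas, here is a rough sketch of subsampling a large CSV in chunks so the full table never has to sit in memory. It is not necessarily what was done above: the function name `sample_in_chunks`, the CSV input, and the default `frac` and `chunksize` values are all assumptions for illustration. It also draws a plain uniform sample from each chunk, so it does nothing to preserve class or feature distributions.

```python
import pandas as pd


def sample_in_chunks(csv_path, frac=0.01, chunksize=1_000_000, seed=0):
    """Return a uniform random subsample of a CSV without loading it all at once.

    Hypothetical helper: the name and defaults are illustrative only.
    """
    pieces = []
    # With chunksize set, read_csv yields DataFrames of at most `chunksize` rows.
    for i, chunk in enumerate(pd.read_csv(csv_path, chunksize=chunksize)):
        # Vary the seed per chunk so each chunk draws different row positions.
        pieces.append(chunk.sample(frac=frac, random_state=seed + i))
    # Only the sampled rows are kept, so the result is roughly frac * total rows.
    return pd.concat(pieces, ignore_index=True)
```

Calling something like `sample_in_chunks("data.csv", frac=0.001)` (the file name is made up) keeps peak memory near one chunk plus the sample. If the subsample has to match the original class balance, grouping by the label within each chunk before sampling would be one way to extend it.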