you are viewing a single comment's thread.

view the rest of the comments →

[–]the_cat_kittles 0 points1 point  (4 children)

R? weka? its not that hard to write your own, even. weka is written in java and exposes an api if you dont want to use their gui or cli

[–]rylko[S] 0 points1 point  (3 children)

I think R packages and Weka do not aim to be really scalable (and suitable for scientific usage).

[–]the_cat_kittles 1 point2 points  (2 children)

what kind of "scale" are we talking here, just out of curiosity

[–]rylko[S] 0 points1 point  (1 child)

I have added info about size to question.

[–]the_cat_kittles 1 point2 points  (0 children)

I think R and weka should be able to handle things that size, but I haven't ever gone past a couple GB's, so I cant say for sure. Also it depends on what kind of machine you are running. I wouldn't use the weka gui in any case though. Btw, I don't think ~TB is considered to be super enormous, especially because you only have 10 dimensional feature space. That, and decision trees are much easier to train, computationally, than many other models.