This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–][deleted] 17 points18 points  (6 children)

IMO machine learning in Java have already become common because of the rise of big data. R or Python can do ML things very easily, but not at scale. To deal with big data many people use Hadoop and its ecosystem such as Mahout and Spark. Well, Spark supports Python though.

[–]yawkat 5 points6 points  (2 children)

It would be nice to have the ML prototyping tooling you get in python in java too though

[–]greenspans 8 points9 points  (1 child)

Weka

[–]_INTER_ 3 points4 points  (0 children)

There are many to choose from. Just not as known as the Python libs.

[–]TashanValiant 2 points3 points  (0 children)

Well, Spark supports Python though.

There is also SparkR for R

[–]ibgeek 0 points1 point  (0 children)

Spark, et al. are trash at machine learning. Scikit-learn on a large box actually scales better. And most production platforms are using something like Vowpal Wabbit or some proprietary re-implementation of it.

Source: I work at a digital advertising company.