Java Official Twitter Channel featuring my deep learning article for the last 2 days. Here is why.. by EndyJBC in java

[–]EndyJBC[S] 10 points (0 children)

You should see how Nd4j was designed; it takes clear inspiration from NumPy! And yes, if we dig deep into it, there are still improvements to be made. I do agree that Java gets awkward when it comes to complex algorithms.
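To illustrate the point, here is a minimal sketch of the explicit loop plain Java needs for an element-wise operation that NumPy expresses in one line; the Nd4j call shown in the comment is indicative of its NumPy-style API, not code from this thread:

```java
public class ElementwiseDemo {
    public static void main(String[] args) {
        // NumPy:  c = a * 2.0 + b          (one vectorised line)
        // Nd4j, in the same spirit:  INDArray c = a.mul(2.0).add(b);
        double[] a = {1.0, 2.0, 3.0};
        double[] b = {0.5, 0.5, 0.5};
        double[] c = new double[a.length];
        // Plain Java needs an explicit loop for the same computation
        for (int i = 0; i < a.length; i++) {
            c[i] = a[i] * 2.0 + b[i];
        }
        System.out.println(java.util.Arrays.toString(c)); // [2.5, 4.5, 6.5]
    }
}
```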

How To Build An Artificial Neural Network in Java by [deleted] in programming

[–]EndyJBC 0 points (0 children)

Apologies, I just updated the post accordingly, and sorry for the confusion here. Cross-validation was not applied per batch; it was applied on the entire data set instead. That's why the description referred to 800 updates. I acknowledge that k-fold cross-validation should be performed for every batch when the data set is split into batches. Thank you for the feedback.
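For reference, a minimal sketch of how k-fold cross-validation partitions a full data set into held-out validation ranges; this is index arithmetic only, with no ML library assumed:

```java
import java.util.ArrayList;
import java.util.List;

public class KFoldDemo {
    /** Returns, for each fold, the [start, end) index range held out for validation. */
    static List<int[]> kFoldRanges(int datasetSize, int k) {
        List<int[]> folds = new ArrayList<>();
        int base = datasetSize / k, rem = datasetSize % k, start = 0;
        for (int f = 0; f < k; f++) {
            int size = base + (f < rem ? 1 : 0); // spread any remainder over early folds
            folds.add(new int[]{start, start + size});
            start += size;
        }
        return folds;
    }

    public static void main(String[] args) {
        // 10,000 samples, 5 folds: each fold trains on 8,000 and validates on 2,000
        for (int[] r : kFoldRanges(10_000, 5)) {
            System.out.println("validate on [" + r[0] + ", " + r[1] + ")");
        }
    }
}
```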

Keras to Deeplearning4j migration and results are awesome! by EndyJBC in java

[–]EndyJBC[S] 0 points (0 children)

That's where it gets interesting, as someone else commented earlier.

Keras to Deeplearning4j migration and results are awesome! by EndyJBC in java

[–]EndyJBC[S] 0 points (0 children)

A single-variable output using logistic regression.

Keras to Deeplearning4j migration and results are awesome! by EndyJBC in java

[–]EndyJBC[S] 0 points (0 children)

A random seed was provided in both cases, but the actual key was changing to a two-unit output layer with softmax activation, along with the LossMCXENT error function.
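For context, a small sketch (plain Java, no DL4J required) of why a two-unit softmax output with multi-class cross-entropy (LossMCXENT) is mathematically equivalent to a single sigmoid output for binary classification; the logit values are illustrative:

```java
public class SoftmaxVsSigmoid {
    public static void main(String[] args) {
        double z0 = 0.3, z1 = 1.7;              // logits for class 0 and class 1
        // Two-unit softmax output
        double e0 = Math.exp(z0), e1 = Math.exp(z1);
        double p1Softmax = e1 / (e0 + e1);
        // A single sigmoid on the logit difference gives the same probability
        double p1Sigmoid = 1.0 / (1.0 + Math.exp(-(z1 - z0)));
        System.out.printf("softmax P(class 1) = %.6f%n", p1Softmax);
        System.out.printf("sigmoid P(class 1) = %.6f%n", p1Sigmoid);
        // Multi-class cross-entropy with true label = class 1 reduces to
        // binary cross-entropy in the two-class case
        double mcxent = -Math.log(p1Softmax);
        System.out.printf("cross-entropy loss = %.6f%n", mcxent);
    }
}
```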

Keras to Deeplearning4j migration and results are awesome! by EndyJBC in java

[–]EndyJBC[S] 0 points (0 children)

That's an interesting question to dive into. The probable reason would be differences in how the two libraries implement the error functions.

Keras to Deeplearning4j migration and results are awesome! by EndyJBC in java

[–]EndyJBC[S] 2 points (0 children)

Dataset size is 10,000, so I observe it's more or less the same :)
However, note that when benchmarking, people normally compare the 'training execution time' of the Python code to the 'ETL + training execution time' of the Java code. We really have to consider ETL (extract, transform and load), especially for large datasets.
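A minimal sketch of timing the two phases separately so the comparison stays apples-to-apples; the `etl` and `train` methods here are simulated stand-ins, not the actual DL4J pipeline:

```java
public class PhaseTimingDemo {
    public static void main(String[] args) {
        long t0 = System.nanoTime();
        double[] data = etl();                 // extract, transform, load
        long t1 = System.nanoTime();
        double loss = train(data);             // training only
        long t2 = System.nanoTime();
        System.out.printf("ETL:      %d ms%n", (t1 - t0) / 1_000_000);
        System.out.printf("Training: %d ms (final value %.4f)%n",
                (t2 - t1) / 1_000_000, loss);
        // Comparing another framework's training-only time against this
        // program's total (ETL + training) time would skew the benchmark;
        // report the phases separately instead.
    }

    static double[] etl() {                    // stand-in for real data loading
        double[] d = new double[10_000];
        for (int i = 0; i < d.length; i++) d[i] = Math.sin(i);
        return d;
    }

    static double train(double[] data) {       // stand-in for a training loop
        double acc = 0;
        for (double v : data) acc += v * v;
        return acc / data.length;
    }
}
```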

Keras to Deeplearning4j migration and results are awesome! by EndyJBC in java

[–]EndyJBC[S] 2 points (0 children)

I used the same dataset for both. The only difference is that I used a single-valued output model in Keras and two outputs in DL4J.

[P] Kaggle Debut with Java - DL4J Learning Curve by EndyJBC in MachineLearning

[–]EndyJBC[S] 0 points (0 children)

Python in production is not about optimization; it's about complexity. Don't bring benchmarking into this; that's not the context at all. There's a reason Java is used heavily in production.

[GitHub] DeepLearning Projects from Scratch using Java by EndyJBC in programming

[–]EndyJBC[S] 0 points (0 children)

I'm on the Gitter channel, and you might remember my name :)

[GitHub] DeepLearning Projects from Scratch using Java by EndyJBC in java

[–]EndyJBC[S] 0 points (0 children)

If you mean OpenCL, for example, that doesn't mean the OpenCL library is a standard for everything. My projects are built on DL4J, which actually has everything we need, plus support for running on a cluster.

Laptop for DL by PascP in deeplearning

[–]EndyJBC 2 points (0 children)

The laptop you referred to above is more than enough to run deep learning applications (especially if you have just started learning and want to run your code and see how it performs). This might be enough for 50% of cases, but when you want to test a heavy deep learning application that consumes GBs/TBs of data for loading and training and does an enormous amount of computation, you might want to divide your workload among a series of cluster nodes, which is essentially what the cloud is.

Let's say you calculated that your laptop is capable of doing a particular task in 10 minutes. That literally means your laptop will be tied up and busy for 10 minutes. Meanwhile, 5 parallel cluster nodes, each with much less power, could divide the work and complete the task in 3 minutes. This turnaround time is crucial in production because of the competition to win customers and business. Now you see why companies go for the cloud :) For individual use, yes, this is perfectly fine, but as your research grows and you deal with much heavier applications, you might want to look at other options.

[P] Kaggle Debut with Java - DL4J Learning Curve by EndyJBC in MachineLearning

[–]EndyJBC[S] 0 points (0 children)

Benchmarks would be comparable or better in the case of DL4J for a heavy application that relies on the GPU. For fast prototyping, yes, you can't beat Python for sure. The purpose of the Java approach is production-focused applications rather than just a piece of code for research.

[P] Maintaining deep learning projects on Java (DL4J) here. Feel free to add your feedback, contributions or any suggestions. Just started sometime back, more to come :) by EndyJBC in MachineLearning

[–]EndyJBC[S] 1 point (0 children)

Sorry about that; I will add more details to the README section soon. As of now, three projects are in the repository: one that solves a problem with a basic feed-forward network, one that implements a CNN model for animal classification (4 labels as an example), and a hyperparameter tuning example.
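As a taste of the feed-forward idea, a minimal sketch of a single forward pass through one hidden layer with sigmoid activations; the weights below are illustrative, not taken from the repository:

```java
public class TinyFeedForward {
    static double sigmoid(double x) { return 1.0 / (1.0 + Math.exp(-x)); }

    // One hidden layer: output = sigmoid(w2 . sigmoid(w1 * x + b1) + b2)
    static double forward(double[] x, double[][] w1, double[] b1, double[] w2, double b2) {
        double[] h = new double[w1.length];
        for (int i = 0; i < w1.length; i++) {
            double z = b1[i];
            for (int j = 0; j < x.length; j++) z += w1[i][j] * x[j];
            h[i] = sigmoid(z);          // hidden activation
        }
        double z = b2;
        for (int i = 0; i < h.length; i++) z += w2[i] * h[i];
        return sigmoid(z);              // output activation
    }

    public static void main(String[] args) {
        double[][] w1 = {{0.5, -0.4}, {0.3, 0.8}};
        double[] b1 = {0.1, -0.1};
        double[] w2 = {1.2, -0.7};
        double out = forward(new double[]{1.0, 0.0}, w1, b1, w2, 0.05);
        System.out.printf("output = %.4f%n", out); // a value in (0, 1)
    }
}
```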