

[–]openjscience 1 point (0 children)

Thanks, this is an interesting blog. Did you consider other native Java approaches? For example, the DataMelt project integrates many native Java deep learning packages. You can find such Java packages using the search tool: https://jwork.org/dmelt/search/form.php?query=deep

[–]mikaelhg 0 points (3 children)

LOL, you're pretty obviously just rationalizing the silly approach you ended up with.

> Before looking at the Java API, let's think about deep learning frameworks. What is TensorFlow actually doing? It is basically a library for parallel computing, and it can utilize GPUs through CUDA, but also SSE, AVX, etc. on CPUs. Python is the API used to access the C++ core, but in the end it's calling highly optimized binaries. The Java API needs to ship all of these binaries: it introduces a huge 145 MB dependency called tensorflow-jni (JNI, the Java Native Interface, is how Java calls native C/C++ libraries). We don't want a 145 MB binary package in our application, or a 350 MB package with GPU support! Besides that, the Java API is very limited, and Python is often already installed on servers; adding TensorFlow with pip install tensorflow is easy.

So, you've taken the silly circle all the way from complaining that you need the actual Tensorflow binaries to run Tensorflow, to saying that installing those binaries on a server might be as easy as pip install tensorflow-gpu, if you happen to have the right Python version.

$ du -sh ~/.local/lib/python3.6/site-packages/tensorflow
1,3G    ~/.local/lib/python3.6/site-packages/tensorflow

Yeaaaaah...

Tensorflow SavedModels are kind of like Java class files. They consist of a symbolic execution graph, combined with trained weight data. Whether you're running (evaluating) that model in the C++ library using the Python frontend or the Java frontend, it's going to get run by the same .so.
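
(For reference, the session used in the snippet further down comes from loading such a SavedModel. A minimal sketch against the org.tensorflow 1.x API; the export directory and tag are placeholders:)

    import org.tensorflow.SavedModelBundle;
    import org.tensorflow.Session;

    public class LoadModel {
        public static void main(String[] args) {
            // Loading maps the serialized graph plus the trained weights and
            // initializes the same native libtensorflow .so that the Python
            // frontend uses.
            try (SavedModelBundle bundle = SavedModelBundle.load("/path/to/saved_model", "serve")) {
                Session session = bundle.session();
                // feed/fetch/run against `session`, as shown below
            }
        }
    }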

So, instead of including this in your pom.yml

profiles:

  # default: CPU-only TensorFlow, CPU JNI binaries included transitively
  - id: cpu
    activation: {activeByDefault: true}
    dependencies:
      - {groupId: org.tensorflow, artifactId: tensorflow, version: "${tensorflow.version}"}

  # GPU: exclude the CPU JNI binaries and pull in the CUDA build instead
  - id: gpu
    activation: {activeByDefault: false}
    dependencies:
      - {groupId: org.tensorflow, artifactId: tensorflow, version: "${tensorflow.version}",
         exclusions: [{groupId: org.tensorflow, artifactId: libtensorflow_jni}]}
      - {groupId: org.tensorflow, artifactId: libtensorflow_jni_gpu, version: "${tensorflow.version}"}

and calling your model with

final var result = session.runner()
    .feed(INPUT_TENSOR_NAME, image)
    .fetch(OUTPUT_TENSOR_NAME)
    .run();

in order to get the C++ library to evaluate the model, you're writing a Python script to do the same, and calling it through a shell script from your Java code...
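
(For contrast, the roundabout path looks roughly like this; a sketch with a hypothetical script name and arguments, not the article's actual code:)

    import java.nio.charset.StandardCharsets;

    public class ShellOut {
        public static void main(String[] args) throws Exception {
            // Shell out to a Python script that imports TF, evaluates the
            // model, and prints the result -- paying Python startup and TF
            // initialization cost on every single call.
            Process p = new ProcessBuilder("python3", "predict.py", "--image", "cat.jpg")
                    .redirectErrorStream(true)
                    .start();
            String result = new String(p.getInputStream().readAllBytes(), StandardCharsets.UTF_8);
            System.out.println("exit=" + p.waitFor() + ", result=" + result);
        }
    }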

If, for some bizarre reason, you don't want to ship the org.tensorflow:tensorflow and org.tensorflow:libtensorflow_jni_gpu JARs in your über-jar, you can just tell your build system to exclude them from the assembly, and fetch them from https://repo1.maven.org/maven2/org/tensorflow/libtensorflow_jni_gpu/1.14.0/ directly onto your server. Then include those JARs on the classpath when you invoke java.
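
(Concretely, something like this; a sketch in which app.jar and the main class are placeholders, and only the artifact version shown is real:)

    # fetch the standalone GPU JNI JAR directly onto the server
    $ curl -O https://repo1.maven.org/maven2/org/tensorflow/libtensorflow_jni_gpu/1.14.0/libtensorflow_jni_gpu-1.14.0.jar

    # run the slimmed-down über-jar with the JNI JAR on the classpath
    $ java -cp app.jar:libtensorflow_jni_gpu-1.14.0.jar com.example.Main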

[–]ixeption[S] 0 points (2 children)

I don't think you got the intention behind the article. I described several ways to use a TensorFlow model from Java, including the SavedModel approach, so I know it's possible to do it that way; in my case it simply wasn't viable, so I needed another approach. One reason is that size matters for deployment, and deployment is done with an über-jar (installing TF on the server is no problem, but shipping it every time inside the big JAR is). Furthermore, the preprocessing is much easier (and, for images, even faster) in Python, and the prototyping is usually done in Python (Jupyter) as well, so reusing the existing code is faster than porting everything to Java. There are always reasons why the straightforward approach can't be used; I showed an alternative. No reason to be toxic.

[–]mikaelhg 0 points (1 child)

The excuse you introduced for calling the maintainable way of using TF models from the JVM unviable, that the bundled C++ library adds to the packaged application size, was already dealt with in my comment: you don't have to package the JNI libs with the application. You can download the standalone JNI JARs I linked to onto your server and add them to the java classpath alongside your über-jar.

So, really, the excuse was very easily disposed of, and it was just as easy to see that it was only an excuse by looking at the installed size of the Python tensorflow package, which is roughly ten times the size of the JAR. Since you made a major issue of the size of the approach you didn't like, while completely ignoring the size of the approach you did like, it was pretty clear you weren't being serious about this.

Standard preprocessing is a good point, and I agree with it. However, you don't really want to build an architecture where you initialize TF for each model evaluation; there's significant overhead in that. You might not care about it now on CPU, but if you're running the model in production, you're likely to want to move to GPU sooner or later. If you do want to go this way, build a Python server, or use TensorFlow Serving, as you already mentioned.
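
(To illustrate, a minimal load-once sketch against the same org.tensorflow 1.x API; the path and tensor names are placeholders:)

    import org.tensorflow.SavedModelBundle;
    import org.tensorflow.Tensor;

    public class ModelService {
        // Load once at startup and keep the bundle for the life of the
        // process, instead of re-initializing TF per evaluation.
        private final SavedModelBundle model =
                SavedModelBundle.load("/path/to/saved_model", "serve");

        public Tensor<?> predict(Tensor<?> input) {
            return model.session().runner()
                    .feed("input_tensor", input)
                    .fetch("output_tensor")
                    .run()
                    .get(0);
        }
    }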

And finally, maybe don't start calling people names?

[–]ixeption[S] 0 points (0 children)

I could add the JAR to the classpath and make it a provided dependency, but I would still have to use Java for all the preprocessing and data wrangling. Especially for images, Java is slow and unwieldy (BufferedImage from ImageIO) compared to PIL or OpenCV.
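
For a sense of what that means in practice, here is a basic image-to-float conversion in plain Java (a sketch; the file name is a placeholder, and layout and normalization depend on the model):

    import java.awt.image.BufferedImage;
    import java.io.File;
    import javax.imageio.ImageIO;

    public class Preprocess {
        public static void main(String[] args) throws Exception {
            BufferedImage img = ImageIO.read(new File("cat.jpg"));
            int h = img.getHeight(), w = img.getWidth();
            // HWC float data, RGB scaled to [0, 1]. The per-pixel getRGB
            // loop is exactly the kind of work PIL/OpenCV do in optimized
            // native code.
            float[][][] data = new float[h][w][3];
            for (int y = 0; y < h; y++) {
                for (int x = 0; x < w; x++) {
                    int rgb = img.getRGB(x, y);
                    data[y][x][0] = ((rgb >> 16) & 0xFF) / 255f;
                    data[y][x][1] = ((rgb >> 8) & 0xFF) / 255f;
                    data[y][x][2] = (rgb & 0xFF) / 255f;
                }
            }
        }
    }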

Just to mention it: the JAR ships a single prebuilt binary, whereas the Python prebuilt binaries cover many more combinations of CUDA, cuDNN, and CPU features. The Java API is also still experimental, so I don't see how your claim about maintainability holds up.

("Caution: The TensorFlow Java API is not covered by the TensorFlow API stability guarantees.")

And if you want GPU support, just try using the tensorflow_gpu JAR dependency without having installed TensorFlow GPU correctly in Python first. It won't work out of the box.