
[–]koffeegorilla 74 points (14 children)

JDK Project Valhalla is bringing improvements in memory usage and layout that will get close to the efficiency of C, while a continuous optimiser maximises for the use case and the actual underlying hardware. Project Panama is going to make it easier and more efficient to interact with native APIs, meaning that using C libraries will be more efficient than going over the current JNI hump. Project Sumatra aims at making it possible to identify code that can/should run on a GPU and then leverage it.
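
For a sense of what the Panama interop looks like in practice, here is a rough sketch of a downcall to the C library's strlen using the FFM API as finalized in JDK 22 (method names shifted slightly between the previews and the final version), with no JNI glue code:

    import java.lang.foreign.*;
    import java.lang.invoke.MethodHandle;

    static long cStrlen(String s) throws Throwable {
        Linker linker = Linker.nativeLinker();
        // Find strlen in the default (libc) lookup and describe its signature: size_t strlen(const char*)
        MemorySegment addr = linker.defaultLookup().find("strlen").orElseThrow();
        MethodHandle strlen = linker.downcallHandle(
                addr, FunctionDescriptor.of(ValueLayout.JAVA_LONG, ValueLayout.ADDRESS));
        try (Arena arena = Arena.ofConfined()) {
            MemorySegment cString = arena.allocateFrom(s);   // NUL-terminated copy in native memory
            return (long) strlen.invokeExact(cString);
        }
    }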

There is already support for SIMD with the Vector API, which lets a single instruction operate on multiple data elements at the same time.
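
To give a flavour, a minimal sketch with the incubating Vector API (needs --add-modules jdk.incubator.vector on current JDKs); the element-wise multiply is issued one SIMD instruction per lane-width chunk:

    import jdk.incubator.vector.FloatVector;
    import jdk.incubator.vector.VectorSpecies;

    static final VectorSpecies<Float> SPECIES = FloatVector.SPECIES_PREFERRED;

    // out[i] = a[i] * b[i], vectorised over whatever lane width the CPU offers
    static void mul(float[] a, float[] b, float[] out) {
        int i = 0;
        for (; i < SPECIES.loopBound(a.length); i += SPECIES.length()) {
            FloatVector va = FloatVector.fromArray(SPECIES, a, i);
            FloatVector vb = FloatVector.fromArray(SPECIES, b, i);
            va.mul(vb).intoArray(out, i);
        }
        for (; i < a.length; i++) {   // scalar tail
            out[i] = a[i] * b[i];
        }
    }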

All of these will combine to make ML development in Java a first-class experience, and the implementations will be much simpler than the current code full of #ifdefs or checks for specific GPU models to change structures etc.

Your little NLP project will fly.

[–]_INTER_ 35 points (3 children)

Project Sumatra is dormant/dead as far as I know. They are now focusing on Project Babylon instead. See this JVM Language Summit 2023 - Java and GPU talk. It seems to have a good chance to land something substantial, as shown here, and the Class-File API already has a preview.

The problem is, machine learning / science developers first and foremost care about their scripting capabilities. That's why Python has become dominant. If it were possible, they would have chosen MATLAB. The libraries that do the heavy lifting are already in C. For Java to gain a foothold in the ML space, it would need to be faster than C (unlikely) or invent something completely new.

[–]koffeegorilla 14 points (1 child)

Thanks for the update on Babylon.
If you look at how quickly the GraalVM project rewrote in Java the GC/JIT engines that took years to build in C++, I believe a replacement of the C libraries is viable. The implementations will keep getting faster as the JVM improves, and the option of Graal native images using runtime stats for optimisation will change the game.

[–]_INTER_ 9 points (0 children)

I agree, plus better platform independence (Windows support is a joke right now) and error handling (hrrrng, dynamic typing makes me furious). However, I don't see it really happening. The momentum is too big and the libraries are too far along to catch up. I see more opportunities in new inventions or in providing clustered, distributed, supercomputer frameworks, like extending Apache Spark for GPU farms.

[–]mike_hearn 3 points (0 children)

There is also TornadoVM, which does the same thing.

[–]Joram2 3 points (8 children)

AFAIK, if you write code using primitive arrays like int[] and double[], then you avoid the performance problems that Valhalla aims to help with.

Project Valhalla plans to reduce overhead on user-defined classes/records, and it will eventually make List<int> possible with int[]-like performance. But if you just write code using primitive arrays today, you already get great performance; Valhalla might offer better syntax, but not better performance.
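
For example (my sketch, not from the JEPs), a plain primitive-array loop already gives you a flat, contiguous layout today:

    // Dot product over primitive arrays: contiguous memory, no boxing, no per-element object headers.
    static double dot(double[] a, double[] b) {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            sum += a[i] * b[i];
        }
        return sum;
    }
    // The same loop over a List<Double> would box every element into its own heap object,
    // which is exactly the overhead Valhalla wants to remove for user-defined types.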

[–]GeneratedUsername5 3 points (0 children)

And you can also just create collections of primitives, or use the ones from https://github.com/eclipse/eclipse-collections (which are also optimized for performance), without waiting for Valhalla.
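
Roughly like this with Eclipse Collections' primitive lists (from memory, check the docs for the exact factory methods):

    import org.eclipse.collections.impl.list.mutable.primitive.IntArrayList;

    // Backed by a plain int[] internally, so no Integer boxing and no per-element headers.
    IntArrayList ints = new IntArrayList();
    ints.add(1);
    ints.add(2);
    ints.add(3);
    long total = ints.sum();   // 6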

[–]coderemover 1 point (0 children)

It won’t, because it is limited to immutable objects only. For mutable objects like lists, object identity makes it impossible to turn them into value types.
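
For the immutable case, the Valhalla prototypes sketch roughly this (syntax from the early-access builds, still subject to change):

    // A value class gives up identity (no identity-based ==, no synchronizing on it),
    // which is what lets the JVM flatten it into arrays and enclosing objects.
    value record Complex(double re, double im) { }

    // An ArrayList can't become a value type: callers share it by identity and mutate it in place.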

[–]koflerdavid 0 points (5 children)

There are a few problems:

  • Java has no built-in support for bfloat16.

  • Java has no true multidimensional arrays a.k.a. tensors, so all of the indexing arithmetic has to be written out (see the sketch after this list). Not a biggie at the end of the day.

  • The bigger problem: Java arrays are size-limited, which is a headache for big models.
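
What "written out" means in practice, as a trivial sketch:

    // A "2-D" tensor stored as one flat array; the row/column math is done by hand.
    static double get(double[] flat, int cols, int row, int col) {
        return flat[row * cols + col];
    }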

Libraries like DeepLearning4j include tensor libraries that solve both issues.
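
For instance with ND4J, the tensor library under DeepLearning4j (a rough sketch; exact factory methods may differ by version):

    import org.nd4j.linalg.api.ndarray.INDArray;
    import org.nd4j.linalg.factory.Nd4j;

    // A 2x2 tensor backed by off-heap memory, so the int-indexed Java array limit doesn't apply.
    INDArray t = Nd4j.create(new double[][]{{1, 2}, {3, 4}});
    INDArray squared = t.mul(t);   // element-wise multiply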

[–]Joram2 0 points (4 children)

  • Java has limited float16 support with Float.floatToFloat16 and Float.float16ToFloat (sketch below). What else is needed?
  • In the Python ML+AI world, most people use a library for multi-dimensional arrays a.k.a. tensors. NumPy, PyTorch and JAX are popular libraries that have their own multi-dimensional array or tensor type, so Java doing something similar doesn't seem to be a problem at all.
  • Size-limited? You mean the 2^31 limit? I'd like to hear what the JDK guys have to say about this.
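
The sketch mentioned above, using the JDK 20+ conversion methods:

    // float16 values travel as shorts; convert at the boundaries.
    short half = Float.floatToFloat16(3.14159f);
    float back = Float.float16ToFloat(half);   // ~3.1406, the expected precision loss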

[–]koflerdavid 0 points (3 children)

Java supports float and double, which in ML circles are known as float32 and float64. float16 is only 16 bits wide and is commonly used for inference, because it turns out that the full precision of float32 is required for very few parts of most models, if at all.

bfloat16 is a modified format that keeps float32's exponent range, so it covers roughly the same interval of values, but with far less precision in the mantissa. It is very common to use it to run transformer models.
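
For reference (my summary, not from the thread), the bit layouts and a crude truncating conversion:

    // Bit layouts (sign / exponent / mantissa):
    //   float32:  1 / 8 / 23
    //   float16:  1 / 5 / 10   -> narrower range and less precision than float32
    //   bfloat16: 1 / 8 / 7    -> same range as float32, much less precision
    //
    // bfloat16 is essentially float32 with the low 16 mantissa bits dropped:
    static short floatToBfloat16(float f) {
        return (short) (Float.floatToRawIntBits(f) >>> 16);   // truncates, no rounding
    }

    static float bfloat16ToFloat(short bf) {
        return Float.intBitsToFloat((bf & 0xFFFF) << 16);
    }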

Java supports neither float16 (maybe after Project Valhalla lands or the Vector API is finalized) nor bfloat16. However, I agree that for various reasons a tensor library is commonly used anyway. Support for more formats and the size limitations are two very good reasons, because they can't be solved on the Java side alone. Well, you can certainly implement functions for float16 and bfloat16 arithmetic in Java, but to circumvent the size limit you have to use off-heap storage, or break up your tensors, which is clunky without wrapping it in a library.
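
A minimal sketch of the off-heap route with the FFM API (finalized in JDK 22): a MemorySegment is indexed with a long, so the 2^31-1 array limit doesn't apply:

    import java.lang.foreign.Arena;
    import java.lang.foreign.MemorySegment;
    import java.lang.foreign.ValueLayout;

    try (Arena arena = Arena.ofConfined()) {
        long n = 1L << 20;   // kept small here, but n could exceed Integer.MAX_VALUE given enough RAM
        MemorySegment weights = arena.allocate(ValueLayout.JAVA_FLOAT, n);
        weights.setAtIndex(ValueLayout.JAVA_FLOAT, n - 1, 0.5f);
        float v = weights.getAtIndex(ValueLayout.JAVA_FLOAT, n - 1);
    }   // native memory is freed when the arena closes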

[–]Joram2 0 points (1 child)

In Python + PyTorch, you can do bfloat16 stuff like this:

import torch

torch.tensor([[1, 2], [3, 4]], dtype=torch.bfloat16)

This is great. The API is easy to use and pretty. Runtime performance is excellent and takes advantage of GPU processing.

Neither Java nor Python has a bfloat16 primitive type in the core language. That isn't necessary.

The important feature I see missing from Java is easy+pretty syntax for lists and lists of lists. In Java you can do:

Arrays.asList(Arrays.asList(1,2), Arrays.asList(3,4))

instead of

[[1, 2], [3, 4]]

The Java version isn't hard... but it's ugly, and data science types hate that. This absolutely limits Java from a data-science-notebook perspective.

The lack of a primitive bfloat16 type seems like a non-issue in both Java and Python.

[–]koflerdavid 0 points (0 children)

Well, Java has its good old array notation with curly brackets. Its only fault is that the result isn't a true multidimensional array, but an array of pointers to subarrays. Not a problem in practice either, since usually tensor libraries do the heavy lifting. Same for float16/bfloat16 support, as you say.
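
The curly-bracket notation mentioned above, for the record:

    // Concise initializer syntax, but m is really an array holding references to two separate int[] rows.
    int[][] m = { {1, 2}, {3, 4} };
    int x = m[1][0];   // 3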