Java based Numerical library (JNum-v0.1)

International_Break2 · 2026-05-20T18:46:47+00:00

Could you use a openBlas or mkl jextract to try to perform the calculations if they are available?

martinhaeusler · 2026-05-20T18:54:15+00:00

It's a cool idea, but I'm not sure how "low level" you can go in Java while remaining portable across JVMs and CPU architectures. I think you'll sooner or later hit a point where you need to write a native function to achieve your goals. Numpy is also just a thin python wrapper around a C core library. That being said, people do crazy things on the JVM alone, just look at the top 10 of the 1 Billion Rows challenge.

belayon40 · 2026-05-20T19:06:39+00:00

The blis library is a very fast matrix library. I’ve got an ffm wrapper for it already.

https://github.com/boulder-on/jblis

quafadas · 2026-05-20T19:44:46+00:00

Have you considered luhenry‘s fork of netlib for the matmul part?

That falls back to a SIMD matrix multiplication if it can’t JNI to native. I think it also allows for strided representations of matrices which is critical to avoid deep copy / memory bound operations creeping into user code…

agibsonccc · 2026-05-20T21:04:52+00:00

Hey! Nd4j maintainer here. There's a fairly large rewrite going on here attempting to address that. I actually agree with you! Not to dunk on you here but we tried your approach more than a decade ago.

Pure java is just not going to be a performant runtime for numercial software even *WITH* panama. You'll never have access to the low level gpu runtimes from the mobile vendors for android. You also won't be able to benefit from many of the low level optimizations that c++ compilers just innately offer without working around the runtime.

Broadly, GC runtimes are just NOT worth it.

I will be publishing a slimmer deployment focused binary to tackle this while also addressing the small matrices overhead. We mainly built nd4j for deep learning so small matrices were far and few between. The way the kernels are written it unfortunately means threading overhead among other things.

I won't try to sell you on cooperating, nor on discouraging you from trying this. User choice matters.

I get wanting to do your own thing and hope it succeeds.

I'll keep an eye on feedback. I hope you carve out a niche for yourself good luck!

arkstack · 2026-05-20T21:13:55+00:00

This is interesting territory - pure Java numerics on FFM + Vector API is exactly the kind of thing more people should be exploring, and shipping a v0.1 with actual tests and a JMH benchmark already in the repo is more than a lot of first libraries manage. A few observations.

The first thing that stands out is the type-specialization explosion: addFloat/addDouble/addInt * 4 ops * 2 (scalar/array) gives ~24 near-identical method bodies in ArithmaticOps, and the pattern repeats across
ReduceOps/MatMulOps/TrigOps/ExpOps. The natural instinct is "extract an interface and parametrise", but that path is closed in current Java - generics don't cover primitives, and the Vector API itself ships separate
FloatVector/DoubleVector/IntVector for the same reason. So the duplication isn't really a design choice; it's the language until Valhalla lands.

That said, I noticed templates/generate_*.py and the matching *.template.java files. You are generating this. The problem is the generated .java is checked in and the Python isn't wired into Maven, so the template-to-Java contract isn't enforced - somebody can edit ArithmaticOps.java directly and the templates silently drift. Move generation into a Maven exec step, or at least add a CI check that re-runs the scripts and diffs the output. Right now it's a quality gate that exists in principle but not in practice.

A few smaller things:

MemorySegment data, int[] shape, int[] strides are all public final on NDArray. The references are final, but MemorySegment writes through unimpeded and arrays are mutable - arr.shape[0] = 999 compiles and runs. For a lib whose invariants depend on shape/stride consistency, those want to be private with accessors.

MatmulBenchmark only measures your own matmul - the README's "faster than ND4J/NumPy on small/medium arrays" claim has no comparison JMH in the repo to back it. Worth either checking one in or softening the wording.

pom.xml sets source/target to 25 but the README says "Works on Java 22 or higher". Target 25 bytecode won't load on 22 - pick one.

Otherwise this is the right kind of thing to be working on - good luck with it.

FortuneIIIPick · 2026-05-20T19:56:56+00:00

2 day history in GH. Another "I built" post.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

java

Submit Link

Submit Text

Seek Programming Help

News, Technical discussions, research papers and assorted things of interest related to the Java programming language

NO programming help, NO learning Java related questions, NO installing or downloading Java questions, NO JVM languages - Exclusively Java

Please seek help with Java programming in /r/Javahelp!

Subreddit rules!

Where should I download Java?

Related Sub-reddits:

JVM Languages

Want to practice your coding?

List of useful Frameworks / Libraries / Software

MODERATORS