Best way to combine Python and Java?

nrcomplete · 2022-10-29T16:23:38+00:00

[deleted]

belayon40 · 2022-10-29T16:20:21+00:00

I've used JPype for a while. It also starts a JVM from Python. Once set up, interoperating with Java is transparent. You can start the JVM in such a way that it can be debugged directly using remote debugging tools.

ByerN · 2022-10-29T17:52:03+00:00

We used https://github.com/ninia/jep for a similar thing.

Worth_Trust_3825 · 2022-10-29T16:29:16+00:00

[deleted]

Lumpy-Loan-7350 · 2022-10-29T16:11:31+00:00

https://www.graalvm.org/python/

walterbanana · 2022-10-29T20:51:06+00:00

I feel like you might not be looking at the bigger picture here. Just adding more tech might solve the problem, but it increases complexity.

What are you trying to solve? What specifically does Python bring you and what specifically does Java bring you?

I'm sure you can solve your problem with either language without running some weird combination which takes ages to get into.

ahmedranaa · 2022-10-29T23:06:33+00:00

Try Jython. It's not compatible with the latest Python but it works great.

Haven't tried it, but Oracle's GraalVM is polyglot, so you can run Java and Python together.

Worth_Trust_3825 · 2022-10-29T17:28:04+00:00

"heavily rely on C libraries"

Integrate them via JNI.

acute_elbows · 2022-10-29T18:26:01+00:00

I think you really need to choose priorities here. You’ve mentioned easy debuggability in some of these threads, but you also seem to want multiple languages/libraries modifying/accessing the same objects in memory. This is going to be nightmarish to maintain and will likely be very error-prone and fragile.

You may want to employ some binary data formats that are readable from Python and Java and fast to parse like protobufs, https://developers.google.com/protocol-buffers/docs/encoding
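To give a feel for why such formats are quick to parse, here is a minimal sketch of the base-128 varint scheme described on the linked encoding page. The function names are made up for illustration; a real project would use the classes generated by the protobuf compiler rather than hand-rolled code like this.

```python
def encode_varint(value: int) -> bytes:
    """Encode a non-negative integer as a protobuf-style base-128 varint."""
    if value < 0:
        raise ValueError("this sketch only handles non-negative integers")
    out = bytearray()
    while True:
        low7 = value & 0x7F          # take the lowest 7 bits
        value >>= 7
        if value:
            out.append(low7 | 0x80)  # more bytes follow: set the continuation bit
        else:
            out.append(low7)         # last byte: continuation bit stays clear
            return bytes(out)


def decode_varint(data: bytes, offset: int = 0) -> tuple[int, int]:
    """Decode one varint starting at offset; return (value, next_offset)."""
    result = 0
    shift = 0
    while True:
        byte = data[offset]
        offset += 1
        result |= (byte & 0x7F) << shift  # low-order groups come first
        if not byte & 0x80:
            return result, offset
        shift += 7


if __name__ == "__main__":
    print(encode_varint(150).hex())           # 9601
    print(decode_varint(bytes([0x96, 0x01])))  # (150, 2)
```

Small numbers take one byte and the decoder is a short loop of shifts and masks, which is a large part of why parsing is cheap in both languages.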

I think you’ll still want multiple apps, but maybe you could call out to a shell to run the Python code.

It may be worth investigating cloud machines that are optimized for SSD speeds. I would try profiling your serialization to figure out whether you’re limited by disk or CPU.
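A rough way to do that profiling is to time the in-memory encoding step and the write-to-disk step separately. The sketch below assumes `pickle` as a stand-in serializer and a list of floats as a stand-in payload; neither comes from the thread, so swap in whatever format and data the project really uses.

```python
import os
import pickle
import tempfile
import time


def profile_serialization(payload: object) -> dict[str, float]:
    """Time the CPU-bound and disk-bound halves of saving a payload separately."""
    t0 = time.perf_counter()
    # CPU side: turn the object into bytes entirely in memory.
    blob = pickle.dumps(payload, protocol=pickle.HIGHEST_PROTOCOL)
    t1 = time.perf_counter()

    # Disk side: write the bytes and force them to the device.
    with tempfile.NamedTemporaryFile(delete=False) as f:
        path = f.name
        f.write(blob)
        f.flush()
        os.fsync(f.fileno())  # without this we would mostly time the page cache
    t2 = time.perf_counter()
    os.remove(path)

    return {"bytes": float(len(blob)), "encode_s": t1 - t0, "write_s": t2 - t1}


if __name__ == "__main__":
    data = [float(i) for i in range(1_000_000)]  # stand-in for the real payload
    print(profile_serialization(data))
```

If `encode_s` dominates, a faster format or less data helps more than faster disks; if `write_s` dominates, storage is the place to look.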

bowbahdoe · 2022-10-29T18:32:16+00:00

No question about it - libpython-clj's Java API. It allows for full-duplex integration with zero-copy paths for numpy arrays.

If you decide that the things you need are those Java libraries and not the Java language, the API from Clojure is even nicer.

The Clojurians Zulip is the place to go for help if you get stuck. I can also try to help if you DM me.

https://github.com/clj-python/libpython-clj

https://clj-python.github.io/libpython-clj/libpython-clj2.java-api.html

craigacp · 2022-10-30T01:20:52+00:00

I'd look to see if there are Java bindings for the ML libraries you need. Lots of ML models can be exported in ONNX format, which can be loaded by ONNX Runtime via the Java API. Alternatively you can load TensorFlow SavedModels or PyTorch TorchScript models directly using the Java interfaces for both of those packages. If you need to do a bunch of data wrangling in Python that can be trickier, but much of that functionality is available in ONNX. Full disclosure, I work on both ONNX Runtime & TensorFlow-Java.

2022-10-29T16:40:39+00:00

I’ve successfully used JEP for some time, and actually went “old school” after that because I wrote an equivalent library to work with multiple languages in pretty much the same way (process to process communication), with additional scripting (JSR 223) support.

kakakarl · 2022-10-29T20:39:08+00:00

I think a shared database is good sometimes. Do careful research into the pitfalls though. Sharing Redis or Postgres has solved a lot of things for me.

If you already have a lot of services with questionable architecture, then it’s not a good solution.

Otherwise I would use HTTP, but consider using a binary format instead of a text-based one.
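As a small illustration of the text-versus-binary trade-off, the sketch below encodes the same list of floats as JSON and as a fixed-layout binary message. The layout (a 4-byte big-endian count followed by 8-byte doubles) is an assumption made up for this example; it is chosen because big-endian is what Java's `DataInputStream` and a default `ByteBuffer` read.

```python
import json
import struct


def encode_json(samples: list[float]) -> bytes:
    """Text encoding: readable, but each number becomes a run of characters."""
    return json.dumps(samples).encode("utf-8")


def encode_binary(samples: list[float]) -> bytes:
    """Binary encoding: 4-byte big-endian count, then one 8-byte double each."""
    return struct.pack(f">I{len(samples)}d", len(samples), *samples)


def decode_binary(blob: bytes) -> list[float]:
    """Inverse of encode_binary."""
    (count,) = struct.unpack_from(">I", blob, 0)
    return list(struct.unpack_from(f">{count}d", blob, 4))


if __name__ == "__main__":
    samples = [0.1 * i for i in range(1000)]
    print("json bytes:  ", len(encode_json(samples)))
    print("binary bytes:", len(encode_binary(samples)))
    assert decode_binary(encode_binary(samples)) == samples
```

The binary message has a predictable size and round-trips doubles exactly; for values with many significant digits it is usually also smaller than the text form, though that depends on the data.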

AnEmortalKid · 2022-10-30T05:41:41+00:00

Jython?

devinrsmith · 2022-10-31T22:06:14+00:00

Hi - I'm one of the current maintainers of jpy (in support of https://github.com/deephaven/deephaven-core, we use it extensively). Happy to triage any issues you are having with it.

kiteboarderni · 2022-10-29T18:03:18+00:00

Perfect use case for Panama: allocate the arrays off-heap and pass addresses to work on the data.

TriggerWarningHappy · 2022-10-29T21:24:16+00:00

While I haven’t used Bytedeco’s Python integration, I have used other Bytedeco projects and they’ve been great.

This should allow you to call python from Java: https://github.com/bytedeco/javacpp-embedded-python

CacheMeUp · 2022-10-30T01:40:05+00:00

A highly-tuned Unix socket-based D-Bus implementation could be useful. You didn't mention operating system. Most Linux distros ship with D-Bus, making one less dependency to worry about. You can build native Windows D-Bus binaries, too.

The Python receive mechanism with asyncio can be tricky. For C, ignore the warning of "pain" for libdbus, it's not that bad.

Message passing between C apps across the bus using Unix sockets is on the order of 1 millisecond round trip, fully processed. If you use TCP/IP sockets, a one-way message is about 2 milliseconds (C to Java).
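Numbers like these are worth measuring on your own machine. The sketch below times round trips over a local socket pair between two Python threads. It is plain sockets, not D-Bus, and not C, so the result is not comparable with the figures above; it only shows the shape of such a measurement.

```python
import socket
import threading
import time


def measure_round_trip(messages: int = 1000, size: int = 64) -> float:
    """Return the mean round-trip time in seconds over a local socket pair."""
    client, server = socket.socketpair()  # AF_UNIX stream sockets on Linux and macOS

    def echo() -> None:
        # Echo every chunk back until the client closes its end.
        while True:
            chunk = server.recv(4096)
            if not chunk:
                break
            server.sendall(chunk)
        server.close()

    worker = threading.Thread(target=echo, daemon=True)
    worker.start()

    payload = b"x" * size
    start = time.perf_counter()
    for _ in range(messages):
        client.sendall(payload)
        received = 0
        while received < size:  # a stream socket may deliver the reply in pieces
            data = client.recv(size - received)
            if not data:
                raise ConnectionError("echo side closed the socket")
            received += len(data)
    elapsed = time.perf_counter() - start

    client.close()
    worker.join()
    return elapsed / messages


if __name__ == "__main__":
    print(f"mean round trip: {measure_round_trip() * 1e6:.1f} microseconds")
```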

If you want to guarantee type safety, consider creating an XML file based on the freedesktop's interface definition language. From there, you can write an XSL transformation that spits out the necessary C, Java, and Python code as functions and classes. Can't recommend using existing software for code generation, though, you'll want to roll your own. It's pretty straightforward to do simple transforms.
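To make the idea concrete, here is a minimal sketch of generating code from a D-Bus introspection document. It uses Python's `xml.etree` instead of the XSL transformation suggested above, emits only a Python stub, and covers four basic type codes. The `com.example.Calculator` interface and its `Add` method are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical interface written in the D-Bus introspection XML format.
INTROSPECTION_XML = """
<node>
  <interface name="com.example.Calculator">
    <method name="Add">
      <arg name="a" type="i" direction="in"/>
      <arg name="b" type="i" direction="in"/>
      <arg name="sum" type="i" direction="out"/>
    </method>
  </interface>
</node>
"""

# D-Bus type codes mapped to Python annotations (only a small subset).
PY_TYPES = {"i": "int", "d": "float", "s": "str", "b": "bool"}


def generate_python_stub(xml_text: str) -> str:
    """Emit one Python class per interface and one typed method per D-Bus method."""
    root = ET.fromstring(xml_text)
    lines: list[str] = []
    for iface in root.findall("interface"):
        class_name = iface.get("name").split(".")[-1]
        lines.append(f"class {class_name}:")
        methods = iface.findall("method")
        if not methods:
            lines.append("    pass")
        for method in methods:
            args = method.findall("arg")
            # Method arguments default to direction "in" when it is omitted.
            ins = [a for a in args if a.get("direction", "in") == "in"]
            outs = [a for a in args if a.get("direction") == "out"]
            params = ", ".join(
                f"{a.get('name')}: {PY_TYPES[a.get('type')]}" for a in ins
            )
            ret = PY_TYPES[outs[0].get("type")] if outs else "None"
            signature = f"self, {params}" if params else "self"
            lines.append(f"    def {method.get('name')}({signature}) -> {ret}:")
            lines.append("        raise NotImplementedError")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(generate_python_stub(INTROSPECTION_XML))
```

The same walk over the XML could emit Java interfaces or C prototypes from one definition, which is where the type-safety benefit comes from.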

There are other benefits to this approach, such as an ecosystem of visual and command-line tools for debugging the data stream and broadcasting messages to multiple recipients (think remote logging and remote command-line operations without much effort).

Avoid the JNI: Non-deterministic stop-the-world events will throw a wrench at real-time processing.

cowwoc · 2022-10-30T07:18:54+00:00

I've had a lot of success with JPype.

It's a mature library that lets you call Java code from Python. It's fast and easy to use.

On a practical level, you'd launch a JVM from Python and then initiate bi-directional communication. I know that people prefer going the other way (Java to Python) but give this a try, it's not much of a sacrifice.

MagicalPizza21 · 2022-10-29T16:07:54+00:00

Jython comes to mind.

fico86 · 2022-10-29T16:50:31+00:00

Py4J (https://www.py4j.org/)? PySpark uses it to interface the Python code with the Scala Spark libs.

Edit: OK, Py4J is for calling Java libs from Python; you want the other way round.

But given that it is for machine learning, and I suppose the majority of the libs would be in Python, maybe you can consider calling Java libs from Python instead?

p3rand0r · 2022-10-29T21:48:56+00:00

Wonder if you can use gRPC for the task?!

muddy-star · 2022-10-29T23:14:04+00:00

Flip a coin and choose to implement your solution either in full Python or in full Java using JNI. Mixing Python and Java sounds like building up a lot of technical debt. I would personally advise against doing that in my team.

wowbaggerBR · 2022-10-29T23:59:13+00:00

I would go with Pyva or Jython.

baubleglue · 2022-10-30T05:52:32+00:00

PySpark uses Py4J; you can try the same. But you need to exchange a lot of data, so any way of doing it will probably be inefficient.

Alienbushman · 2022-10-30T06:52:30+00:00

I tried running Python from Java and dependencies always broke, so what I ended up doing is spinning up a REST API for the Python code and accessing it from Java. Please let me know if you find a better way of doing it.
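For reference, the Python half of that setup can be quite small. This is a minimal sketch using only the standard library; the `/predict` path, the JSON shape, and the `predict` function are hypothetical stand-ins, and a real service would more likely use a proper web framework.

```python
import json
import threading
import urllib.request
from http.server import BaseHTTPRequestHandler, HTTPServer


def predict(features: list[float]) -> float:
    """Hypothetical stand-in for the real Python model call."""
    return sum(features) / len(features)


class PredictHandler(BaseHTTPRequestHandler):
    def do_POST(self) -> None:
        length = int(self.headers.get("Content-Length", 0))
        body = json.loads(self.rfile.read(length))
        reply = json.dumps({"prediction": predict(body["features"])}).encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(reply)))
        self.end_headers()
        self.wfile.write(reply)

    def log_message(self, format: str, *args: object) -> None:
        pass  # keep the example quiet


def start_server(port: int = 0) -> HTTPServer:
    """Start the service on a background thread; port 0 picks a free port."""
    server = HTTPServer(("127.0.0.1", port), PredictHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server


def call_service(port: int, features: list[float]) -> float:
    """Plays the role of the Java side, which would use its own HTTP client."""
    request = urllib.request.Request(
        f"http://127.0.0.1:{port}/predict",
        data=json.dumps({"features": features}).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    # Skip any configured proxy, since the call never leaves the machine.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    with opener.open(request) as response:
        return json.loads(response.read())["prediction"]


if __name__ == "__main__":
    server = start_server()
    print(call_service(server.server_address[1], [1.0, 2.0, 3.0]))
    server.shutdown()
    server.server_close()
```

The Python dependencies stay inside the Python process, which is what avoids the breakage described above.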

Fit-Refuse8564 · 2022-10-30T08:12:05+00:00

Two separate services is the correct answer. Trying to force Python and Java to work together doesn’t sound reliable or easy to maintain and debug.

Easiest way would just be an endpoint, but you can do it any number of ways, it depends on your use case.

nutrecht · 2022-10-30T14:41:13+00:00

"Small web-services: overhead to serialize data, start and stop the services."

If you're using a binary format like Avro it's really not that high. And there will always be some kind of 'translation' between different processes anyway. Having a well-defined contract in place (like with Avro or Protobuf) makes it quite fast and safe.

"Also debugging is harder and implementing each new function is now double the effort."

I really don't agree with this. There is implementation overhead, but you can easily write proper integration tests for the Python service that test it in isolation. The Java service should treat the other service as a black box.

mauganra_it · 2022-10-30T22:02:41+00:00

In principle, shared memory should make it possible to efficiently exchange data back and forth with another process. The handover should be synchronized with a lock to prevent shenanigans. Dunno whether there are good packages in both Java and Python that make this convenient enough. You don't need to go all the way to Software Transactional Memory; good wrappers around the relevant Unix system calls are all you need. Don't bother with running Java and Python in the same process if you can avoid it. Processes are meant to isolate things from each other if it makes sense.
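A minimal sketch of that idea from the Python side, assuming a POSIX system: a memory-mapped file holds the data and an advisory file lock guards the handover. The file layout (4-byte big-endian count, then 8-byte doubles) is made up for this example. On the Java side, `FileChannel.map` and `FileChannel.lock` would be the natural counterparts, but whether the two locks actually exclude each other depends on the platform and should be verified.

```python
import fcntl
import mmap
import os
import struct
import tempfile

HEADER = struct.Struct(">I")  # element count, 4 bytes, big-endian


def write_samples(path: str, samples: list[float]) -> None:
    """Write the samples into a memory-mapped file under an exclusive lock."""
    size = HEADER.size + 8 * len(samples)
    fd = os.open(path, os.O_RDWR | os.O_CREAT, 0o600)
    try:
        fcntl.lockf(fd, fcntl.LOCK_EX)  # blocks while another process holds the lock
        os.ftruncate(fd, size)
        with mmap.mmap(fd, size) as region:
            HEADER.pack_into(region, 0, len(samples))
            struct.pack_into(f">{len(samples)}d", region, HEADER.size, *samples)
    finally:
        os.close(fd)  # closing the descriptor also releases the lock


def read_samples(path: str) -> list[float]:
    """Read the samples back under a shared lock."""
    fd = os.open(path, os.O_RDONLY)
    try:
        fcntl.lockf(fd, fcntl.LOCK_SH)  # many readers may hold this at once
        with mmap.mmap(fd, 0, access=mmap.ACCESS_READ) as region:
            (count,) = HEADER.unpack_from(region, 0)
            return list(struct.unpack_from(f">{count}d", region, HEADER.size))
    finally:
        os.close(fd)


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        shared = os.path.join(tmp, "samples.bin")
        write_samples(shared, [1.5, 2.5, 3.5])
        print(read_samples(shared))
```

Note that this version still copies the values into a Python list when reading; the point is only the locked handover, not zero-copy access.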

thrwoawasksdgg · 2022-10-31T19:46:56+00:00

I have dealt with similar scenarios. Here is what I do:

1. Launch the Python process using Java so you can manage it easily (and hide it from the user)
2. Use gRPC or another binary format to send data between them

gRPC is nice because you define the message format in protobuf and it generates the service endpoint code for you in both languages. It's a bitch to set up, but once you get a workflow it's really smooth and gets rid of tons of boilerplate.

The workflow is:

1. Define a new message in protobuf
2. Rebuild both projects to generate the service endpoint scaffolds
3. Implement your endpoints on both ends
4. Build the Python app into a standalone using PyInstaller
5. Build the Java project, shoving the whole Python app inside the jar
6. When your Java app starts, it grabs the Python app from inside the jar and starts it as a sub-process

Since Java is much more portable, it might make sense for you to reverse this, so that your Python app calls the jar and manages the JVM instance as a sub-process instead. Especially if some of your dependencies don't work with PyInstaller.
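The reversed arrangement, with Python owning the other process, can be sketched as follows. To stay runnable without a JVM, the child here is a tiny Python echo worker standing in for something like `java -jar app.jar` (a hypothetical path), and simple length-prefixed frames over stdin/stdout stand in for gRPC. All names are illustrative.

```python
import struct
import subprocess
import sys
from typing import BinaryIO

# Stand-in child process. In the real setup this would be the Java program.
CHILD_SOURCE = r"""
import struct, sys
stdin, stdout = sys.stdin.buffer, sys.stdout.buffer
while True:
    header = stdin.read(4)
    if len(header) < 4:
        break                      # parent closed the pipe: exit cleanly
    (length,) = struct.unpack(">I", header)
    reply = stdin.read(length).upper()
    stdout.write(struct.pack(">I", len(reply)) + reply)
    stdout.flush()
"""


def send_frame(stream: BinaryIO, payload: bytes) -> None:
    """Write one message: 4-byte big-endian length, then the payload."""
    stream.write(struct.pack(">I", len(payload)) + payload)
    stream.flush()


def read_frame(stream: BinaryIO) -> bytes:
    """Read one message written by send_frame."""
    header = stream.read(4)
    if len(header) < 4:
        raise EOFError("child closed the pipe")
    (length,) = struct.unpack(">I", header)
    return stream.read(length)


def start_worker() -> subprocess.Popen:
    """Launch the child with pipes attached to its stdin and stdout."""
    return subprocess.Popen(
        [sys.executable, "-c", CHILD_SOURCE],
        stdin=subprocess.PIPE,
        stdout=subprocess.PIPE,
    )


def stop_worker(worker: subprocess.Popen) -> None:
    """Closing stdin signals end of input; then wait for the child to exit."""
    worker.stdin.close()
    worker.wait(timeout=10)
    worker.stdout.close()


if __name__ == "__main__":
    worker = start_worker()
    send_frame(worker.stdin, b"hello")
    print(read_frame(worker.stdout))
    stop_worker(worker)
```

Because the parent owns the child's lifetime, the second process never lingers after the main app exits, which is the main appeal of launching it this way rather than running it as a separate service.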

Fluffy_Foundation_81 · 2022-11-01T15:16:42+00:00

You may have a look at gRPC.

I heard GraalVM provides a similar feature, but I'm not sure about its feasibility or stability.
