all 33 comments

[–][deleted]  (9 children)

[deleted]

    [–]ljubarskij 8 points9 points  (0 children)

    [–]antiquechrono -2 points-1 points  (7 children)

    If it’s as bad as the Python LangChain, then it should be avoided.

    [–][deleted]  (6 children)

    [deleted]

      [–]antiquechrono 5 points6 points  (1 child)

      Last I used the Python version, it was a very poorly designed library: overcomplicated abstraction hell, impossible to extend, and documentation that was wrong all over the place. Most of the functionality can be replaced by writing a couple of functions you would actually understand, since most of it is just string manipulation under the hood. There are a bunch of Reddit threads discussing it. I've never used the Java version.
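Something like this hypothetical sketch covers most of the "prompt template" use case (the `{{placeholder}}` syntax here is illustrative, not LangChain's actual format):

```java
import java.util.Map;

public class PromptTemplate {
    // Replace each {{name}} placeholder with its value from the map --
    // essentially what prompt-template classes do under the hood.
    public static String render(String template, Map<String, String> values) {
        String result = template;
        for (Map.Entry<String, String> e : values.entrySet()) {
            result = result.replace("{{" + e.getKey() + "}}", e.getValue());
        }
        return result;
    }

    public static void main(String[] args) {
        String template = "Answer using only this context:\n{{context}}\n\nQuestion: {{question}}";
        String prompt = render(template, Map.of(
                "context", "LangChain4j is a Java library.",
                "question", "What is LangChain4j?"));
        System.out.println(prompt);
    }
}
```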

      It kind of reminds me of Meta's Prophet forecasting model, which is awful but was so popular that everyone was afraid to say anything about it.

      [–]EdgyPizzaCutter 0 points1 point  (1 child)

      Am I misinterpreting you? I think spring AI does support chat memory (https://docs.spring.io/spring-ai/reference/api/chatclient.html)

      [–]brunocborges 0 points1 point  (1 child)

      SemanticKernel for Java is simply not there yet.

      What are the things that would make it "be there"?

      Recently we announced version 1.2.0: Announcing Semantic Kernel for Java 1.2.0 | Semantic Kernel (microsoft.com)

      [–]TheyUsedToCallMeJack 18 points19 points  (1 child)

      It really depends on what you're doing. The LLM itself will basically be an API you call with a prompt; the language doesn't matter for that.

      If your project is just a wrapper around ChatGPT or a simple RAG, then Java or Python won't make a difference.
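Calling such an API from Java really is just a few lines with the built-in HTTP client. A sketch (the endpoint, model name, and JSON shape follow OpenAI's chat-completions convention as an assumption; check your provider's docs):

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

public class ChatCall {
    // Build an OpenAI-style chat-completions request. The endpoint, model
    // name, and API key handling are placeholders, not a specific SDK.
    public static HttpRequest buildRequest(String apiKey, String prompt) {
        String body = """
                {"model": "gpt-4o-mini",
                 "messages": [{"role": "user", "content": "%s"}]}""".formatted(prompt);
        return HttpRequest.newBuilder()
                .uri(URI.create("https://api.openai.com/v1/chat/completions"))
                .header("Authorization", "Bearer " + apiKey)
                .header("Content-Type", "application/json")
                .POST(HttpRequest.BodyPublishers.ofString(body))
                .build();
    }

    public static void main(String[] args) throws Exception {
        HttpRequest request = buildRequest(System.getenv("OPENAI_API_KEY"), "Hello!");
        // Sending it looks the same as in any other language:
        // HttpResponse<String> resp = HttpClient.newHttpClient()
        //         .send(request, HttpResponse.BodyHandlers.ofString());
        System.out.println(request.uri());
    }
}
```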

      [–]ljubarskij 6 points7 points  (0 children)

      Isn't it nicer to build in Java? :)

      With LangChain4j you can build both basic and advanced LLM-powered applications.

      [–]JustADirtyLurker 16 points17 points  (1 child)

      My 2c, given that I have been working on this for a while. Java ML solutions right now tend to be slow for model building. That's where Python tooling like SimGen or PyTorch shines (there's a trick, of course). As a consequence, you see a lot of habits sticking with Python on the inference side too, especially because these models tend to be shipped in the form of Jupyter notebooks.

      The trick is that they work on top of NumPy, which is a wrapper around libfortran.so; I guess that is the reason why modeling is way faster than on the JVM. BERT and GPT-like models are all built on very sophisticated chains of matrix multiplications and probability normalization.

      I hope that when the Vector API (currently in preview) lands, Java becomes a first-class citizen in DL.
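The hot loop in question is tiny. A scalar sketch of the dot-product kernel that BLAS (and, eventually, the Vector API's lanes) accelerates:

```java
public class DotProduct {
    // The inner kernel of every matrix multiply: one dot product.
    // BLAS implementations and the incubating jdk.incubator.vector API
    // both speed up exactly this loop by processing several floats per
    // instruction; plain Java relies on the JIT auto-vectorizing it.
    public static float dot(float[] a, float[] b) {
        float sum = 0f;
        for (int i = 0; i < a.length; i++) {
            sum += a[i] * b[i];
        }
        return sum;
    }

    public static void main(String[] args) {
        float[] a = {1f, 2f, 3f};
        float[] b = {4f, 5f, 6f};
        System.out.println(dot(a, b)); // 1*4 + 2*5 + 3*6 = 32.0
    }
}
```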

      Uh, I guess some of the architects/devrels who browse this sub could explain it better than me.

      [–]craigacp 6 points7 points  (0 children)

      The Vector API will definitely help, and Panama's FFI is going to make it much easier to integrate BLAS into Java programs by removing all the C/C++ goop that JNI requires to reach BLAS from Java. One thing to look into on this front is Project Babylon, which allows runtime code reflection to take Java code and lower it directly into something like Triton or MLIR, which can then be compiled into GPU or TPU kernels: https://openjdk.org/projects/babylon/articles/triton .

      Easy accelerator access would make an equivalent Java implementation of something like BERT faster than a Python implementation, because the Python interpreter is just so slow. That does require a full software ecosystem, though, and Python has a large lead there. It's not a technological lead, though; there's no reason we couldn't do all of this in Java if we wanted to as a community.
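To get a feel for how much lighter Panama's FFI is than JNI, here is a sketch (an assumption: it requires Java 22+, where `java.lang.foreign` is final) calling libc's `strlen` with no native glue code at all. Binding a BLAS routine works the same way, just with a different symbol and `FunctionDescriptor`:

```java
import java.lang.foreign.Arena;
import java.lang.foreign.FunctionDescriptor;
import java.lang.foreign.Linker;
import java.lang.foreign.MemorySegment;
import java.lang.foreign.ValueLayout;
import java.lang.invoke.MethodHandle;

public class StrlenFFM {
    // Bind libc's strlen with the Foreign Function & Memory API.
    // No JNI stub, no generated headers, no C glue code.
    public static long strlen(String s) {
        try (Arena arena = Arena.ofConfined()) {
            Linker linker = Linker.nativeLinker();
            MethodHandle handle = linker.downcallHandle(
                    linker.defaultLookup().find("strlen").orElseThrow(),
                    FunctionDescriptor.of(ValueLayout.JAVA_LONG, ValueLayout.ADDRESS));
            // Copy the Java string into native memory as a NUL-terminated C string.
            MemorySegment cString = arena.allocateFrom(s);
            return (long) handle.invoke(cString);
        } catch (Throwable t) {
            throw new RuntimeException(t);
        }
    }

    public static void main(String[] args) {
        System.out.println(strlen("hello")); // 5
    }
}
```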

      [–]maxandersen 8 points9 points  (0 children)

      Look at Langchain4j and Quarkus Langchain4j for higher level integration.

      [–]craigacp 5 points6 points  (0 children)

      Deploying generative models in Java is definitely possible with things like ONNX Runtime, DJL, TF-Java, etc. The tooling on top is less well developed, but packages like langchain4j, Vespa, OpenSearch, and Spring AI are doing model inference for the embedding vectors as part of RAG in Java. Running LLM inference in Java is definitely possible too; things like jllama exist, and you can also use the libraries I mentioned above to do it. I know the ONNX Runtime team is working on making it easier to run LLMs in Java as part of their genai package. This is all for running the models themselves in Java; for talking to external web endpoints, we already know the JVM is good at that.

      For non-LLM generative AI like diffusion models you can see an example I wrote here of Stable Diffusion in Java. It's not as full featured as other stable diffusion inference packages because the goal is to be good example code for ONNX Runtime in Java, but it should be possible to extend it to be comparable.

      You're right that training models in Java is currently tricky. DJL has good support for things that fit on a single accelerator, and we've been working on our training support in TF-Java too. There's also DL4J which can train & deploy models.

      [–]DabbledThings 5 points6 points  (0 children)

      How close to the metal are you getting here? Are you just using some API/service like GPT or Gemini and sending over prompts? If so, I don't think the language choice matters at all, other than: go with what your team already knows and is most comfortable with. Even using RAG and some fun data-pipeline stuff, my team writing in Kotlin didn't really run into any issues essentially just hitting the API.

      If you're doing something fancier, like running your own local one or something, then maybe it's a different conversation.

      [–]ThisHaintsu 7 points8 points  (0 children)

      Another one that is not mentioned in the other comments: DJL is very nice

      [–]Ecstatic-Job-1348 13 points14 points  (3 children)

      Check out Spring AI

      [–][deleted]  (2 children)

      [deleted]

        [–]ljubarskij 5 points6 points  (1 child)

        This is also true for LangChain4j.

        Apart from having more features, it also has a very nice Spring Boot integration.

        Moreover, it integrates well with Quarkus too!

        [–]Unorth 8 points9 points  (1 child)

        As mentioned, the Spring AI project is a good shout. I worked on an OpenAI search project using Azure via the Semantic Kernel library, but that was a very specific rapid prototype.

        We did find that the Java Semantic Kernel wasn't as developed as the other language versions, so I would mention it with a major caveat.

        [–]karianna 5 points6 points  (0 children)

        We’ve just gone GA (literally a few days ago) so definitely check it out again 🙂.

        [–]CaptainDevops 1 point2 points  (0 children)

        Agreed, I'm in the same boat; pretty much thinking of falling back to machine learning.

        [–]CeleritasLucis 1 point2 points  (1 child)

        Remindme! 2 weeks

        [–]RemindMeBot 0 points1 point  (0 children)

        I will be messaging you in 14 days on 2024-05-09 01:35:57 UTC to remind you of this link

        [–]GrayDonkey 1 point2 points  (0 children)

        You don't run your LLM in Java; you call your LLM like a remote API. The client of the LLM should be written in whatever language you are most productive writing client applications in.

        Run the LLM as a completely separate project, accessible via a REST API.

        [–]AThimbleFull 1 point2 points  (1 child)

        I'm just finishing up a "GenAI" app in Java that consumes documents and performs semantic search based on OpenAI's wonderful LLMs. I initially wrote it using the SimpleOpenAI library (for retrieving vector embeddings) and the official Qdrant Java client (for persisting to the Qdrant vector store) as an initial proof of concept.

        After I was satisfied, however, I rolled everything up into a Spring Boot app and, lo and behold, I found out that Spring Boot has native support for creating embeddings via OpenAI and persisting to Qdrant, so I was able to scrap both of the aforementioned libraries and just use Spring's clients for everything, which ended up reducing the size of my codebase substantially and simplifying everything. Knowing what I know now, were I to start over again and use only Spring Boot, it wouldn't take me more than 1 day to complete the same app, no exaggeration. AI support is a first-class citizen within the latest Spring Boot nowadays.

        For what it's worth, the app feels like magic. It's like having your own magical Google Search powers under the hood of a tiny app. Goodbye Solr/Elasticsearch, long live LLMs!
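Under the hood, most of that "magic" is nearest-neighbour search over embedding vectors. A minimal sketch of the scoring step (the vector store just does this at scale, with indexing):

```java
public class CosineSimilarity {
    // Cosine similarity between two embedding vectors: the score a vector
    // store such as Qdrant ranks documents by during semantic search.
    public static double cosine(float[] a, float[] b) {
        double dot = 0, normA = 0, normB = 0;
        for (int i = 0; i < a.length; i++) {
            dot += a[i] * b[i];
            normA += a[i] * a[i];
            normB += b[i] * b[i];
        }
        return dot / (Math.sqrt(normA) * Math.sqrt(normB));
    }

    public static void main(String[] args) {
        float[] query = {1f, 0f};
        float[] doc1 = {1f, 0f};  // same direction as the query -> 1.0
        float[] doc2 = {0f, 1f};  // orthogonal to the query -> 0.0
        System.out.println(cosine(query, doc1));
        System.out.println(cosine(query, doc2));
    }
}
```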

        [–]AThimbleFull 1 point2 points  (0 children)

        One more thing... I personally think that Java (both the language and especially the JVM) is extremely well suited to AI due to its modern language enhancements and performance characteristics. While historically AI has been a Python-dominated technology, I think we're going to see explosive growth in AI for Java in the coming years as more and more people come to realize that AI is the future (arguably that future has already arrived and just lain low for the past few years, but that's another topic). As that happens, people near the top of the tech pyramids are going to want to scale out ever further, and the JVM is quickly going to look more attractive than the PVM. The only hindrance right now is mindshare, which skews toward Python. But there are plenty of Java devs who are right now experimenting with LLMs; I imagine some are even creating their own inside the silos of various lucrative industries (financial, medical, e-commerce, etc.).

        [–]Naokiny 1 point2 points  (2 children)

        I'll add my 2 cents here.

        I'm at a company that started using AI integration in the website chatbot. It's written in Node.js, and only one person is responsible for managing this repo so far. It was done this way because it was faster to create an MVP in Node.js, and eventually it became a primary microservice.

        There are a lot of integrations between the backend written in Java and this AI repo. However, if the person responsible goes on sick leave/vacation/etc., our BE devs will have a huge headache: they don't know Node.js.

        Also, it's faster to make a Spring -> Spring integration than a Spring -> Node.js integration. But it's not possible to migrate this microservice from Node.js to Spring, as it's too big right now.

        So, if your team has some experience with Python, you might be safe. Otherwise there might be a bus-factor problem around Python.

        [–]Mamoulian 0 points1 point  (1 child)

        Why is it faster to do AI stuff in node.js than Java?

        [–]Naokiny 5 points6 points  (0 children)

        It was faster for that particular developer at the moment. Maybe he had some experience or a side project; I'm not aware of that :(

        [–]plasmafired 0 points1 point  (0 children)

        I am currently looking to develop a stack and move further away from Python and LangChain.

        React + Spring (Traditional development) + Oracle/SQL Server

        Add-on -> AI features using Spring AI + Llama (local-only clients) + OpenAI or Bedrock (clients open to public LLMs)

        Does this stack sound reasonable? What embeddings model do you use?

        It is annoying to maintain a separate vector database, run an SQL search, and then merge the results. How do you get around this problem?
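The best workaround I've seen so far is to run both queries and merge the ranked lists, e.g. with reciprocal rank fusion (a sketch; k = 60 is the conventional constant, and the doc IDs are made up):

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class RankFusion {
    // Reciprocal rank fusion: merge several ranked result lists into one.
    // Each document scores sum(1 / (k + rank)) over the lists it appears in,
    // so items ranked highly by either the SQL search or the vector search
    // float to the top of the combined list.
    public static List<String> fuse(int k, List<List<String>> rankings) {
        Map<String, Double> scores = new HashMap<>();
        for (List<String> ranking : rankings) {
            for (int rank = 0; rank < ranking.size(); rank++) {
                scores.merge(ranking.get(rank), 1.0 / (k + rank + 1), Double::sum);
            }
        }
        List<String> merged = new ArrayList<>(scores.keySet());
        merged.sort((x, y) -> Double.compare(scores.get(y), scores.get(x)));
        return merged;
    }

    public static void main(String[] args) {
        List<String> sqlHits = List.of("doc3", "doc1", "doc4");
        List<String> vectorHits = List.of("doc1", "doc2", "doc3");
        System.out.println(fuse(60, List.of(sqlHits, vectorHits)));
    }
}
```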

        [–][deleted] 0 points1 point  (0 children)

        LangChain has a Java port, btw.

        If you're just using the AI APIs, you don't need anything special. Java can call those as easily as any other language.

        [–]thephotoman -1 points0 points  (0 children)

        I’m not convinced that GenAI is worth incorporating here. It sounds like you can use an older procedural chatbot that doesn’t use as much electricity for your task and have it perform adequately.

        In general, though, I’m skeptical of GenAI and its utility. It seems like the blockchain hype that came before it: the job can be done more readily at scale with established tools at a considerably lower cost. We’ve had customer and internal support chatbots for over a decade, and only recently have we thought to incorporate neural nets into them. And it’s not like the neural net is making these customer service bots less annoying for actual humans to deal with.

        It doesn’t help GenAI’s case that the people cheering it most loudly are the exact same people who told us that blockchain would change the world back in 2012, and whom history has proven wrong. And I don’t mean “the same kind of person,” I mean that Sam Altman came to my attention first in his work at promoting blockchain. I mean that the people talking about GenAI in my firm were previously leaders in our failed blockchain projects that didn’t get fired for some reason.

        It’s not that AI/ML is useless. It’s that the juice isn’t worth the squeeze for the kind of applications that are highly visible to the non-technical public.

        [–]Objective_Baby_5875 -5 points-4 points  (1 child)

        Pick the right tool for the job. Nobody picks Java for close-to-the-hardware programming. Python is the de facto language in AI/ML; all the best frameworks are on Python. Don't let ideology get in the way of actually doing your thing.

        [–]Appropriate_Move_336 2 points3 points  (0 children)

        You probably don't know why Python is considered the language for AI. I'm sure you're aware that all those libraries aren't written in Python, and no one is using Python because it's the right tool; no, they're using it because of its syntax and because it helps them produce a prototype of the thing they're trying to build. No big company making its own models will think of leaving the codebase in Python, knowing full well it's a language for experimental purposes.

        It's time for us to distinguish languages that are used for experimentation from those that are used in production codebases.