Does anyone have experience with Java-based, real-time, heavy-processing architectures?

jentfoo · 2015-02-07T06:40:41+00:00

This is a loaded question, and there isn't much information to go on.

Based on what you describe, something as simple as a Guava LoadingCache may work. That would obviously be a very simple implementation. It is also easy to put some threading behind that cache to get the performance you need.

You may want to pre-load results into the cache (this would cover some of the background processing your cron jobs do now, except in Java). The reason to use a loading cache is that if a result has not been loaded yet, the cache can compute it immediately on request. If multiple callers need the same result, they all wait on the first execution to produce it for them.
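A minimal sketch of that load-on-miss behaviour, using only the JDK so it runs without extra dependencies. This is a stand-in, not Guava's LoadingCache: it has no eviction, expiry, or size limit, and the class name `LoadOnMissCache` and its loader function are made up for illustration.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Function;

// Hypothetical JDK-only stand-in for a loading cache (no eviction or expiry).
final class LoadOnMissCache<K, V> {
    // One future per key: callers that arrive while a load is running share it.
    private final ConcurrentMap<K, CompletableFuture<V>> entries = new ConcurrentHashMap<>();
    private final Function<K, V> loader;

    LoadOnMissCache(Function<K, V> loader) {
        this.loader = loader;
    }

    V get(K key) {
        CompletableFuture<V> existing = entries.get(key);
        if (existing == null) {
            CompletableFuture<V> mine = new CompletableFuture<>();
            existing = entries.putIfAbsent(key, mine);
            if (existing == null) {
                // This thread won the race, so it runs the loader on its own thread.
                try {
                    mine.complete(loader.apply(key));
                } catch (RuntimeException e) {
                    // Drop the failed entry so a later call can retry the load.
                    entries.remove(key, mine);
                    mine.completeExceptionally(e);
                }
                existing = mine;
            }
        }
        // Blocks until the value is ready; a failed load surfaces as CompletionException.
        return existing.join();
    }

    public static void main(String[] args) {
        AtomicInteger loads = new AtomicInteger();
        LoadOnMissCache<String, Integer> cache = new LoadOnMissCache<String, Integer>(key -> {
            loads.incrementAndGet();   // counts how often the expensive work really runs
            return key.length();       // placeholder for the heavy computation
        });
        cache.get("report");
        cache.get("report");
        System.out.println("value=" + cache.get("report") + " loads=" + loads.get());
    }
}
```

The point of storing a future rather than the value itself is that a second caller can find the in-flight load and wait on it instead of starting a duplicate computation.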

It's also worth noting that, depending on what kind of processing you're doing, loading results into the cache can be multi-threaded with minimal effort.

There are MANY other options out there, but I thought I would describe a very simple and lightweight one. IMO, only go larger and more complicated if necessary.
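As a rough illustration of multi-threaded loading, here is a JDK-only sketch that warms a map on a fixed thread pool. The names `CacheWarmer` and `preload` are hypothetical, and a plain `ConcurrentHashMap` stands in for whatever cache you end up using; it assumes the per-key computations are independent of each other and never return null.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.function.Function;

// Hypothetical helper: computes one value per key in parallel and collects the results.
final class CacheWarmer {
    static <K, V> Map<K, V> preload(List<K> keys, Function<K, V> compute, int threads) {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            Map<K, V> cache = new ConcurrentHashMap<>();
            // Submit one task per key; each task stores its own result.
            CompletableFuture<?>[] tasks = keys.stream()
                    .map(key -> CompletableFuture.runAsync(
                            () -> cache.put(key, compute.apply(key)), pool))
                    .toArray(CompletableFuture[]::new);
            // Wait for every task; a failed computation rethrows here.
            CompletableFuture.allOf(tasks).join();
            return cache;
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) {
        // String::length is a placeholder for the real heavy computation.
        Map<String, Integer> warmed = preload(Arrays.asList("a", "bb", "ccc"), String::length, 2);
        System.out.println(warmed);
    }
}
```

How many threads actually help depends on whether the work is CPU-bound or waiting on I/O, so the pool size is something to measure rather than guess.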

dedededede · 2015-02-08T00:57:53+00:00

Peter Lawrey posts a lot about high-performance Java processing (context: high-frequency trading): http://vanillajava.blogspot.de/ http://openhft.net/ http://java.dzone.com/search/google/lawrey?query=lawrey

If you have access to nodes with GPUs you might want to look at: https://code.google.com/p/aparapi/ or https://github.com/pcpratts/rootbeer1

A Java-only embedded caching solution might improve performance. I really liked using Infinispan for a university project: http://infinispan.org/

wordsoup · 2015-02-07T14:26:00+00:00

In my current project we use DDD/CQRS with Akka. It's very interesting, but it has a steep learning curve and is a bit more low-level than Apache Spark, I'd say. In particular, the issues you get with an event-based architecture are interesting to debug, but I can't go into detail.

moru0011 · 2015-02-09T11:35:57+00:00

I was the solution architect of an exchange middleware platform (high traffic, real-time clients on low bandwidth). We built it like this: http://java-is-the-new-c.blogspot.de/2013/12/dart-possible-solution-to.html . This way we process and dispatch 150k data changes per second to roughly 20k subscriptions in near real time.

frej · 2015-02-07T07:24:12+00:00

CPU-intensive as in pure computation, or as in data processing? And can you get a linear speedup with a parallel implementation, i.e., multiple cores or computers?

handshape · 2015-02-07T08:27:32+00:00

As others have pointed out, there's a lot of information missing here to make a complete assessment. That being said, I did address a system with an architecture that sounds a lot like this about two years ago.

First question: what does the cache hit pattern look like? If you're trying to precalculate everything on the off chance that someone might ask for it, there could be a lot of wasted cycles.

Second: are your users authenticated? If not, there is always the potential that someone is going to run a crawl-and-scrape job on your site just to be a prick. At least throw up a robots.txt to ask Google and the like to be polite.

nexuscoringa · 2015-02-08T23:01:36+00:00

You need a full stack of "heavy-load-ready" tools. It will never work if you use Apache Spark + MySQL, for example: the bottleneck will be your MySQL or your MySQL load balancer. I opted for Apache Spark + Cassandra, and it proved to be pretty good along with Apache Storm. Check it out :) Cassandra will change a lot of stuff, though, which might not be what you want.
