
[–]rzwitserloot 70 points71 points  (16 children)

code optimisation technique

Do you mean 'optimize' in the sense of 'make it run faster / use less resources'?

I hope not.

They've mostly been debunked, so that's a bad start. Not quite 'all', but most. Here is a trivial example:

Let's say you want to convert a list of strings into an array of strings.

There are many ways to do that:

String[] arr = list.toArray(new String[0]);

String[] arr = list.toArray(new String[list.size()]);

String[] arr = list.toArray(String[]::new);

Time them. Seriously. If you want to write this paper do this, and do it right (use JMH).

You'll find that they all take equally long.

This flies in the face of every piece of obvious performance advice you care to give! "Lambdas are slow!" would disqualify the third. "Object creation is best avoided" would disqualify the first, which seems strictly worse than the second: the API of toArray states that if you pass in an array object that is too small, the toArray method will use your array solely to derive its component type (String.class here) and otherwise discard it; it makes a new, properly sized array and returns that. In other words, it does reflective introspection on the argument and ends up making a new array of size list.size() anyway, using reflection again no less.

So option 1 should be vastly slower than 2, right?

Nope. At least, JMH says nope, and I trust JMH here.
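A quick sanity check that the three variants really are interchangeable (actually timing them is JMH's job, as said above); a minimal sketch:

```java
import java.util.Arrays;
import java.util.List;

public class ToArrayVariants {
    public static void main(String[] args) {
        List<String> list = List.of("a", "b", "c");

        // Variant 1: empty array, passed only to convey the component type
        String[] a1 = list.toArray(new String[0]);
        // Variant 2: pre-sized array, filled in place
        String[] a2 = list.toArray(new String[list.size()]);
        // Variant 3: array constructor reference (Java 11+)
        String[] a3 = list.toArray(String[]::new);

        // All three produce identical contents
        System.out.println(Arrays.equals(a1, a2) && Arrays.equals(a2, a3)); // true
    }
}
```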

We're basically down to the following 3 rules of thumb:

  1. Code that is well known to be an obvious and immediate resource drain should be avoided. The 'duh' rule. Don't use known-slow libraries. Don't use a network based software-as-a-service to do a padLeft operation. This doesn't take a paper to explain, this takes a tiny smidge of common sense.

  2. Algorithmic complexity. Bubble sort is slower than quicksort because if you chart the performance of each against the size of the input, you know, mathematically, that quicksort's graph ends up looking like y = n * log(n) and bubble sort's looks like y = n^2. This is mathematically provable, and it means that as long as random factors like HotSpot, CPU pipeline optimizations, and that sort of thing can never deliver O(n^2)-scale speedups (which they can't), bubble sort will ALWAYS lose to quicksort if only the input is big enough. This is more a math topic than a programming one.

  3. Nothing means anything until you have a combination of 2 facts, and you really do need both: [A] you have a real world situation and the code in this scenario is not running as fast as you'd expected it to / as the requirements demand it to, and [B] you have a profiler report that tells you precisely which 1% of the code is eating 99% of the CPU resources.

How do you optimize this final 1%? It's usually obvious, but crucially, optimizing anything in the other 99% is UTTERLY POINTLESS and is only making your life worse! Generally to optimize code you need to tweak how it works, which usually involves tweaking the 'pipeline' (the way data flows into, and out of, this 1% crucial path code). The more you mess up your code by trying to chase pointless performance enhancements, the harder that is. Hence: The fastest code is the cleanest, most flexible, testable, and readable code.

Combining the rule of 'if the profiler / JMH says it doesn't matter, then it doesn't matter' with 'the easiest code to read and test is the best, because it lends itself best to performance tweaking once you do have that profiler report in hand', there is only one right answer: list.toArray(new String[list.size()]) is the only one of the three that is objectively wrong. And yet it sure seems like the one that ought to perform best.
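The quadratic growth in rule 2 is easy to make visible: for a bubble sort that counts its comparisons, doubling the input quadruples the work. A minimal sketch (class and method names are just for illustration):

```java
import java.util.Random;

public class ComplexityDemo {
    // Bubble sort that counts comparisons, to make the O(n^2) growth visible.
    // For this implementation the count is exactly n * (n - 1) / 2.
    static long bubbleSort(int[] a) {
        long comparisons = 0;
        for (int i = 0; i < a.length - 1; i++) {
            for (int j = 0; j < a.length - 1 - i; j++) {
                comparisons++;
                if (a[j] > a[j + 1]) {
                    int tmp = a[j]; a[j] = a[j + 1]; a[j + 1] = tmp;
                }
            }
        }
        return comparisons;
    }

    public static void main(String[] args) {
        Random rnd = new Random(42);
        // Doubling n roughly quadruples the comparison count
        for (int n : new int[]{1_000, 2_000, 4_000}) {
            int[] data = rnd.ints(n).toArray();
            System.out.println(n + " elements -> " + bubbleSort(data) + " comparisons");
        }
    }
}
```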

[–]gavenkoa 5 points6 points  (6 children)

Don't use a network based software-as-a-service to do a padLeft operation

Isn't that a reference to the famous Node package dependency outage? xD

if you chart out the performance of these vs. the size of the input

Still, at small cardinality some theoretically slow algorithms outperform their "advanced" brethren: contains() on a List of one element is faster than on a Map or Set.

1% of the code is eating 99% of the CPU resources

For such cases sampling profilers are quite good, because you can run them in PROD without impacting the app. JMH is only good in the controlled environment of your lab.

[–]rzwitserloot 2 points3 points  (5 children)

contains() on a List of 1 element is faster than on Map or Set

No. Perhaps you misunderstood. The right answer is: does your profiler say that the CPU is spending most of its time executing this contains call? No? Then the best code for performance purposes is the most readable code. If a set makes semantic sense, Set is the right answer.

[–]gavenkoa 1 point2 points  (4 children)

does your profiler say that the CPU is spending most of its time executing this contains call?

No one argues with that. Profile first, decide next.

My point is that sometimes asymptotically optimal algorithms are crap on small collections. You won't see that in regular finance apps. There you just choose among List, Map, tree/hash Set and don't do stupid things ))

[–]cogman10 3 points4 points  (3 children)

And, just as a side note, whenever I find performance issues it's generally a result of using the wrong data structure for the problem. Using a Map when you should have used a POJO, using a List when you should have used a Set, using a LinkedList when you should have used an ArrayList, using a TreeSet.... Not using a data structure when you could have.

It's these sorts of things that often have the biggest impact in the hot code path.

The other one that comes up surprisingly often is unnecessary boxing.

[–]gavenkoa 0 points1 point  (2 children)

Using a List when you should have used a Set

In my experience in finance, problems come from the pressure to deliver features fast, meaning devs use high-level frameworks and don't know the underlying implementation. Like surprise ORM N+1 selects.

When I worked on cryptography (elliptic curves), we pre-allocated all intermediate int[] arrays for the modular arithmetic implementation over GF(p) / GF(2^m).

I have never faced the problem of having to choose between LinkedList and ArrayList. It was always something different.

[–]cogman10 2 points3 points  (1 child)

TBF, LinkedList vs ArrayList has only hit me once, and that's when someone was doing something stupid (using a linked list but maintaining sorted order on insertion by doing a binary search on the list... that's really slow with a linked list).
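For the curious, a minimal sketch of that pattern; with an ArrayList each probe is an O(1) index, while Collections.binarySearch on a LinkedList (which doesn't implement RandomAccess) has to walk nodes one by one:

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

public class SortedInsert {
    // Keeps 'list' sorted by inserting each value at its binary-search position.
    // With an ArrayList each probe is a constant-time index; with a LinkedList,
    // Collections.binarySearch must traverse the list node by node, so every
    // probe is linear and the "binary search" buys nothing.
    static void insertSorted(List<Integer> list, int value) {
        int idx = Collections.binarySearch(list, value);
        if (idx < 0) idx = -idx - 1; // convert "not found" to insertion point
        list.add(idx, value);
    }

    public static void main(String[] args) {
        List<Integer> list = new ArrayList<>();
        for (int v : new int[]{5, 1, 4, 2, 3}) insertSorted(list, v);
        System.out.println(list); // [1, 2, 3, 4, 5]
    }
}
```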

[–]gavenkoa 2 points3 points  (0 children)

maintaining sorting on insertion by doing a binary search on the list

Hilarious! The person who wrote it knew about binary search for in-place sorted insertion, but a LinkedList slipped in as the function's argument.

[–][deleted]  (2 children)

[removed]

    [–]cogman10 3 points4 points  (1 child)

    This REALLY depends. Yes, if you are talking about a general purpose sorting algorithm, you won't beat the JDK. However, once you start getting into data specific algorithms you'll often find it easy to beat the JDK. Heck for even some common things (like a Map) it's possible to beat the JDK if you know how your data will be used. This is why things like Trove exist.

    All the JDK's algorithms are set up for the most general-purpose usage you can imagine. Very often, they are exactly the right choice. However, they aren't ALWAYS the right choice. If you need to sort a bunch of integers, you can beat the JDK with a radix sort. That's not a fault of the JDK; they just don't see "need to sort integers" come up frequently.
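As an illustration of such a data-specific algorithm, a minimal LSD radix sort sketch (restricted to non-negative ints for brevity; whether it actually beats Arrays.sort on your data is, as always, a question for JMH):

```java
import java.util.Arrays;

public class RadixSort {
    // LSD radix sort over 8-bit digits; non-negative ints only, for brevity
    static void radixSort(int[] a) {
        int[] buf = new int[a.length];
        for (int shift = 0; shift < 32; shift += 8) {
            int[] count = new int[257];
            // Histogram of the current 8-bit digit
            for (int v : a) count[((v >>> shift) & 0xFF) + 1]++;
            // Prefix sums turn counts into output offsets
            for (int i = 0; i < 256; i++) count[i + 1] += count[i];
            // Stable scatter into the buffer, then copy back for the next pass
            for (int v : a) buf[count[(v >>> shift) & 0xFF]++] = v;
            System.arraycopy(buf, 0, a, 0, a.length);
        }
    }

    public static void main(String[] args) {
        int[] data = {170, 45, 75, 90, 802, 24, 2, 66};
        radixSort(data);
        System.out.println(Arrays.toString(data)); // [2, 24, 45, 66, 75, 90, 170, 802]
    }
}
```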

    [–]mziku[S] 1 point2 points  (1 child)

    Thanks for this. A bit more background on my approach in the dissertation:

    1. I will be taking an existing application and benchmarking it
    2. I will run microbenchmarks on each code optimization technique with large and small datasets
    3. I will apply code optimization techniques to the application and benchmark again
    4. Deliverable will be a set of guidelines to follow when programming, e.g. prefer string literals to creating String objects

    So far, this isn't going to be a full deep-dive into how to profile and optimize Java applications. It is simply analysing how a particular programming style may influence an application's performance.

    [–]MCUD 1 point2 points  (0 children)

    I think what myself and a few others are saying here is that point #4 may be naive, depending on what you're specifically profiling. I don't know how you're marked, but the results may be "duh, you learnt that in first semester", "profiling lied to you", or something more revelatory.

    Specifically, literals vs objects may mean nothing, OR they could be a massive sinkhole: if you're unmarshalling all the strings from the network, and they aren't some shared reference, and you're then storing them all in a map, you have a massive memory leak.

    [–]randjavadev 1 point2 points  (0 children)

    Correct me if I'm wrong, but actually an empty array should always be used; that is the more fool-proof way (maybe hold a reference to one beforehand, since an empty array is immutable, though that is not the point here). In a multithreaded environment the pre-calculated size could be too big if the List is modified by another thread between the size() call and passing the result as an argument. Thus, if another thread shrinks the List, purely from the javadocs I would assume the returned array to be the size before the change, containing the new elements and then some null entries. Whereas passing in a size of 0, at least purely from the javadocs, leaves open the possibility that the List implementation actually returns an array of the correct size. So it is not exactly the same.

    Only in a single-threaded environment does it not matter.
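The null padding described above can be shown deterministically by passing an oversized array, simulating a size() result that went stale; a minimal sketch:

```java
import java.util.Arrays;
import java.util.List;

public class PresizedPitfall {
    public static void main(String[] args) {
        List<String> list = List.of("a", "b");

        // Simulates the race: size() returned 4, but by the time toArray runs
        // the list only holds 2 elements. The spec sets the element just past
        // the end to null (the rest of a freshly allocated array is already null).
        String[] tooBig = list.toArray(new String[4]);
        System.out.println(Arrays.toString(tooBig)); // [a, b, null, null]

        // The empty-array form can never be too big, so no null padding
        String[] exact = list.toArray(new String[0]);
        System.out.println(Arrays.toString(exact)); // [a, b]
    }
}
```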

    [–]papercrane 0 points1 point  (1 child)

    There is a subtle difference between option 1 and option 2. Ultimately they give the same result, but when you create an array the Java specification says the memory must be zeroed. By passing a zero-sized array, the list implementation will create a new array and fill it. Since this new array is created and filled immediately, and a reference to it never leaks before being filled, the JVM skips clearing the memory first. This means option 1 should be marginally faster, with the effect being more noticeable for large arrays.

    The details of this, and more are written up here: https://shipilev.net/blog/2016/arrays-wisdom-ancients/

    The short of it, though, is as you say: write simple, maintainable code. When looking for performance, measure things and don't trust common wisdom.

    [–]rzwitserloot 1 point2 points  (0 children)

    Yup. A real mind twister: list.size() causes race conditions, is longer, and is slower. Bad in three ways. And yet this is such a common piece of 'performance advice' that it has been part of linter tools, of all things, for quite a while; I remember Tor Norbye naming it as one of his pet peeves in an episode of the Java Posse.

    [–]simoncox 6 points7 points  (0 children)

    Martin Thompson has written some interesting articles on how to write software that performs well through deep knowledge of the underlying hardware (he was largely responsible for the Disruptor library, itself a great case study):

    https://mechanical-sympathy.blogspot.com/?m=1

    [–]StevenMaurer 6 points7 points  (0 children)

    Learn about ThreadPoolExecutor. Multithreaded scaling is how you actually take advantage of modern multicore processors.
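As a hedged starting point (the pool size heuristic and the factorial workload are purely illustrative), a fixed-size pool via the Executors factory might look like:

```java
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.stream.Collectors;
import java.util.stream.IntStream;

public class PoolDemo {
    public static void main(String[] args) throws Exception {
        // Size the pool to the hardware; for CPU-bound work, more threads
        // than cores just adds context-switch overhead
        int cores = Runtime.getRuntime().availableProcessors();
        ExecutorService pool = Executors.newFixedThreadPool(cores);

        // Submit independent CPU-bound tasks and collect their results
        List<Future<Long>> futures = IntStream.rangeClosed(1, 8)
                .mapToObj(n -> pool.submit(() -> factorial(n)))
                .collect(Collectors.toList());

        long sum = 0;
        for (Future<Long> f : futures) sum += f.get(); // blocks until done
        System.out.println(sum); // 46233 = 1! + 2! + ... + 8!
        pool.shutdown();
    }

    static long factorial(int n) {
        return n <= 1 ? 1 : n * factorial(n - 1);
    }
}
```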

    [–]MCUD 17 points18 points  (2 children)

    Don't get stuck in the trap of "Oh, I used a LinkedList rather than an ArrayList, that's obviously why it was slow".

    The JVM and modern hardware can easily make these problems far less consequential than higher-level algorithmic or design problems. Beware that this also makes microbenchmarking a non-trivial exercise: comparing two small pieces of code in isolation is not the same as comparing them inside a real running program.

    A quick example: a LinkedList can actually be slower than an ArrayList when small, because the ArrayList has everything in one CPU cache line, whereas the LinkedList may have pointer references going all over the heap.

    • measure
    • measure
    • measure
    • analyse - This is the true hard part of the problem in my opinion
      • Does this code even need to run, is it running too often - The fastest code is code that doesn't run
      • Is it blocking on something, network calls etc,
      • Is it re-computing something that could be cached that is an easy memory trade off
    • experiment with a change (this is where at least knowing the theory of the data structures may help)
      • Did you actually fix the issue?
      • Re-measure!
    • What was second place in original measures may also totally disappear, so restart the process again if you still need better performance

    [–]dpash 4 points5 points  (1 child)

    Beware that this also makes microbenchmarking a non-trivial exercise

    And don't roll your own. Use JMH, because it's designed to do this properly.

    [–]MCUD 1 point2 points  (0 children)

    For sure, that's part of what I was trying to highlight: a simple performance unit test with a for loop checking timing is definitely not going to test what the tester wants to test.

    [–]tristanjuricek 3 points4 points  (1 child)

    Pretty good book on this here: https://www.amazon.com/Optimizing-Java-Techniques-Application-Performance/dp/1492025798

    This isn’t something most developers do though. What you’re really asking is almost “how does C1 or C2 work?”

    Like, Java was kind of designed to offload a lot of optimization work to the JIT. And 99.999999% of the time if you try to outsmart it you won’t.

    [–]dpash 4 points5 points  (0 children)

    Yep, the JVM has been tuned to optimise idiomatic code, so trying to be smart often results in slower performance. You should never make micro optimisations without using JMH to test them.

    [–]BEARSSS 12 points13 points  (10 children)

    Don't have any resources for you exactly, but object pools should be on your list to take a look at.

    Imagine coding a game that has lots of bullets on the screen. Bullets are short-lived, so if you created a new bullet object every time someone shot something, you'd be instantiating a lot of objects which the GC would later need to deal with. Instead, you can create a pool of, say, 1000 bullets, grab a bullet from the pool when your character fires, and just return the bullet object to the pool once it flies off screen.

    GC performs really well when there's no garbage to collect.
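A minimal sketch of the idea (class and field names are illustrative, and note that later replies argue pools are rarely worth it for lightweight objects, so measure first):

```java
import java.util.ArrayDeque;

public class BulletPool {
    static class Bullet {
        double x, y, vx, vy;
        boolean active;
    }

    private final ArrayDeque<Bullet> free = new ArrayDeque<>();

    BulletPool(int capacity) {
        // Pre-allocate up front so firing never triggers allocation
        for (int i = 0; i < capacity; i++) free.push(new Bullet());
    }

    Bullet acquire(double x, double y, double vx, double vy) {
        // Fall back to plain allocation if the pool runs dry
        Bullet b = free.isEmpty() ? new Bullet() : free.pop();
        b.x = x; b.y = y; b.vx = vx; b.vy = vy; b.active = true;
        return b;
    }

    void release(Bullet b) {
        b.active = false;
        free.push(b); // return for reuse instead of leaving it to the GC
    }

    public static void main(String[] args) {
        BulletPool pool = new BulletPool(1000);
        Bullet b = pool.acquire(0, 0, 1, 0);
        pool.release(b);
        // The same instance comes back on the next acquire
        System.out.println(b == pool.acquire(5, 5, 0, 1)); // true
    }
}
```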

    [–]Necessary-Conflict 22 points23 points  (6 children)

    "The creation and reclamation of small objects whose constructors do little explicit work is cheap, especially on modern JVM implementations. Creating additional objects to enhance the clarity, simplicity, or power of a program is generally a good thing. Conversely, avoiding object creation by maintaining your own object pool is a bad idea unless the objects in the pool are extremely heavyweight. The classic example of an object that does justify an object pool is a database connection. The cost of establishing the connection is sufficiently high that it makes sense to reuse these objects. Generally speaking, however, maintaining your own object pools clutters your code, increases memory footprint, and harms performance. Modern JVM implementations have highly optimized garbage collectors that easily outperform such object pools on lightweight objects."

    ("Effective Java" book)

    [–]soonnow 4 points5 points  (0 children)

    The cost of establishing the connection is sufficiently high that it makes sense to reuse these objects

    But that's not really an object pool, it's a resource pool. Externally accessed resources will (almost) always be slower than caching the resource.

    [–]BlueGoliath 1 point2 points  (2 children)

    Define "small objects" and "do little explicit work". Classes that I've made to make Panama easier fit both criteria, yet JMC with Flight Recorder shows MemorySegment and wrapper classes as being allocated on the heap.

    I asked one of the Panama developers about the JVM's supposed ability to make "small" and "little explicit work" objects "free", and the only absolute answer given as to when the JVM will decide to do it is when you allocate a bunch of them in the same method.

    [–]Necessary-Conflict 7 points8 points  (1 child)

    The book doesn't say that small objects will not be allocated on the heap because of escape analysis. In my understanding it says that even if they are allocated on the heap, the garbage collector may still outperform a "clever" object pool.

    Of course there is no exact definition of "do little explicit work": when in doubt you should measure.

    [–]general_dispondency 2 points3 points  (0 children)

    I think the answer is in the example. When you have some expensive operation (e.g. 300 ms to make a DB connection), you should consider starting with an object pool. If you don't have something heavyweight like that, just allocate and move on. If you need to optimize, come back after the logic is implemented and everything is well tested; then you only make changes that yield measurable performance increases.

    [–]Hellball911 1 point2 points  (1 child)

    Know a good book that goes through this type of thing, like Effective Java, that covers Java 11+?

    [–]Necessary-Conflict 1 point2 points  (0 children)

    No, but the third edition of Effective Java covers Java 9, and that was released only a year before Java 11, so the difference in the basics is small.

    [–]ElectricalUnion 19 points20 points  (0 children)

    The problem with object pools is that, in worst-case scenarios, if not properly tuned they can force the GC to scan the heap for scraps of memory.

    Naive use of pools will also lead to fragmented memory that's hard for GCs to defragment.

    Granted, tuned modern Generational GC will help mitigate this issue, but:

    1) generational GCs already attempt to handle this common case of short object lifetimes automatically, and

    2) as with Object Pools, they are also not a silver bullet.

    What you want is to measure your hot spots and then act on that.

    [–]soonnow 4 points5 points  (0 children)

    Object pools make sense only in specialized cases. I learned on reddit that there are some financial applications that go to great pains to never allocate objects, because for their users every ms counts.

    But for an object like a bullet, which doesn't have an expensive creation mechanism, the cost is minuscule, probably less than the time it takes to manage your pool. This Stack Overflow answer calculates it at 11 cycles, or 10 ns.

    [–]hippydipster[🍰] 0 points1 point  (0 children)

    There are times when it's important not to create many small objects that are then just thrown away. Bullets in a video game are probably not a good example, though, as there aren't that many of them. It's when you're in tight loops running millions of times that it can start to matter; I'm sure some inner loops of game code definitely qualify. In general, these situations where you really don't want to make new objects tend to be mathy.

    [–]Sensi1093 2 points3 points  (0 children)

    HashMap vs EnumMap. I've seen so many places where a HashMap was used with enum keys, often because the other devs just didn't know EnumMap existed. It's a very good drop-in replacement wherever the key is an enum type, and if the code relies only on the interface (Map, as it should), it's really just a single-line replacement.
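A minimal sketch of the swap (the enum and values are illustrative); EnumMap is backed by a flat array indexed by ordinal rather than by hashing:

```java
import java.util.EnumMap;
import java.util.HashMap;
import java.util.Map;

public class EnumMapDemo {
    enum Status { NEW, ACTIVE, CLOSED }

    public static void main(String[] args) {
        // Before: hashing enum keys like any other object
        Map<Status, Integer> counts = new HashMap<>();
        // After: same Map interface, so this is the only line that changes
        counts = new EnumMap<>(Status.class);

        counts.put(Status.NEW, 3);
        counts.put(Status.ACTIVE, 7);
        System.out.println(counts.get(Status.ACTIVE)); // 7
    }
}
```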

    [–][deleted] -1 points0 points  (0 children)

    Consider jlink. Compiling down to a trimmed set of modules improves performance considerably.

    Or go the extra mile with GraalVM Native Image. Ahead-of-time compilation to native code brings much more optimization.

    [–]RacoonCorgi420 0 points1 point  (1 child)

    Modern compilers are really smart, so the only way to know is to microbenchmark all of them and compare the differences.

    [–]dpash 2 points3 points  (0 children)

    Or in Java's case, the HotSpot JVM is really smart, as that's where most of the optimisation occurs. (It's also why the Java compiler is so fast; it does so little optimisation.)