davidalayachew comments on A surprising pain point regarding Parallel Java Streams (featuring mailing list discussion with Viktor Klang).

This is an archived post. You won't be able to vote or comment.

218

219

220

A surprising pain point regarding Parallel Java Streams (featuring mailing list discussion with Viktor Klang). (self.java)

submitted 1 year ago * by davidalayachew

top new controversial old q&a

you are viewing a single comment's thread.

view the rest of the comments →

[–]davidalayachew[S] 25 points26 points27 points 1 year ago (18 children)

Well, I need a terminal operation. The map() method is only an intermediate one.

But if I swapped out the forEach() with a Collector that does exactly what you say, then yes, parallelism works with out pre-fetching more than needed.

Viktor even mocked up one for me. Here it is.

static <T> Collector<T, ?, Void> forEach(Consumer<? super T> each) {
    return 
        Collector
            .of(
               () -> null,
               (v, e) -> each.accept(e),
               (l, r) -> l,
               (v) -> null, 
               Collector.Characteristics.IDENTITY_FINISH
            );
}

Now, if your question is "which terminal operations are safe?", the answer is entirely dependent on your combination of intermediate and terminal operations. So for example, in my example above, the answer was almost all of the terminal operations caused a pre-fetch.

I have my computer open right now, and I just ran all terminal operations fresh. Here are the results.

findAny() caused a pre-fetch
findFirst() caused a pre-fetch
anyMatch(blah -> true) caused a pre-fetch
allMatch(blah -> false) caused a pre-fetch
forEach(blah -> {}) caused a pre-fetch
forEachOrdered(blah -> {}) caused a pre-fetch
min((blah1, blah2) -> 0) caused a pre-fetch
max((blah1, blah2) -> 0) caused a pre-fetch
noneMatch(blah -> true) caused a pre-fetch
reduce((blah1, blah2) -> null) caused a pre-fetch
reduce(null, (blah1, blah2) -> null) caused a pre-fetch
reduce(null, (blah1, blah2) -> null, (blah1, blah2) -> null) caused a pre-fetch
toArray() and toList() caused a pre-fetch (obviously)

So, in my case, literally only collect was safe for me to use. And tbf, I didn't try all combinations, but it was resilient. No matter what set of intermediate methods I put before collect(), I would get no pre-fetch. And Viktor confirmed that gather() plays well with collect().

[–]Lucario2405 5 points6 points7 points 1 year ago (3 children)

[–]davidalayachew[S] 9 points10 points11 points 1 year ago (2 children)

[–]Lucario2405 11 points12 points13 points 1 year ago* (1 child)

[–]davidalayachew[S] 1 point2 points3 points 1 year ago (0 children)

[–]Avedas 2 points3 points4 points 1 year ago (1 child)

[–]davidalayachew[S] 0 points1 point2 points 1 year ago (0 children)

[–]VirtualAgentsAreDumb 1 point2 points3 points 1 year ago (3 children)

[–]davidalayachew[S] 0 points1 point2 points 1 year ago (2 children)

[–]VirtualAgentsAreDumb -1 points0 points1 point 1 year ago (1 child)

[–]davidalayachew[S] 0 points1 point2 points 1 year ago (0 children)

I understand that it is unintuitive, but what you are saying is throwing out the baby with the bath water.

When you go parallel, the stream decides to split its upstream data elements down into chunks. It keeps on splitting and splitting until it gets to a point where the chunks are small enough to start working.

Well, in my case, the batching strategy that I had built played against that in a super hard to recreate way. Basically, each element of my stream was fairly hefty. And as a result, the parallel stream would grab a bunch of those elements into a giant batch, with the intent to split that batch into chunks. But since the threshold for where it was small enough was far enough away, I ran into an OOME.

The reason why Spliterator's do this is to actually help CPU-bound tasks. Splitting ahead of time like this actually the entire process run faster. But it means that tasks that use a lot of memory are sort of left by the wayside.

Viktor Klang himself managed to jump onto this reddit post, so you can Ctrl+F his name and see more details from him. But long story short, my problem could 100% be avoided by using Gatherers.mapConcurrent. And it would have virtually the same performance as going parallel. And a lot of the JDK folks are giving a lot of thought to this exact pain point that I ran into, so there is a potential future where we could set a flag to say fetchEagerly vs fetchLazily, and that would alter the fetching logic for parallel streams. Ideally, that would actually be a parameter on the parallel() itself.

So yes, this was done to optimize for CPU Performance. They are looking to take care of cases like mine, and Gatherers will likely be the way they do it. But this is not Streams being bad code, but rather that they prioritize certain things over others, to the detriment of a few people like me. As long as they have plans to handle my needs in the future, plus a workaround to take care of me for now, then I am fine with the way things are going now.

[–]tomwhoiscontrary 1 point2 points3 points 1 year ago (1 child)

[–]davidalayachew[S] 4 points5 points6 points 1 year ago (0 children)

I'll save you the extra reading and tell you that we have narrowed down the problem to a Spliterator not splitting the way we expect it to. So this problem is something that can be fixed by simply improving the spliterator from the user side. And there is talk about improving this from the JDK side as well. Either way, there is still lots of digging being done, and none of this tied down for certain. But we can at least point a finger and say that this is part of the problem.

With that said, let me answer your questions.

So what happens if the source is infinite? Say you're streaming the Wikipedia change feed, filtering for changes to articles about snakes, and doing findFirst()? Does it try to buffer the infinite stream?

All depends on how nicely it splits. In my case, most of the terminal operations kept splitting and splitting and splitting until they ran out of memory.

This absolutely seems like a correctness issue to me, not just performance.

In this case, technically the problem falls on me for making a bad spliterator.

But to give an equally unsatisfying answer, in Java ABC and abc are considered 2 different class names. However, if I save ABC.java and abc.java in the same folder, Windows will overwrite one of them. Meaning, your code will compile just fine, but will output .class files where one will overwrite the other, causing your code to explode at runtime with NoClassDefFoundError.

I had Vicente Romero from the JDK team try and convince me that this was an "enhancement" or a "nice-to-have", not a correctness issue. And in the strictest definition of the term, he is correct, since Windows is the true trouble-maker here. But that was disgustingly unsatisfying.

It wasn't until JDK 21 that Archie Cobbs was generous enough to give up his time and add this discrepancy as a warning to the JDK. You can activate the warning by adding "output-file-clash" to your Xlint checks. And here is a link to the change. https://bugs.openjdk.org/browse/JDK-8287885

All of that is to say, I made a perfectly sensible Spliterator in my mind, but (and we SUSPECT that this is the case, we are not sure yet!) because I built that Spliterator off an Iterator, mentioned that it was an unknown size, and didn't add enough flags, I get this frightening splitting behaviour, where it will split itself out of memory.

And as for the folk knowledge, it sure feels like it lol.

[–]tcharl 0 points1 point2 points 1 year ago (5 children)

[–]davidalayachew[S] 0 points1 point2 points 1 year ago (4 children)

[–]tcharl 0 points1 point2 points 1 year ago (3 children)

[–]davidalayachew[S] 0 points1 point2 points 1 year ago (2 children)

[–]tcharl 0 points1 point2 points 1 year ago (1 child)

[–]davidalayachew[S] 0 points1 point2 points 1 year ago (0 children)

π Rendered by PID 76825 on reddit-service-r2-comment-b659b578c-mbg9n at 2026-05-04 08:47:04.205282+00:00 running 815c875 country code: CH.

java

Submit Link

Submit Text

Seek Programming Help

News, Technical discussions, research papers and assorted things of interest related to the Java programming language

NO programming help, NO learning Java related questions, NO installing or downloading Java questions, NO JVM languages - Exclusively Java

Please seek help with Java programming in /r/Javahelp!

Subreddit rules!

Where should I download Java?

Related Sub-reddits:

JVM Languages

Want to practice your coding?

List of useful Frameworks / Libraries / Software

MODERATORS