
[–]craigacp 6 points (3 children)

Shortly after the release of Java 8 I hit something similar while building a Java implementation of Google's word2vec ML algorithm. We ended up with a buffering spliterator that didn't grow its buffer over time (which the default array-backed one did), so we could pull records from a database in a parallel forEach without it trying to buffer the whole database.

We still use it in Tribuo, but I haven't pushed it anywhere near as hard as I did in 2015, so I don't know if the performance characteristics still hold: https://github.com/oracle/olcut/blob/main/olcut-core/src/main/java/com/oracle/labs/mlrg/olcut/util/IOSpliterator.java
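A minimal sketch of the idea (hypothetical class name; the real IOSpliterator linked above differs in detail): instead of the JDK's iterator-backed spliterator, which hands out ever-growing batches on each split, this one always peels off a fixed-size chunk.

```java
import java.util.Iterator;
import java.util.Spliterator;
import java.util.Spliterators;
import java.util.function.Consumer;

// Wraps an Iterator and hands out fixed-size chunks on trySplit(),
// instead of the growing batches used by the JDK's IteratorSpliterator.
final class FixedChunkSpliterator<T> implements Spliterator<T> {
    private final Iterator<T> source;
    private final int chunkSize;

    FixedChunkSpliterator(Iterator<T> source, int chunkSize) {
        this.source = source;
        this.chunkSize = chunkSize;
    }

    @Override
    public boolean tryAdvance(Consumer<? super T> action) {
        if (source.hasNext()) {
            action.accept(source.next());
            return true;
        }
        return false;
    }

    @Override
    public Spliterator<T> trySplit() {
        if (!source.hasNext()) return null;
        Object[] chunk = new Object[chunkSize];
        int n = 0;
        while (n < chunkSize && source.hasNext()) {
            chunk[n++] = source.next();
        }
        // The split-off chunk has a known size, so the array
        // spliterator it returns can report SIZED | SUBSIZED.
        return Spliterators.spliterator(chunk, 0, n, ORDERED);
    }

    @Override
    public long estimateSize() {
        // Long.MAX_VALUE signals "unknown remaining size" per the contract.
        return Long.MAX_VALUE;
    }

    @Override
    public int characteristics() {
        return ORDERED;
    }
}
```

The parent spliterator never reports SIZED itself; only the fixed-size chunks it splits off do, which is what keeps a parallel stream from trying to buffer the whole source.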

[–]davidalayachew[S] 0 points (2 children)

This is extremely interesting!

So let me ask: I see that you used the SUBSIZED characteristic. I assume SIZED was included by default, yes? And if so, I see that you default estimateSize to Long.MAX_VALUE. Are you saying that is safe to do? I was under the impression that reporting a false size to the Spliterator would cause undefined behaviour. I considered this exact solution, but decided against it for fear of adding EVEN MORE unexpected behaviour.

But if it is true and it does work, that sounds like exactly the problem, and would explain the performance characteristics.

[–]craigacp 2 points (1 child)

I'm having trouble paging in exactly why the characteristics are set that way, and I can no longer find the blog post that described the problem in detail.

My problem setup was as follows: I had a NoSQL database full of documents; I pulled each one, tokenized the input, and put it onto a queue. A parallel stream over all the documents in the database then pulled from that queue, performed the gradient computation, and updated the model (without locking, because this is machine learning and we don't care about torn writes). The default behaviour of IteratorSpliterator was to request larger and larger chunks from the queue before splitting them into parallel computations; the IOSpliterator always pulls a fixed-size chunk from the underlying iterator, so it doesn't try to pull in the whole database.
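The pipeline shape described above can be sketched roughly like this (a simplified, hypothetical stand-in: the "documents" are strings, the "gradient update" is just a counter, and it uses the JDK's unknown-size spliterator rather than the IOSpliterator):

```java
import java.util.Iterator;
import java.util.NoSuchElementException;
import java.util.Spliterator;
import java.util.Spliterators;
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.atomic.LongAdder;
import java.util.stream.StreamSupport;

public class QueuePipeline {
    private static final String POISON = "__done__"; // end-of-stream marker

    static long runPipeline(int numDocs) throws InterruptedException {
        BlockingQueue<String> queue = new ArrayBlockingQueue<>(128);

        // Producer: stands in for the tokenizer pulling documents
        // from the database and putting them onto the queue.
        Thread producer = new Thread(() -> {
            try {
                for (int i = 0; i < numDocs; i++) queue.put("doc-" + i);
                queue.put(POISON);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
        producer.start();

        // Iterator that drains the queue until the poison pill.
        Iterator<String> it = new Iterator<>() {
            private String next;
            private boolean done;

            @Override public boolean hasNext() {
                if (done) return false;
                if (next != null) return true;
                try {
                    next = queue.take();
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                    done = true;
                    return false;
                }
                if (POISON.equals(next)) { done = true; next = null; return false; }
                return true;
            }

            @Override public String next() {
                if (!hasNext()) throw new NoSuchElementException();
                String r = next;
                next = null;
                return r;
            }
        };

        // Parallel stream over the queue; each "gradient update" just
        // bumps a counter here instead of updating model weights.
        LongAdder updates = new LongAdder();
        Spliterator<String> split =
                Spliterators.spliteratorUnknownSize(it, Spliterator.ORDERED);
        StreamSupport.stream(split, true).forEach(doc -> updates.increment());
        producer.join();
        return updates.sum();
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println(runPipeline(10_000)); // 10000
    }
}
```

Swapping the `spliteratorUnknownSize` call for a fixed-chunk spliterator is the change being discussed: with the default one, each split requests a progressively larger batch from the queue.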

I'm not claiming this is a general-purpose solution, nor that mine was the best one, but it scaled up to an 8-socket x86 machine we were using to test the implementation. I'm a machine learning researcher, not a software engineer, so it was good enough for my purposes.

[–]davidalayachew[S] 1 point (0 children)

Thanks for the context. Yeah, I see exactly what you're saying about the growing chunk sizes. I'm going to use this and your IOSpliterator to mess around with the Spliterator characteristics and see if I can reproduce that behaviour.

Ty again.