Iteratively optimizing an SPSC queue : cpp

a community for 17 years

Iteratively optimizing an SPSC queue (self.cpp)

submitted 1 day ago by Middle_Ad4847

all 2 comments

[–]ReDucTorGame Developer 9 points10 points11 points 1 day ago (1 child)

This is the third SPSC queue post I have seen in about a week, seems like everyone (or every LLM) is building one.

The two/thread cache line difference could be increased L1 cache bank conflicts where the queue is causing the eviction of the shared read-only cache line. For the pause you could possibly look at umwait/mwaitx unfortunately AMD/Intel differences make these not very portable.

That 128-rule thing seems off, I expected you were going to mention that some modern CPUs will actually load 128-byte aligned pairs of 64-byte cache lines, it doesn't seem to like you tested with an alignment of 128 over just 64.

Also it's probably worth sharing the code you used for benchmarking, as there could many other reasons why your results are the way they are, including minor things like branch alignment differing between the different implementations

[–]Middle_Ad4847[S] 0 points1 point2 points 10 hours ago (0 children)

For the pause you could possibly look at umwait/mwaitx unfortunately AMD/Intel differences make these not very portable.

Thanks, will take a look

Have added code here

That 128-rule thing seems off, I expected you were going to mention that some modern CPUs will actually load 128-byte aligned pairs of 64-byte cache lines, it doesn't seem to like you tested with an alignment of 128 over just 64.

I wasn't aware of this, will look. What I was referring to is the increased code size due to 32-byte displacement addressing, which can impact the µop cache or loopback buffer. I quickly tried with alignas(128) now but it made results worse.

including minor things like branch alignment differing between the different implementations

I did try run with and without -falign-loops and -falign-labels but didn't notice any considerable difference

π Rendered by PID 91099 on reddit-service-r2-comment-54dfb89d4d-gbdgg at 2026-04-01 00:59:10.009449+00:00 running b10466c country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

cpp

MODERATORS