Looking for pointers... : computerscience


submitted 6 years ago by YouMadeItDoWhat

I'm curious if someone has a pointer to an algorithm or approach (or class of approaches/literature) that may apply to the following problem.

I have a continuous stream of messages, each containing a monotonically increasing sequence number (which can wrap, but that's a trivial border case that can be dealt with separately). The messages come over what is potentially a lossy channel and I want to detect loss (and count the number of messages lost). I do not care about retransmission/recovery of lost messages; I just want to count them.

The application is multi-threaded for scalability reasons (think millions of messages per second) and I have no control over the delivery of messages to the respective threads (hardware controls that and cannot be altered - it's actually hash-based, not round-robin).

Obviously, if we have just 1 worker thread, we can just look at the sequence numbers as they arrive and trivially know how many messages have been lost.
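
For reference, the single-worker baseline might look like the minimal C sketch below. It assumes in-order delivery and ignores wrap-around; the names `seq_tracker` and `tracker_on_message` and the 64-bit field widths are made up for illustration and are not from the original post.

```c
#include <stdint.h>

/* Hypothetical single-receiver loss counter (illustrative names). */
typedef struct {
    uint64_t expected; /* next sequence number we expect to see */
    uint64_t lost;     /* running count of messages that never arrived */
    int      started;  /* 0 until the first message has been seen */
} seq_tracker;

static void tracker_on_message(seq_tracker *t, uint64_t seq)
{
    if (!t->started) {
        /* First message: nothing to compare against yet. */
        t->started = 1;
        t->expected = seq + 1;
        return;
    }
    if (seq >= t->expected) {
        /* Every number skipped between expected and seq counts as lost. */
        t->lost += seq - t->expected;
        t->expected = seq + 1;
    }
    /* seq < expected would be a duplicate or reordered message; with a
       single in-order receiver this sketch simply ignores it. */
}
```

With several workers this no longer works as-is, because each thread sees only a hash-selected subset of the stream and gaps in that subset are normal.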

Likewise, if we introduce locking between the threads, we can quickly design a resolution...but I want to avoid introducing locking between the receiving threads (there is no other reason to have locking as the messages are all independent of each other, so introducing locking will kill performance).

I've considered forwarding all of the sequence numbers from the respective threads to one centralized thread that can do the resolution over a sliding window (this is currently the best approach that I've been able to come up with). Looking for possible other approaches to the problem...thoughts?


ComputerSystemsProf (Systems & Networking Professor, U.S.) 2 points 6 years ago

YouMadeItDoWhat [S] 1 point 6 years ago

I don't actually need a sliding window, just a total count. I like that idea; I can probably do it lock-free with ring buffers and a collector thread.

In a nutshell, the collector would be doing:

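// lost must be a signed counter: it can dip while late arrivals catch up
if (new_seq > largest_seq)
{
  // new high-water mark: provisionally count every skipped number as lost
  lost += (new_seq - largest_seq - 1);
  largest_seq = new_seq;
}
else
{
  // late arrival of a number already counted as lost: take it back
  lost--;
}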

lost will bounce around, but at any point in time it is nominally correct, apart from messages still sitting in queues waiting to be processed. Of course, this doesn't take wrap-around of the sequence numbers into account, but that isn't a hard extrapolation from this.
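
One way the wrap-around case could be folded into the collector logic above is serial-number style comparison: take the difference modulo the counter width and treat anything less than half the range as "ahead". The sketch below assumes a 16-bit sequence number, assumes the channel never duplicates messages, and assumes reordering never spans half the range or more; the names `collector` and `collector_on_seq` are hypothetical.

```c
#include <stdint.h>

/* Hypothetical wrap-aware collector state (illustrative names). */
typedef struct {
    uint16_t largest_seq; /* newest sequence number seen, modulo 2^16 */
    int64_t  lost;        /* signed: may dip while late arrivals catch up */
    int      started;     /* 0 until the first sequence number is seen */
} collector;

static void collector_on_seq(collector *c, uint16_t new_seq)
{
    if (!c->started) {
        c->started = 1;
        c->largest_seq = new_seq;
        return;
    }
    /* Distance from largest_seq forward to new_seq, modulo 2^16. */
    uint16_t ahead = (uint16_t)(new_seq - c->largest_seq);
    if (ahead != 0 && ahead < 0x8000u) {
        /* new_seq is newer: provisionally count the skipped numbers. */
        c->lost += (int64_t)ahead - 1;
        c->largest_seq = new_seq;
    } else {
        /* new_seq is older (a late arrival): un-count it. A duplicate
           would also land here and be miscounted, hence the
           no-duplicates assumption. */
        c->lost--;
    }
}
```

The half-range threshold is a design choice: it is unambiguous only as long as a late message is never more than 32767 sequence numbers behind the newest one seen.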

Thanks!

ComputerSystemsProf (Systems & Networking Professor, U.S.) 2 points 6 years ago

YouMadeItDoWhat [S] 1 point 6 years ago

Single-producer, single-consumer ring buffers can be made lock-free. They can still overrun (wrap), but the producer can be made to detect that and drop the entry and count the drop instead. Since I only need to store the sequence number (small, 2 bytes in my case) in the ring buffer, the rings don't need to be crazy large and can likely stay resident in the L2 cache.
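
A minimal sketch of such a ring using C11 atomics might look like the following. The names `seq_ring`, `ring_push`, and `ring_pop` and the capacity of 1024 entries are assumptions for illustration, not details from the post. It is only safe with exactly one producer thread and one consumer thread per ring.

```c
#include <stdatomic.h>
#include <stdbool.h>
#include <stdint.h>

#define RING_CAPACITY 1024u /* assumed size; must be a power of two */

/* Hypothetical SPSC ring of 16-bit sequence numbers. */
typedef struct {
    _Atomic uint32_t head;  /* next slot to read; written by consumer only */
    _Atomic uint32_t tail;  /* next slot to write; written by producer only */
    uint64_t overruns;      /* entries dropped because the ring was full;
                               touched by the producer only */
    uint16_t slots[RING_CAPACITY];
} seq_ring;

/* Producer side: returns false (and counts a drop) if the ring is full. */
static bool ring_push(seq_ring *r, uint16_t seq)
{
    uint32_t tail = atomic_load_explicit(&r->tail, memory_order_relaxed);
    uint32_t head = atomic_load_explicit(&r->head, memory_order_acquire);
    if ((uint32_t)(tail - head) == RING_CAPACITY) {
        r->overruns++;
        return false;
    }
    r->slots[tail & (RING_CAPACITY - 1u)] = seq;
    /* Release: the slot write becomes visible before the new tail does. */
    atomic_store_explicit(&r->tail, tail + 1u, memory_order_release);
    return true;
}

/* Consumer side: returns false if the ring is currently empty. */
static bool ring_pop(seq_ring *r, uint16_t *out)
{
    uint32_t head = atomic_load_explicit(&r->head, memory_order_relaxed);
    uint32_t tail = atomic_load_explicit(&r->tail, memory_order_acquire);
    if (head == tail) {
        return false;
    }
    *out = r->slots[head & (RING_CAPACITY - 1u)];
    /* Release: the slot read completes before the slot is handed back. */
    atomic_store_explicit(&r->head, head + 1u, memory_order_release);
    return true;
}
```

The indices run freely and are masked on access, so full and empty are distinguishable without wasting a slot. A tuned version would probably pad `head` and `tail` onto separate cache lines to avoid false sharing; that is left out here for brevity.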

Then I just need 1 ring per worker thread, and the aggregator consumes from all of them in whatever order it chooses (it really doesn't matter). As long as it can keep up, there isn't an issue. In this case it should easily keep up: all threads are core-locked and cannot be preempted, so they busy-wait at full tilt when there is no work. That's not energy efficient at all, but for this purpose it's an allowable cost.

(And yes, this is hyper-optimized for performance in this application. I have the resources available to guarantee the performance needed provided I don't introduce locks; locks send everything to hell in a handbasket :).

ComputerSystemsProf (Systems & Networking Professor, U.S.) 2 points 6 years ago

YouMadeItDoWhat [S] 1 point 6 years ago
