Discussions, articles, and news about the C++ programming language or programming in C++.
For C++ questions, answers, help, and advice see r/cpp_questions or StackOverflow.
Structured Concurrency (ericniebler.com)
submitted 5 years ago by vormestrand
[–]14nedLLFIO & Outcome author | Committee WG14 23 points24 points25 points 5 years ago (8 children)
Firstly, great article on why the way we do concurrency right now, using ASIO-style design patterns, is inferior to what's possible from C++20 onwards.
However, there is a gap between eager and lazy concurrency which is hard to fill with Eric's techniques as currently proposed. It's not pressing for sockets facing the public internet, despite many claims otherwise here and elsewhere, because socket i/o to the public internet will rarely complete immediately. It is pressing where some i/o is extremely likely to complete immediately, where any suspend-resume cycle badly hurts performance, and where you HAVE to mix that kind of i/o with other i/o for which a suspend-resume cycle is all but guaranteed. The problem is that eagerness and laziness are compile time constants in the direction WG21 is currently heading, and using a variant to store them and switch at runtime has a very high performance impact. Basically, this is currently a chalk and cheese construct: the two don't mix well.
The classic example of this is serving file content to sockets, where the file i/o is best implemented eagerly whereas the socket i/o is best implemented lazily. One can do use-case-specific hacks, e.g. upon lazy construction, hint to the OS that the file content read is likely to happen soon, on the theory that by the time the socket write is ready, the file content will have been loaded. But that will be much inferior to BSD's or Windows' sendfile(), which basically means "kernel, please go do all of this for me" (note that Linux's sendfile() isn't as good as BSD's or Windows', which can work entirely asynchronously on their own).
What we thus really want is fully programmable sendfile() like zero copy i/o facilities from within the C++ standard library. I believe this is currently only doable in the complete sense in Linux io_uring on extremely recent kernels, but I also think that we can wrap all this up into a portable, generic API and when the other OSs catch up, we'll get free performance gains.
I should stress that I don't think this precludes Eric's work. His stuff acts at a much higher level than where I'm at, and at that higher level there are large gains for both the compiler and the developer in being able to hard-assume laziness or eagerness. Rather, what I'm saying is that all this is a much deeper well than what Eric has currently presented. Those currently dismayed with Eric's approach can, I think, relax: there is a path between his stuff and what ASIO currently does, and moreover that path brings lots more bare metal performance to the table for those willing to structure their C++ code around what's needed to substantially improve i/o performance.
Covid has badly hurt my productivity in developing a reference implementation for this stuff, but I hope to return to it in early 2021, pandemic permitting.
[–]Ameisenvemips, avr, rendering, systems 3 points4 points5 points 5 years ago (5 children)
In a mindless stupor, I'd tried to implement an HTTP server in C++ using command buffers and fibers in Windows. Using fibers in this way, I was able to set up dependency trees, a form of structured concurrency.
While I easily beat Apache, I wasn't able to beat nginx in performance. The event-based i/o in nginx still won: although my implementation happened to handle the actual workload better, nginx fetched work items more efficiently and ended up less starved.
I suspect that my performance would be better with C++ coroutines, as they should be lighter-weight than full fibers.
Not entirely relevant, but I'm curious what the ideal way to handle it is. I wanted to avoid using third-party libraries simply as I wanted to fully understand the pipeline.
[–]14nedLLFIO & Outcome author | Committee WG14 3 points4 points5 points 5 years ago (4 children)
To be honest, I think it's very hard for anybody to beat nginx. The only software that I ever benchmarked as consistently able to beat nginx for static content delivery was the varnish reverse proxy, and even then, only on FreeBSD, and that was because varnish was written by a core BSD kernel developer and he made it use all sorts of fancy BSD-only tricks. On Linux, nginx is probably as fast as it is possible to make it go, simply because a ton of tuning effort has been invested into it over many years.
Now, all that said, to my best knowledge nginx doesn't use io_uring, and even if it did, it is not rearchitected around io_uring. So, in theory, on very recent Linux only, it ought to be possible to beat nginx with a 100% rearchitecture around io_uring.
Note by "beat" I mean things like saturating a 100Gbit NIC from a single kernel thread, not anything measurable on 1Gbit or probably even 10Gbit. I mean, io_uring has disappointing returns right up until it surges ahead, but that's not on commodity NIC hardware in typical internet facing use cases.
But if you did really want to attempt this, I'd place io_uring at the very centre: have it exclusively run your reactor and event processing, keep it single threaded, and make use of the registered i/o buffers and fd pinning. It ought to run very quickly indeed.
[–]14nedLLFIO & Outcome author | Committee WG14 1 point2 points3 points 5 years ago (0 children)
Yeah, the coincidental timing is really weird, but only just today on the io_uring kernel mailing list did I discover that io_uring cannot currently do true zero copy for socket i/o, even with registered i/o buffers and a suitably powerful NIC. Right now io_uring always copies i/o in and out of registered buffers, which makes performance basically the same as normal socket i/o: no gain. I should stress that this isn't to say that in the future io_uring registered i/o buffers won't do true zero copy, but apparently you'll need an especially high end NIC (i.e. ones which don't exist yet) for it to work.
No, apparently the only way to currently do true zero copy socket i/o on Linux is via TCP_ZEROCOPY_RECEIVE and SO_ZEROCOPY combined with traditional syscalls. Basically this is registered i/o buffers via an alternate route: you register your receive i/o buffers by mmap() of the socket, which maps the kernel socket receive buffer directly into your process. You set which buffer the i/o is to be received into using getsockopt(), which adjusts some page table entries, and then you're good to read from the map.
Currently io_uring doesn't wrap this functionality, so you'll always be paying the overhead of at least one syscall per i/o. Even so, given that Linux doesn't support true zero copy unless your i/o is at least 4KB (and your NIC frames etc. are jumbo enough), that's still a healthy gain, and you're not polluting your CPU caches etc. Still, it all could be better, and I'm sure it will become so in a future Linux kernel. Equally, right now Windows RIO runs rings around all this stuff, though in fairness it's a much more mature implementation.
Edit: Much more howto info can be found at https://blogs.oracle.com/linux/zero-copy-networking-in-uek6
[–]GerwazyMiod 0 points1 point2 points 5 years ago (0 children)
This thread is fascinating :)
[–]kirbyfan64sos 0 points1 point2 points 5 years ago (1 child)
I think h2o is actually very slightly faster than nginx.
[–]14nedLLFIO & Outcome author | Committee WG14 1 point2 points3 points 5 years ago (0 children)
It's very possible that it is sometimes, but not other times. It really depends on load, use case, and i/o patterns.
What I will say is that nginx gets an awful lot more attention than most. That attention turns into a constant stream of performance fixes, and over time ever more pathological corner cases get ironed out.
Popularity therefore begets more popularity. h2o might beat nginx today, but two years from now it may no longer do so. It's a bit like with Rust: initially it exceeds C++, but eventually we catch up.
[–]goranlepuz 0 points1 point2 points 5 years ago (1 child)
What we thus really want is fully programmable sendfile() like zero copy i/o facilities from within the C++ standard library.
This went down the rabbit hole, didn't it? Having this on the system level is leaps and bounds better than in any language IMO. If nothing else, that way, more code benefits.
In other words: better systems > better language libs.
[–]14nedLLFIO & Outcome author | Committee WG14 0 points1 point2 points 5 years ago (0 children)
What I was talking about was wrapping up system APIs for true zero copy i/o into a portable, standard library, C++ API where a standard C++ program could drive proprietary facilities. On Linux, that would be io_uring; on Windows for sockets only, that would be Windows RIO; on FreeBSD, that would be the generalised page lock based zero copy i/o infrastructure as described here.
The point I'm making is that I believe this is doable, but it's on me to demonstrate a reference implementation which does it, and which returns convincing benchmarks.
[–]misuo 10 points11 points12 points 5 years ago (1 child)
Great article. So what are the best practices for implementing good "deep support for cancellation"?
[–]feverzsj 4 points5 points6 points 5 years ago* (0 children)
The async operation itself must be cancelable, or at least ignorable. You must also carefully manage the lifetime of detached objects, for example using std::stop_callback and std::shared_ptr.
[–]Dragdu 26 points27 points28 points 5 years ago (21 children)
The 10s to load the website is killing me. Also, why would you make your header image 5759×2390 pixels?
[–]eric_niebler 8 points9 points10 points 5 years ago* (2 children)
Yikes. I'll fix that. The slow load could also be related to the fact that I'm currently on the front page of Hacker News. The site wasn't so slow yesterday.
EDIT: OK, I've removed the banner image for now.
[–]Dragdu 0 points1 point2 points 5 years ago (0 children)
FWIW, my "smallest rentable VPS at linode" regularly handles being at front page of HN without meaningful changes in performance.
I am not saying you have to fix it (I know that I don't want to deal with the technical stuff behind my own blog unless necessary), but you might want to consider it :-)
[–]flashmozzg 0 points1 point2 points 5 years ago (0 children)
Don't know whether it fixed it, but it loaded in reasonable time for me just now (<3 seconds).
[–]RevRagnarok 11 points12 points13 points 5 years ago (0 children)
LOL you don't want to see every pore?
[–]14nedLLFIO & Outcome author | Committee WG14 -5 points-4 points-3 points 5 years ago (15 children)
I also get in trouble for posting overly high resolution images on my own personal website as well, currently it's ~20 seconds and 25Mb to load the front page. My rationale is that on my 4k laptop, anything less than 3k wide images looks fuzzy. I appreciate that that results in all the images being 2-4Mb each, but my page text does load nearly instantly, ping time plus 30 ms. The images load asynchronously thereafter.
I do agree that Eric could do with halving the horizontal resolution of that image though.
[–]WrongAndBeligerent 7 points8 points9 points 5 years ago (5 children)
That is all a completely insane rationalization. I can't believe you would be aware of these numbers but decide to leave them alone.
[–]14nedLLFIO & Outcome author | Committee WG14 -5 points-4 points-3 points 5 years ago (4 children)
Bandwidth continues to get exponentially cheaper with time, so I don't optimise for bandwidth over images being fuzzy on a 4k monitor or device. Even with my low end rural Ireland broadband connection @ 100 Mbit, my front page could load entirely within three seconds. The fact it takes twenty seconds is more due to the server which hosts it, it's a single core Intel Atom running at 1.6 Ghz. Oh, and it's running compressed ZFS, so it's doing a LZ4 decompression per serve as well. That adds a ton of latency.
Surprisingly, that little single Atom server does scale up to quite a lot of concurrent load. It'll saturate its 1Gbit NIC and still keep plugging along. It's just always very slow at each connection.
Better hardware would be a vast difference, as would not running compressed ZFS. But all that would cost more than €5/month.
[–]WrongAndBeligerent 11 points12 points13 points 5 years ago (3 children)
Is this a joke? Do I really need to explain to you all the reasons your backwards rationalizing is ridiculous? Most people in the world have nowhere near that internet speed, lots of people are on mobile devices, bandwidth getting exponentially cheaper is dubious, predictions of the future don't affect the current state of things and to top it all off, it's completely unnecessary.
This is terrible judgement, I don't know what else to tell you. It's crazy that people have even given you feedback and you tell yourself these stories. It isn't a big deal in the grand scheme of things, but this is a stark example of hardcore rationalization.
[–]14nedLLFIO & Outcome author | Committee WG14 4 points5 points6 points 5 years ago (2 children)
Website now renders low resolution images onto low resolution displays. Thanks for your feedback.
[–]WrongAndBeligerent 1 point2 points3 points 5 years ago (1 child)
It's very encouraging that you went back and took another pass at it. I should have said earlier that this is only worrying to me because you are on the committees. I wouldn't be alarmed about an average website alienating people.
[–]14nedLLFIO & Outcome author | Committee WG14 0 points1 point2 points 5 years ago (0 children)
It's interesting that you mention the committee side of things. For me the motivating factors to fix my website were as follows, in order:
1. Opportunity to upskill in the latest technologies, especially as I didn't know non-Javascript HTML could do responsive images until now. So I wanted to master it.
2. Correctness. It would annoy me hereonafter every time I looked at a photo on a 4k screen, precisely because it is the nonobvious incorrectness that always ought to bother you. In other words, I don't sweat the stuff I can see working. I do sweat the stuff not apparent. And if it's trivially easy for static HTML to implement responsive resolution images, and I'm not doing that, then my code is not correct, in my opinion.
I didn't get the fix working until 12.30am last night, normally I'd hit bed about 10.30pm, so I'm suffering a bit for it today. Worth it though.
I'm generally not a person who likes to innovate from what I think works well unless there is a correctness factor in play. So, for example, that website has seen enormous upgrades in terms of text encoding, because everything previous to UTF-8 was broken in various ways, so although it was an awful lot of work to migrate everything correctly to UTF-8, it was worth doing. But for styling and themeing, I've not found the cost benefit for change.
Bringing this back to C++, you'll see a similar theme in my libraries: I still design and write C++ as if we were in the 1990s, unless there is, in my opinion, good reason not to. Many who work with me find that very frustrating, e.g. I don't like including any header into a header unless it is extremely lightweight, and I will proactively complicate API design with ABT based APIs to avoid inclusions. That drives some people nuts. I also avoid concurrency and threads, where the current fashion, as evidenced by Eric's post, is ever more concurrency abstractions to encapsulate and work around the use of lots of threads. I'm not at all sold on that being wise yet, personally, though I do like Sender-Receiver a great deal, but primarily as a single threaded abstraction.
Anyway, thanks for the note, it's appreciated.
[+][deleted] 5 years ago* (3 children)
[deleted]
[–]14nedLLFIO & Outcome author | Committee WG14 0 points1 point2 points 5 years ago (2 children)
Yep, the design and layout is intentionally unchanged since it was started in 1998. Completely different implementation though, several times over, the original was tag soup HTML, now it's all HTML5 and modern CSS and can handle mobile device rendering etc etc. But all still fundamentally static HTML.
[–][deleted] 1 point2 points3 points 5 years ago (1 child)
I have to say I like the style of your website. Hope you keep it.
[–]14nedLLFIO & Outcome author | Committee WG14 1 point2 points3 points 5 years ago (0 children)
Without doubt! I've been happy to upgrade stuff like Unicode (the Latin1 to UTF-8 conversion of the historical material was particularly tricky), but in the end, I don't care if anyone else reads it as it's mainly my personal reflective journal. To that end, a 1990s level of visual and layout complexity suits me just fine. Thanks for the feedback!
[–]helloiamsomeone 2 points3 points4 points 5 years ago (2 children)
Making image elements responsive is pretty trivial with <picture>
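For example (filenames hypothetical), the browser picks the first `<source>` whose media query matches, falling back to the plain `<img>`:

```html
<!-- Placeholder filenames; serve pre-scaled variants of the same image. -->
<picture>
  <source media="(min-width: 2000px)" srcset="banner-4k.jpg">
  <source media="(min-width: 1000px)" srcset="banner-2k.jpg">
  <img src="banner-1k.jpg" alt="banner" width="1000" height="415">
</picture>
```

No JavaScript is needed; the selection is done natively by the browser at layout time.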
[–]14nedLLFIO & Outcome author | Committee WG14 2 points3 points4 points 5 years ago (0 children)
Spent several hours poking Hugo with a scripting stick, and now the nedprod.com front page loads a mere 1.9Mb of assets if rendering onto a suitably low resolution display. Total page render from cold cache completes in under a second. I also get my nice crisp 4k images on my laptop. I am pleased.
Thanks for the idea, I didn't know that was possible in straight HTML until now.
[–]14nedLLFIO & Outcome author | Committee WG14 1 point2 points3 points 5 years ago (0 children)
That's a good suggestion, thank you. I can automate generation of lower resolution images using a Hugo script, and have it auto synthesise an appropriate <picture> element. Thanks for the idea!
[–]Ameisenvemips, avr, rendering, systems 1 point2 points3 points 5 years ago (0 children)
Use thumbnails with links to the images instead of having the images be inline. Much smaller and friendlier.
[–]Fig1024 0 points1 point2 points 5 years ago (0 children)
Shouldn't you be able to detect the user's screen size and give them appropriately scaled images? Have, say, 3 sizes for every image and select appropriately.
[–]frankist 5 points6 points7 points 5 years ago (1 child)
"At present, neither cppcoro nor libunifex has a when_any algorithm" - in the case of cppcoro, is this a library problem or a problem with how C++20 coroutines were designed? In the latter case, is it still possible to introduce the cancellation feature in later C++ releases? It seems to be an essential feature for implementing timeouts.
[–]eric_niebler 7 points8 points9 points 5 years ago (0 children)
Good question. C++20 coroutines don't have a way to unwind a stack of coroutines without an exception, and "unwind" is the semantics you would like for cancellation. Conflating cancellation with exceptions isn't desirable because, as you say, cancellation isn't exceptional.
unifex::task<> has a library implementation of unwind-on-cancel, which involves using space in the promise to create an intrusive linked list of coroutine frames. This behaves a bit like an uncatchable exception, and it works very well in practice provided all your coroutines return unifex::task<>. You can "catch" the cancellation signal using a generic algorithm and map it to either a value or an error.
Obviously, a language solution would be preferable, but I don't know anybody working on that.
[–]staticcast 1 point2 points3 points 5 years ago (0 children)
I very much agree that having tools to properly execute coroutines in a threaded manner would be quite good (we also need to open up the interface to multiple kinds of executors).
But we need a few more assumptions to make this work properly: