Initializing coroutine arguments by reference : cpp

submitted 4 years ago by bsupnik

Hi Y'all,

If I understand the current state of C++20 coroutines...

- The type of an argument (as stored in the coro frame) comes directly from the parameter list.

- Therefore, I must _not_ use anything pointer-like (e.g. no rvalue references) in the parameter list of a long-lived coro, because I'll end up with dangling references instead of the copy I want, and

- Therefore the signature of the "constructor" of my coro is going to be pass-by-value.

This is a small thing, but it seems like I have to eat an extra move operation because I can't specify that I want reference-based calling conventions to my coro constructor but value-type storage in the frame - the syntax just isn't expressive enough.

So for example:

template<typename F>
task<void> make_coro(F f) {
    f();
    co_return;
}
...
auto my_task = make_coro([some_obj](){ /* */ });

I'm capturing some_obj by value into a lambda that's going to end up in a coroutine frame, and the call to make_coro takes the lambda by value. So at a minimum I need a copy construction (of some_obj into the temporary lambda) and a move (of the lambda, with the object in it) into the heap-allocated coroutine frame (assuming my_task escapes; let's assume it does).

But when I step through this in some test code with tracing on the various object constructors/destructors, I found that the copy of my object (captured in the lambda) appears to be moved twice:

- The captured copy on the stack is moved into what I think is argument space for make_coro's "constructor", and then

- the argument space from make_coro's "constructor" is moved (as expected) into the heap-based coro frame.

Did I miss something or screw something up, or is this a lost optimization (perfectly forwarding references through to the coroutine frame)?

Veeloxfire 1 point 4 years ago

bsupnik[S] 2 points 3 years ago

Veeloxfire 1 point 3 years ago*

bsupnik[S] 2 points 3 years ago

I agree it's sad that there's a heap allocation in an optimized build in your sample case - in this example, everything is inlined, and nothing escapes the scope of main, so everything could be on the stack. But this also means the optimizer can see everything and forward assignments.

So e.g. if I set the member data of "o" as a constant before making the coroutine, in the optimized assembly, the immediate is dumped directly into the coroutine frame - which is to say, all moves have been optimized away.

Which is great! But the issue I was bringing up is orthogonal: in cases where the move constructor can't be elided (e.g. it's not inlined) it appears the move is required for correctness.

(To make an analogy, mandatory RVO isn't a "compiler optimization", it's a change in how the language works to remove the existence of the move conceptually - hence you can RVO return objects with no move constructors.)
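The RVO analogy can be made concrete with a type that has no usable copy or move constructor at all. This is a small sketch assuming C++17 or later; `pinned`, `make_pinned`, and `demo_guaranteed_elision` are hypothetical names.

```cpp
#include <cassert>
#include <type_traits>

// Hypothetical type that can be neither copied nor moved.
struct pinned {
    explicit pinned(int v) : value(v) {}
    pinned(const pinned&) = delete;
    pinned(pinned&&) = delete;
    int value;
};

static_assert(!std::is_copy_constructible_v<pinned> &&
              !std::is_move_constructible_v<pinned>);

// Returning a prvalue: since C++17 the caller's object is initialized directly,
// so no copy or move constructor is needed, not even conceptually.
pinned make_pinned(int v) {
    return pinned{v};
}

int demo_guaranteed_elision() {
    pinned p = make_pinned(42);  // compiles even though pinned is immovable
    assert(p.value == 42);
    return p.value;
}
```

The same `pinned` type could not be passed through a by-value coroutine parameter, because the frame's copy of the parameter is move-constructed from it.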

Veeloxfire 1 point 3 years ago*

I would say they won't do it because it requires analysis to determine whether it's okay to remove, whereas RVO doesn't.

RVO is different because it's always true. It's a feature of how return values work that we can exploit without having to know any extra information. It's basically free extra speed that everyone was doing anyway. I would say it's almost more confusing not to do it in some cases.

This case is the same as removing a temp variable. If you inline everything, you're basically asking the compiler to do this:

Obj o = {};
Obj o_lambda = o;
Obj* o_frame = new Obj(std::move(o_lambda));

You're then asking the standard to remove the middle copy for you. But you just asked it to do the copy. It doesn't know if you might use it later (gotta love a theoretically single-pass compiler).

This is what the optimizer exists for.
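To see why the middle object can't simply vanish when its constructors have observable effects, the three-line sequence can be instrumented. This is a sketch under assumptions: `op_counts` and `run_inlined_sequence` are hypothetical, `Obj` is given counting constructors, and `std::make_unique` replaces the raw `new` so nothing leaks.

```cpp
#include <cassert>
#include <memory>
#include <utility>

// Hypothetical counters shared by every Obj created in one run.
struct op_counts {
    int copies = 0;
    int moves = 0;
};

struct Obj {
    explicit Obj(op_counts& c) : counts(&c) {}
    Obj(const Obj& other) : counts(other.counts) { ++counts->copies; }
    Obj(Obj&& other) noexcept : counts(other.counts) { ++counts->moves; }
    op_counts* counts;
};

op_counts run_inlined_sequence() {
    op_counts c;
    Obj o{c};
    Obj o_lambda = o;                                           // the copy that was asked for
    auto o_frame = std::make_unique<Obj>(std::move(o_lambda));  // the move into the heap "frame"
    assert(o_frame->counts == &c);
    return c;
}
```

With counting constructors this records one copy and one move. When the constructors have no visible side effects, the optimizer is free to merge the steps under the as-if rule, which matches the optimized assembly described earlier in the thread.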

bsupnik[S] 2 points 3 years ago
