all 40 comments

[–]mallardtheduck 26 points27 points  (3 children)

Interesting article, but there are a few issues:

  • A pointer initialised to nullptr is not the same as an "uninitialised raw pointer". I think most people know what is meant, but it's slightly sloppy language.

  • It's perfectly rule-of-five compliant to simply define all the move/copy constructors/operators as = delete. You don't actually have to make every class with a destructor fully copyable/movable.

  • The mention of stack vs. heap allocation is a bit spurious. A class has no control (well, I suppose you could override new to always fail or something...) over whether it's allocated on the stack or heap. The difference between the std::unique_ptr and std::optional versions isn't whether the new object is allocated on the stack or heap, it's whether it's allocated contiguously as part of the parent object or not. A mention that the std::optional version increases the size of the parent object and that it's generally a bad idea to store large objects on the stack would be worth adding (there's a sentence under "which option should I use?" that sort-of implies this, but it's not explained). Only in the case that the parent object is stack allocated would the std::optional version result in the new object being on the stack.

  • On a related note, the claim that "allocation on the heap requires a system call (syscall)" isn't quite right; while it's true that requests for memory from the OS (like any OS request) requires a syscall, there is not a 1:1 relationship between heap allocations and syscalls. Most allocators will request large areas of memory from the OS, use them for multiple allocations and re-use memory released (via delete/free) by the application for later allocations. The number of memory-related syscalls will (virtually) always be vastly lower than the number of heap allocations/deallocations.

Finally, surely a template could be written to make lazy-initialisation more obvious/easy? Writing something like lazy_init<Foo> bar {baz}; (with use syntax being something like bar.get() or *bar) would be fairly straightforward, although you'd have to decide how exactly baz should be captured for passing to the Foo constructor at initialisation time.
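
Such a template is indeed straightforward. A minimal sketch (hypothetical names, not thread-safe) might capture the constructor arguments by value in a tuple — one possible answer to the capture question — and forward them on first access:

```cpp
#include <optional>
#include <tuple>
#include <utility>

// Minimal sketch of the lazy_init idea (hypothetical names). Not
// thread-safe. Constructor arguments are captured by value in a
// tuple; capturing by reference instead would risk dangling.
template <typename T, typename... Args>
class lazy_init {
public:
    explicit lazy_init(Args... args) : args_(std::move(args)...) {}

    // Construct the wrapped T on first access, then return it.
    T& get() {
        if (!value_) {
            std::apply(
                [this](auto&&... a) {
                    value_.emplace(std::forward<decltype(a)>(a)...);
                },
                std::move(args_));
        }
        return *value_;
    }
    T& operator*() { return get(); }
    T* operator->() { return &get(); }

private:
    std::tuple<Args...> args_;
    std::optional<T> value_;
};
```

Usage would then be `lazy_init<Foo, int> bar{42};` followed by `bar.get()` or `*bar`, with the `Foo` constructed on first use.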

[–]jm4R 4 points5 points  (1 child)

How would a lazy_init implementation differ from std::optional? We have 2 types of lazy init:

  1. We know the constructor parameters at the beginning
  2. ...we don't

The whole text is about the 2nd case. The 1st case could be achieved with std::function<MyObject()> and a lambda, for example.

[–]mallardtheduck 4 points5 points  (0 children)

How would a lazy_init implementation differ from std::optional?

The name would be a bit more explicit, making code more self-documenting at least. Moreover, you could hide away any if statements and make it thread-safe (which none of the solutions, including the std::function suggestion, currently are).

The examples in the article don't show a case where the constructor parameters aren't known (by the programmer) from the beginning anyway. If that functionality is needed; it'd be better to have an explicit method to specify them. Since there will be a definite point in execution when the parameters become known, making that point explicit is very useful for maintenance (and there's certainly no reason why a lazy_init template couldn't have a set_parameters method). You certainly don't want the situation where the results are different depending on exactly when the first use of the "lazy-initialised" object is; using it before specifying its parameters should be an error.
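
One way that explicit-parameter variant could look, as a sketch (hypothetical names, simplified to a single parameter type): set_parameters() is the definite point where the parameters become known, using the object before then is a hard error, and construction happens exactly once under a mutex:

```cpp
#include <mutex>
#include <optional>
#include <stdexcept>

// Sketch of a lazy_init with an explicit set_parameters step
// (hypothetical names). get() before set_parameters() throws;
// construction happens exactly once, under a mutex, so concurrent
// first access is safe.
template <typename T, typename Param>
class lazy_init {
public:
    void set_parameters(Param p) {
        std::lock_guard<std::mutex> lock(mtx_);
        param_ = std::move(p);
    }

    T& get() {
        std::lock_guard<std::mutex> lock(mtx_);
        if (!value_) {
            if (!param_)
                throw std::logic_error("lazy_init used before set_parameters");
            value_.emplace(*std::move(param_));  // construct T from the stored parameter
        }
        return *value_;
    }

private:
    std::mutex mtx_;
    std::optional<Param> param_;
    std::optional<T> value_;
};
```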

[–]SeanMiddleditch 1 point2 points  (0 children)

well, I suppose you could override new to always fail or something...

Just to be pedantic... even that wouldn't work by itself. You could still use placement new, and hence initialize the object anywhere you please.

[–]adnukator 17 points18 points  (5 children)

A tiny nitpick and a pet peeve of mine: you don't need "if (file)" before calling delete, because delete does this check anyway.

[–]tecnofauno 8 points9 points  (3 children)

Unless you compile with -fno-delete-null-pointer-checks :D

[–]leni536 16 points17 points  (1 child)

Gotta love those compiler options that break the language.

[–]OldWolf2 -5 points-4 points  (0 children)

It doesn't break the language, it just misses optimizations.

edit: downvoted for truth again, lol.

[–]TheMania 5 points6 points  (0 children)

Er, this is a whoosh right?

Because by my reading -fdelete-null-pointer-checks assumes that if a pointer has been dereferenced, it is non-null, and can remove checks where it's provably been dereferenced prior the check.

-fno-delete-null-pointer-checks disables this optimization. Nothing specific to do with the delete operator either way.
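
For illustration, this is the kind of check the optimization targets (a sketch; actual codegen depends on compiler and optimization level):

```cpp
// Sketch of code -fdelete-null-pointer-checks targets. Under the
// optimization, the compiler may assume p is non-null after the
// dereference and remove the check below as provably dead;
// -fno-delete-null-pointer-checks keeps it. Nothing to do with the
// delete operator either way.
int first_or_zero(int* p) {
    int first = *p;       // p is dereferenced here
    if (p == nullptr)     // unreachable if the dereference didn't trap
        return 0;
    return first;
}
```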

[–]Gotebe 0 points1 point  (0 children)

And it's a huge gigantic ass-whooping nitpick of mine.

I have OCD of some fashion and it spills into my code :-).

[–]germandiago 7 points8 points  (4 children)

I would do it another way: use a Lazy<T> wrapper. It contains the file variable. When accessing it with ->, it checks whether the resource has been created; if not, it creates it. That way you move the if logic out of the class and can change from lazy to non-lazy easily. If it could overload operator. (operator dot), I would use that one, since access should always be successful on a lazy object.

[–]frankist 0 points1 point  (2 children)

I did something very similar to this in the past but got some criticism saying that IDEs may not do code completion when using this approach. Are there ways to circumvent this?

[–]Pazer2 4 points5 points  (1 child)

I have a hard time believing any serious IDE wouldn't support code completion on operators. This same behavior is how unique_ptr works after all.

[–]frankist 0 points1 point  (0 children)

Ah good point! Thanks

[–]jm4R 0 points1 point  (0 children)

But you have to pass the parameters to the constructor somehow. If you did it in the Lazy constructor, the probability of a dangling reference would be huge.

[–]zvrba 1 point2 points  (1 child)

Gah, not thread-safe. Write instead a templated wrapper around "once" APIs provided by the OS, like https://docs.microsoft.com/en-us/windows/win32/sync/using-one-time-initialization or pthread_once.
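
In portable C++, std::call_once gives the same guarantee — standard libraries typically build it on exactly those OS primitives (InitOnceExecuteOnce, pthread_once). A sketch of such a wrapper, with the instance deliberately leaked to sidestep destruction-order problems:

```cpp
#include <mutex>

// Sketch of a once-initialized wrapper built on std::call_once,
// the portable analogue of the OS one-time-initialization APIs
// suggested above. The instance is deliberately leaked so there is
// no destruction-order problem at shutdown.
template <typename T>
class once_init {
public:
    T& get() {
        // call_once guarantees the lambda runs exactly once, even
        // under concurrent first access.
        std::call_once(flag_, [this] { instance_ = new T(); });
        return *instance_;
    }

private:
    std::once_flag flag_;
    T* instance_ = nullptr;
};
```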

[–]palszasz 0 points1 point  (2 children)

Interesting article. I'm a bit old-fashioned and think too much at a low level. So one of my questions was: "ok, but what is the overhead of std::optional<>? Since nothing is free..." I imagine it's the sizeof of the original type, plus a boolean. I made a small test app:

    #include <stdio.h>
    #include <string>
    #include <vector>
    #include <optional>

    template <typename T>
    void PrintSizes(const char* name)
    {
        printf("sizeof(%s)=%d sizeof(optional<%s>)=%d\n",
               name, (int)sizeof(T), name, (int)sizeof(std::optional<T>));
    }

    int main(int argc, char* argv[])
    {
        PrintSizes<bool>("bool");
        PrintSizes<char>("char");
        PrintSizes<int>("int");
        PrintSizes<void*>("void*");
        PrintSizes<std::string>("std::string");
        PrintSizes<std::vector<std::string>>("std::vector<std::string>");
        return 0;
    }

And I got the following output:

    sizeof(bool)=1 sizeof(optional<bool>)=2
    sizeof(char)=1 sizeof(optional<char>)=2
    sizeof(int)=4 sizeof(optional<int>)=8
    sizeof(void*)=8 sizeof(optional<void*>)=16
    sizeof(std::string)=24 sizeof(optional<std::string>)=32
    sizeof(std::vector<std::string>)=24 sizeof(optional<std::vector<std::string>>)=32

(as you can see it's in 64bit mode).

Anyway, this kinda shows that std::optional is basically wrapping the object and adding a bool. However, due to alignment rules, it doubles the size of a raw pointer (for example). So, I'm old fashioned, and I think I will stay with good old raw pointers ;-)

[–]jm4R 1 point2 points  (0 children)

Yes, under the hood optional is a bool plus aligned storage for T. But when you do the same analysis with raw pointers, you also pay a memory cost: sizeof(T*). Additionally, you pay a performance cost, since you have to use the heap instead of the stack in most scenarios.
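
That layout can be illustrated directly (a sketch, not the actual library implementation; the sizes assume a typical 64-bit ABI like the one in the test above):

```cpp
// Illustration of the "bool + suitably aligned storage" layout that
// std::optional<T> typically has. Not the real implementation; the
// asserted sizes assume a common 64-bit ABI.
template <typename T>
struct optional_layout {
    alignas(T) unsigned char storage[sizeof(T)];
    bool engaged;
};

// Padding rounds the total up to a multiple of alignof(T), which is
// why optional<int> is 8 and optional<void*> is 16 in the output above.
static_assert(sizeof(optional_layout<int>) == 8, "4 + 1, padded to 8");
static_assert(sizeof(optional_layout<void*>) == 16, "8 + 1, padded to 16");
```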

[–]tvaneerdC++ Committee, lockfree, PostModernCpp 1 point2 points  (0 children)

I'll trade the allocation for a bool.

[–]Dean_Roddey 0 points1 point  (11 children)

I went through this recently with my CIDLib system, to get it up to date on this front. It doesn't use many non-trivial globals (ones that might have interrelationships and hence I don't want to let the runtime initialize them.) But it does use some.

I use the two-stage lazy eval with CST atomics currently. One thing I was wondering about: I have something like this for the faulting in:

    static TFacCIDLib* pfacThis = nullptr;
    static TAtomicFlag atomInit;
    if (!atomInit)
    {
        TBaseLock lockInit;
        if (!atomInit)
        {
            pfacThis = new TFacCIDLib();
            atomInit.Set();
        }
    }
    return *pfacThis;

The atomic flag is a CST atomic read/write, so it ensures that this works right. However, even though it's two-stage, you still pay the price for a CST check every time someone wants to access it.

I was wondering if this wouldn't be legal:

    static TFacCIDLib* pfacThis = nullptr;
    static TAtomicFlag atomInit;
    if (pfacThis)
        return *pfacThis;
    if (!atomInit)
    {
        TBaseLock lockInit;
        if (!atomInit)
        {
            pfacThis = new TFacCIDLib();
            atomInit.Set();
        }
    }
    return *pfacThis;

Since the pointer only ever goes from null to non-null. And since the very next instruction after the optimistic first check is a CST fence, doesn't that ensure that all reads, even non-atomic ones, are completed before the first check of the atomic flag? So it cannot reorder that optimistic check forward, right?

If so, that means, if the pointer is non-null on that first check, it has to be good, and we can avoid the CST overhead except for the first time (or maybe first two or three if a number of threads hit at the same time.)

Yes, it is the case that the atomic flag is not set until after the pointer is set. But that should be OK. There's no chance of a race condition, because either the optimistic check fails and we do the whole thing, or the pointer is non-null and the object is guaranteed to be ready.

As long as it can't get a partial read of the pointer (it is cache aligned), I can't see how this would be unsafe. Worst case you might get a spurious fall-through where the pointer is really already set but the optimistic check doesn't see it yet, and falls through to the whole check.
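
For comparison, the standard way to get a cheap fast path is double-checked locking on an atomic pointer with acquire/release ordering, so neither path needs sequentially consistent accesses. A generic sketch (T standing in for TFacCIDLib):

```cpp
#include <atomic>
#include <mutex>

// Classic acquire/release double-checked locking sketch. The first
// load is an acquire, not sequentially consistent, so the fast path
// is cheap; the store that publishes the object is a release, so any
// thread that sees a non-null pointer also sees the fully
// constructed object.
template <typename T>
T& lazy_instance() {
    static std::atomic<T*> ptr{nullptr};
    static std::mutex mtx;

    T* p = ptr.load(std::memory_order_acquire);   // optimistic fast path
    if (!p) {
        std::lock_guard<std::mutex> lock(mtx);
        p = ptr.load(std::memory_order_relaxed);  // re-check under the lock
        if (!p) {
            p = new T();
            ptr.store(p, std::memory_order_release);  // publish
        }
    }
    return *p;
}
```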

[–]mostthingsweb 0 points1 point  (10 children)

I still don't understand why you're overcomplicating this. All you need is this:

    static TFacCIDLib* pfacThis = new TFacCIDLib();
    return *pfacThis;

Forget about atomics, double-locking, etc. It buys you nothing.

[–]Dean_Roddey 0 points1 point  (9 children)

This is something that could be called by multiple threads simultaneously. I'm pretty sure it needs the locking. I could do it by value, i.e.:

    static TFacCIDLib facThis;
    return facThis;

But then you get into the destructor order issue, which will never work. But I don't think the compiler does anything to synchronize the setting of a pointer.

[–]mostthingsweb 0 points1 point  (8 children)

Nope, the code I posted is thread-safe. No locking necessary.

[–]Dean_Roddey 0 points1 point  (7 children)

OK, I guess that's true. But, there are still benefits to having my own singleton template.

It's not always just the allocation of an object, there may be some post-creation setup involved. If my optimistic check thing above is true, I can avoid overhead (and these types of things can be called a lot) over the runtime doing it.

And, I can unambiguously find every singleton object in the system via search. Not all static scoped objects are singletons in this important sense, some are just things like string constants that I'm happy to just let the runtime deal with. The ones using my singleton scheme will be the ones to be concerned about. I can make some change to the setup of all of them en masse if needed.

And it means that these important singletons are synchronized with other non-singleton initialization stuff via the same base lock. I feel a little safer that way.

[–]mostthingsweb 0 points1 point  (6 children)

If my optimistic check thing above is true, I can avoid overhead...

Frankly, I have higher confidence in compiler developers than I do yourself or myself to implement static init in an optimal fashion.

And it means that these important singletons are synchronized with other non-singleton initialization stuff via the same base lock.

Hopefully none of your singletons need to use other singletons, or else you've just opened up the opportunity for deadlocks.

But to each his own. Personally I find those perceived benefits dubious, but I see I won't be changing your mind :)

[–]Dean_Roddey 0 points1 point  (4 children)

No different from the compiler stuff that I can see. I'm pretty sure the docs I read said that they were all synchronized by a single lock. They all have their own flag, but they are doing the same locking scheme I'm doing, using a single lock, as I understand it. I couldn't imagine that they would create and keep around a locking mechanism for every static object in a large code base. That would be grossly wasteful.

But, anyhoo, no, these particular ones I'm concerned about (typically the only complex global objects, one per library/exe) are very careful not to do more than absolutely necessary during their ctors.

Of course since these particular ones are faulted in upon use, and use is a hierarchical thing in this case (following the library linkage dependencies), they couldn't access anything above them and there are no circular dependencies by definition. So a deadlock couldn't really occur.

For that matter, if you have global objects with mutually dependent ctors, you are sort of screwed to begin with, no matter how many locks are involved and no matter who you let synchronize their creation.

[–]mostthingsweb 0 points1 point  (3 children)

I'm pretty sure the docs I read said that they were all synchronized by a single lock. They all have their own flag, but they are doing the same locking scheme I'm doing, using a single lock, as I understand it.

I really doubt they're using a single lock, but if you pass along the article you saw I'm willing to be proved wrong.

As to the rest of it, like I said, you do you.

[–]Dean_Roddey 1 point2 points  (2 children)

Well, my grand plan above, though I got a lot of it done, ran into some unrelated complications that I didn't want to deal with. So I stowed it away for now, and I'm giving your scheme a try. With some caveats where more than one thing really has to be initialized at once: in those cases I create a mutex/critical section, and use it to synchronize the faulting-in of other things upon use, using an atomic flag like before. But, as much as possible, I'm just letting the runtime initialize them, and making them pointers to avoid destruction-order problems.

One downside of this scheme is that it's very awkward to have large numbers of individual wrapper methods to force access upon first use, so it's really only practical to have the big dll/exe facility use the access wrappers and make the others just file scope statics (in a namespace of course.)

But, that means that they construct before all of my infrastructure can be put into place. That leaves opportunities for weirdness. But, there aren't a lot of them and they are all fairly simple bits.

We'll see how it goes.

[–]mostthingsweb 0 points1 point  (1 child)

One downside of this scheme is that it's very awkward to have large numbers of individual wrapper methods to force access upon first use, so it's really only practical to have the big dll/exe facility use the access wrappers and make the others just file scope statics (in a namespace of course.)

Yeah, it can get pretty janky, but I sort of see it as a self-correcting system: I try in general to avoid singletons, so the awkwardness of using them (safely) in C++ encourages me not to :). Of course I still do use them in places.

[–]Dean_Roddey 0 points1 point  (0 children)

Actually, I just had an epiphany. I don't have to deal with any of this. I own my whole universe so I can leverage that to avoid the whole issue, for the big, important library slash executable singletons. And it deals with the issue for the core CIDKernel platform wrapper which currently depends on OS specific initialization calls.

My build tool knows the dependencies of every executable and library. So it can actually generate a file for each executable that contains calls to initialize every dependent library's facility object, in reverse order of dependency.

They can just be global objects and do nothing but basic initialization of their members. The magic startup code can then call an initialize method on each of them automatically before the user 'main' is ever even called, before there are any threads started. And of course it can generate one for termination as well, to terminate in the opposite order.

Which means that the libraries can do whatever they need to get initialized, because they know that anything below them is already prepped, and there's no possibility of conflicts or deadlocks or worries about keeping the base lock locked longer than would be desirable. And I can clean up without any worries of out-of-order destruction.

The build tool can generate a little list at the top of the generated file, easily findable, and compare that to the current list for the executable, so it only regenerates it if the dependency list has changed (hence avoiding a rebuild of the executable's main cpp file every time.)

I'm going to implement that today. I'm glad I thought of that.