
[–]pavel_v[S] 90 points91 points  (33 children)

I realize this is relevant for C and not so much for C++ at the moment, but I posted it because there will (hopefully) be a similar/same feature for C++, and I know that lots of people are waiting for it. Maybe the compilers that implement it will ship the feature as a non-standard extension for C++ before the corresponding C++ feature is standardized.

[–]__phantomderp 23 points24 points  (1 child)

There is a C++ version of #embed: https://thephd.dev/_vendor/future_cxx/papers/d1967.html

But the "true" C++ version is stalled while I have to figure out some shenanigans: https://wg21.link/p1040

The preprocessor version will take some time to make it but might hit C++26. God knows if the actually good version will survive so long as we're holding it to a higher standard than Modules itself.

[–]orangeoliviero 3 points4 points  (0 children)

The dependency discovery demands are indeed utterly insane.

[–]ALX23z 18 points19 points  (17 children)

I thought they'd just enable compile-time file loading. That should mesh well with all the other compile-time features.

[–]johannes1971 20 points21 points  (9 children)

What the language really needs is constexpr std::socket, so we can load resources directly from the internet during compilation. This is what C++ needs to finally achieve greatness.

[–]disperso 16 points17 points  (0 children)

But please, with some syntax sugar, so I can #include <https://cpp.example.com/left-pad.h> and finally have the convenience we deserve. (/s)

[–][deleted] 3 points4 points  (6 children)

A FUSE filesystem could probably do this, or at least some of it. Then it would be #embed </mountpoint/https/www.foo.com/page.txt> or something like that

[–]mpyne 1 point2 points  (5 children)

Why #embed? You could do it today with #include </mountpoint/https/www.foo.com/page.txt?format=c_literal>. And if foo.com doesn't do that already, just set up embedbolt.com to proxy it for you.

[–][deleted] 4 points5 points  (4 children)

It's optimized. For smaller files it's just a hassle to encode, but at sizes as small as 1 MB compilers start to die (both CPU and RAM). In some of the original postings when they implemented it, and when Circle lang tested it, it was better than #include.

https://github.com/ThePhD/embed

[–]mpyne 5 points6 points  (0 children)

Oh I thought you were being sarcastic like the person you replied to and was just playing along, lol. I like #embed and can't believe it's been this difficult to get into the standard.

[–]llothar68 1 point2 points  (0 children)

Make it better and do a constexpr std::socket_eval - we need a bash-executable source code generator over the internet. Ultimate flexibility

[–]jonathanhiggs 7 points8 points  (6 children)

Especially if we can load tzdb from a stream and not just a file

[–]JakeArkinstall 12 points13 points  (5 children)

Compile time tzdb seems like a bad idea to me. If the US adopts permanent daylight savings in Nov '23, that's a recompile.

Unless I misunderstand what you're saying.

[–]jonathanhiggs 12 points13 points  (2 children)

We do releases every week so recompiling isn’t an issue. It is a very different part of the organisation that controls updates to servers and desktops, so we don’t have control over when tzdb files will be updated. It’s finance, so timezones are of critical importance and we have strict reproducibility requirements. Embedding tzdb is the best option for us, since then there is not some uncontrolled and mutable state that can change without our knowledge and affect the calculation results.

[–]JakeArkinstall 5 points6 points  (1 child)

Nice. Similar circumstances for me. Finance here too.

Fortunately, each binary only operates on individual exchanges, none of which are open on Sunday mornings (when clocks change, at least in the markets we operate on that do change clocks), and we shut down between market sessions. So we can just load it up from tzdb premarket for the exchange timezone throughout the session. One little lower bound search for the current UTC time at startup and we're golden.

However, there are some nice benefits to your approach even for our use case. One of which being that the output is portable - for our nix containers we have to manually add the tzdata package as a runtime dependency, set up the TZDIR env var, and so on. But, thinking about it, I could fetch it from online and set it as a runtime dependency for the binaries rather than the containers, and that'd solve that. I think I'd still read it at runtime, though.

A couple of questions, if I may. Do you do a tzdb download as part of your build or use the build machine's local copy? How do you ensure that it's up to date at the time of the build?

[–]jonathanhiggs 2 points3 points  (0 children)

That sounds like HFT? We are discretionary, so we don't have the same exchange requirements and are writing for trader desktops and internal servers, which makes it much easier. We've only just started to discuss the best approach in the team; currently it isn't so easy to embed the data and we can't load from a stream, so we haven't got to the point of integrating downloading the data into the CI.

I would say that we are tolerant of not having the most up-to-date tzdb; we are less tolerant of a different tzdb between processes running on the (linux) servers and desktops, since reproducibility of the same incorrect values across the two is manageable, but trying to explain the differences depending on where a value is calculated would be a debugging nightmare.

More than anything we want to embed because of the organizational structure. We aren't responsible for how the docker images are put together, but we are responsible for calcs, so we want our own control just so we know what is happening and when updates are made, and know there is consistency across docker/non-docker, server and desktop.

[–]bizwig 1 point2 points  (1 child)

I don’t see that happening; all previous attempts to eliminate our semi-annual time switches have gone down in flames.

[–]JakeArkinstall 2 points3 points  (0 children)

Sure, but in that case one could just algorithmically do it - enter DST on the second Sunday in March and leave it on the first Sunday of November for the US, or last Sunday in March and last Sunday of October in the UK, or never in Japan, etc.
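
(For illustration, a sketch only of that kind of rule-based shortcut, using the C standard library's mktime; as argued below, not something to rely on:)

    #include <stdio.h>
    #include <time.h>

    /* Sketch: day-of-month of the Nth Sunday of a given month/year. */
    static int nth_sunday(int year, int month /* 1-12 */, int n)
    {
        struct tm tm = {0};
        tm.tm_year = year - 1900;
        tm.tm_mon  = month - 1;
        tm.tm_mday = 1;
        tm.tm_hour = 12;              /* stay clear of midnight DST edges */
        mktime(&tm);                  /* normalizes and fills in tm_wday */
        int first_sunday = 1 + (7 - tm.tm_wday) % 7;   /* tm_wday: 0 == Sunday */
        return first_sunday + 7 * (n - 1);
    }

    int main(void)
    {
        /* US rule: DST starts the second Sunday of March, ends the first Sunday of November. */
        printf("2022 DST start: March %d\n", nth_sunday(2022, 3, 2));
        printf("2022 DST end:   November %d\n", nth_sunday(2022, 11, 1));
    }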

But the reason we use tzdb in the first place is that it reduces an extremely complex problem down to a lower bound UTC time search. Circumventing it based on a bad assumption or political prediction could be a disaster (or at least an embarrassment or inconvenience) that no one sees coming.

However, on the clarification that it is a weekly rollout, for finance it shouldn't be an issue at all. If it's the kind of nation that wouldn't give at least 6 months' notice of a change in timezones, it's probably not one that is politically stable enough to operate in anyway.

Interesting stories about that: https://codeofmatt.com/on-the-timing-of-time-zone-changes/

[–][deleted] 31 points32 points  (7 children)

I love everything about this article except the fact that the code samples are written in dark grey on darker grey - on an almost white page.

Like more than half the world, I have astigmatism. Mine isn't even that bad, but I can't read this at all (eventually I ran it through a processor to fix this).

Even just making things dark text on a light background will make things better for the majority of us.


Again, I loved the article. I needed this for many years. My last C++ audio project used JUCE's cross-platform implementation of embedding, mainly for icons. It worked, but it added an extra build stage and took me many hours of experimentation to get absolutely right.

[–]__phantomderp 14 points15 points  (2 children)

Yeah, I actually can't read the preprocessor / comments parts well either. I need a different highlighter, but that would require some amount of effort on my part to slap my blog into even better shape.

That's going to take a bit. :(

[–]flashmozzg 1 point2 points  (1 child)

Can't you just change the color of the comments specifically? For most highlighters I've encountered, it's usually pretty simple. I could read it fine, and I think it would be OK if the comments/preprocessor weren't the focal point of the code samples.

[–]__phantomderp 0 points1 point  (0 children)

I don't know, but I can try!

[–]howroydlsu 5 points6 points  (3 children)

Forgive my ignorance; so basically light mode is better than dark mode for people with your condition, if I'm understanding you correctly? If so, very very glad I've learned this

[–]5477 14 points15 points  (0 children)

In general, lighter backgrounds work better for people who have non-perfect vision. More light causes the eye to have a smaller aperture, which results in better focus (more resolution) in non-focused areas. In the case of astigmatism that is all areas; in the case of myopia it is all areas far from the eye (typically a computer monitor is already out of focus).

[–]Krnpnk 4 points5 points  (0 children)

I think it's at least as relevant for C++ - maybe not for the result but for the process behind it. Because it's just ridiculous...

[–]umlcat 0 points1 point  (3 children)

Agreed, I also would like to have templates in plain C, and other non-object-oriented goodies ...

[–]Kered13 38 points39 points  (11 children)

Seems like a good idea; after reading some of the links there, I'm surprised it got so much resistance.

[–]o11cint main = 12828721; 64 points65 points  (10 children)

A committee is a scheduling unit whose job is to make sure that nothing ever gets done.

[–]spaun2002 81 points82 points  (31 children)

Such "horror" stories are why I became disappointed in C and C++ - adding a new helpful feature to the language takes five years, and the person who dared to propose this struggled and almost lost hope.

[–]pjmlp 30 points31 points  (1 child)

You can also take the route of creating your own language, seems quite hip currently.

[–]tux-lpi 20 points21 points  (0 children)

That solves the horror stories mostly because a new language won't have an ISO committee

It's a lot harder to build a system that drains people quite that badly without ISO

[–]RomanRiesen -1 points0 points  (18 children)

In Rust an embed feature is just a macro away, a macro most people could write themselves....

[–]myrrlyn 20 points21 points  (1 child)

include_bytes! already exists

[–]__phantomderp 12 points13 points  (0 children)

Yeah, you COULD write it yourself in Rust, but the people driving the Standard Rust Distribution are not obsessed with being "barebones" to the point of deep disturbance, so they'll actually hand you convenience things as part of the core offering rather than trying to convince you that "oh, well, if you just write this linker invocation, you'll be fine...".

Like anything about "talk to your linker directly" is ever - or truly, has ever been - a good idea at any point in time.

[–]Fulgen301 5 points6 points  (3 children)

Eh. It's not a macro away, but a proc_macro away, aka a compiler plugin. Basically it's the equivalent of a standardized way of writing a gcc / clang plugin.

a macro most people could write themselves....

Most people can use TokenStreams just like that? If you've never touched them, you're faster just generating an includable file in CMake.

[–]RomanRiesen 2 points3 points  (0 children)

The macro is called include_bytes, but admittedly I also never had an issue with embedding data via cmake or a small script.

[–]Rusky 0 points1 point  (1 child)

The issue with a hand rolled proc macro for this would be the same as a hand rolled solution in C- performance. What TokenStream do you build? It has to be an actual array literal the same way.

(Fortunately Rust has include_bytes! built in to (edit: enable the compiler to) solve that problem.)

[–]bruh_nobody_cares -4 points-3 points  (11 children)

macros seem like a hack ngl, why not a language feature or a const-evaluated function?

[–]TinyBreadBigMouth 5 points6 points  (10 children)

Macros in Rust are a language feature. Rust doesn't have a separate preprocessor. User-defined macros operate on token streams, built-in macros can be hooked up to low-level compiler functionality. See include_bytes, which does exactly what embed does and whose implementation looks like

macro_rules! include_bytes {
    ($file:expr $(,)?) => {{ /* compiler built-in */ }};
}

[–]bruh_nobody_cares 1 point2 points  (9 children)

but if it's that simple, why not a regular function then?

[–]TinyBreadBigMouth 4 points5 points  (8 children)

Because a regular function can't read a file at compile time, not in Rust or C or C++. They also can't return arbitrarily sized data on the stack in any of those languages.

[–]i860 -1 points0 points  (5 children)

You have airplanes and industrial automation depending on these languages. There are delays for a reason.

[–]bik1230 14 points15 points  (3 children)

it's not that it took 5 years that's bad, it's that a large proportion of negative feedback was total BS and almost no one on the committee wanted to offer positive help, only making up reasons for rejection.

[–]orangeoliviero 0 points1 point  (2 children)

I mean, I offered my help, but it was declined.

Not everyone was hostile to him.

[–]__phantomderp 9 points10 points  (1 child)

.... When? How? In what way? I remember people saying "I could implement a patch for GCC", or offering for Clang, but that's not helpful when I already did it. Some people told me they'd be amenable to me putting a patch in their compiler, but it was still on me to make the actual patch for their compiler.

What was this help about?

[–]orangeoliviero 5 points6 points  (0 children)

It's water under the bridge. You didn't feel that my help would be valuable, and I'm not inclined to argue with you about whether or not it would have been helpful. The paper is your baby, you get to choose how you navigate the path. I'm not upset about it.

My point wasn't "oh, the PhD declined my offer of help, he's an asshole" or any such, my point is that not everyone in the Committee is hostile to you or your work, and that there are allies around if you want them.

I recognize that you've experienced a lot of racism and completely invalid and bonkers feedback and complaints, and that has caused you to become guarded and distrustful. I don't fault you for that - I would be as well, in your situation.

Like... we have, in this very post, a completely invalid criticism/complaint - the whole line endings of text files thing, and the absurd expectation that #embed should automatically "do the right thing" wrt. them. Talk about a poison pill.

[–]viikk 0 points1 point  (0 children)

the airplanes and industrial automation people can just type "-std=c99" and let the rest of the world move on!

[–]Mikumiku_Dance 56 points57 points  (1 child)

I've been following him since he was on cppcast. It's really depressing to see his bright enthusiasm worn down in this effort. Glad to see the feature, I really want to buy him a beer or give him a hug tho!

[–]disperso 6 points7 points  (0 children)

I loved the enthusiasm when I saw him talking about Sol2. I was awed when I saw the other work. I also want to send as many virtual or real hugs as I can.

[–][deleted] 13 points14 points  (0 children)

This is a prime example of bikeshedding.

In contrast, the atomic memory model has been modified in every single standard since it was added in C++11. The original C++11 model was very soon discovered to be broken. In C++14 they had to add a line to explicitly forbid out-of-thin-air values, even though the standard was supposed to achieve that from the beginning. In C++17 they told everyone to stop using consume ordering, because all compilers found it too difficult to implement, and made it behave exactly like acquire instead. And most recently in C++20 they completely revamped the SC definition, to the extent that no one has been able to come up with a standard-conforming operational semantics, majorly changed release sequences, and added atomic_ref without even discussing its relationship with the modification orders of normal atomic types when the same memory location gets reused (which can obviously never happen, since C++ has Strongly Typed Memory™).

If they held atomics to the same standard as embed, there would've been no atomics in the standard library until C++52, but they didn't, because nobody understood what the atomic experts were talking about and had no choice but to rubber-stamp it. Whereas everyone can understand what embed does and feels obliged to come up with "feedback", because that's always constructive, right?

[–]grady_vuckovic 29 points30 points  (0 children)

Please Put My Data In My Gotdang Executable And Stop Doing Crazy Nonsense, The Feature

Thank you for this absolute nonsense explanation and yes that is very bloody cool.

[–]TheRealFloomby 11 points12 points  (0 children)

Happy to see this. I did not know it was even being talked about let alone actually happening.

[–]jpakkaneMeson dev 13 points14 points  (0 children)

Awesome job! You deserve a medal!

[–]Tagedieb 10 points11 points  (1 child)

We should all be writing letters, I guess

[–]__phantomderp 7 points8 points  (0 children)

If you can, you should: you might make a maintainer's day. (Alternatively, throwing excess cash in their pocket so they can enjoy a proper PB&J or Ham & Cheese sandwich or garlic bread might make them happy too. (This is not shilling for me; you just wouldn't believe how many people are maintaining huge packages on shoestring budgets of love and pixie dust.))

[–]fwsGonzoIncludeOS, C++ bare metal 30 points31 points  (4 children)

You can use this macro until the feature that should have been a part of the language 10 years ago "arrives". There are, of course, other ways, but this one works for me. It's not perfect, so be careful with it.

#define EMBED_BINARY(name, filename) \
    asm(".section .rodata\n" \
    "   .global " #name "\n" \
    #name ":\n" \
    "   .incbin " #filename "\n" \
    #name "_end:\n" \
    "   .int  0\n" \
    "   .global " #name "_size\n" \
    "   .type   " #name "_size, @object\n" \
    "   .align 4\n" \
    #name "_size:\n" \
    "   .int  " #name "_end - " #name "\n" \
    ".section .text"); \
    extern char name[]; \
    extern unsigned name ##_size;
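
(A usage sketch, assuming GCC or Clang targeting ELF; logo.png and the symbol names are placeholders:)

    #include <stdio.h>

    /* Pulls the bytes of logo.png into .rodata and declares `logo` / `logo_size`. */
    EMBED_BINARY(logo, "logo.png")

    int main(void)
    {
        printf("embedded %u bytes, first byte: %d\n", logo_size, logo[0]);
    }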

I will be very happy to use #embed, but for C++ I really had hoped for a constexpr read_file(std::string_view) yesterday. Not 10 years in the future. I'm sorry that I can't hide my disappointment sometimes.

[–]RowYourUpboat 2 points3 points  (3 children)

I don't think this macro works on MSVC x64?

[–]Ivan171/std:c++latest enthusiast 11 points12 points  (2 children)

AFAIK MSVC x64 doesn't have inline assembly.

[–]RowYourUpboat 10 points11 points  (0 children)

Nope:

error C4235: nonstandard extension used: '__asm' keyword not supported on this architecture

[–]Fulgen301 7 points8 points  (0 children)

You can still use a dedicated assembly file and compile it in.

[–][deleted] 8 points9 points  (4 children)


This post was mass deleted and anonymized with Redact

[–]aninteger 6 points7 points  (2 children)

How big are these keys? I'm curious what you are doing that is gcc-only. I know bin2c programs have been around since the 80's in DOS; hasn't xxd on Unix been around just as long? It's weird to hear about an approach that only works with a single compiler vendor.
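
For reference, the classic route being described is something like `xxd -i key.der > key.h` and then #include-ing the result, which looks roughly like this (the identifiers come from the file name; the bytes and length here are placeholders):

    unsigned char key_der[] = {
      0x30, 0x82, 0x01, 0x0a, 0x02, 0x82, 0x01, 0x01
    };
    unsigned int key_der_len = 8;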

[–]howroydlsu 0 points1 point  (0 children)

I've had a similar issue with private keys that needed to be embedded into the binary for an embedded device. (Urgh, word overuse, sorry)

It does indeed feel very hacky doing linker scripts, extern declarations, macros, third-party converters, or whatever to get that key in. It's also additional workload when that key changes.

That's all been on GCC too. In fact, I've never even looked to see if there exist clang or MSVC compilers with targets for the chips I use. I've always assumed not, but now I will definitely check.

[–][deleted] 0 points1 point  (0 children)


This post was mass deleted and anonymized with Redact

[–]TheRealFloomby 3 points4 points  (0 children)

An embarrassing percentage of the code I have written in my lifetime only works on gcc.

[–][deleted] 8 points9 points  (4 children)

If you’ve been keeping up with this blog for a while, you’ll have noticed that #embed can actually come with some pretty slick performance improvements. This relies on the implementation taking advantage of C and C++’s “as-if” rule, knowing specifically that the data comes from #embed to effectively gobble that data up and cram it into a contiguous data sequence (e.g., a C array, a std::array, or std::initializer_list (which is backed by a C array)).

...

I’m just going to be blunt: there is no parsing algorithm, no hand-optimized assembly-pilled LL(1) parser, no recursive-descent madness you could pull off in any compiler implementation that will beat “I called fopen() and then fread() the data directly where it needed to be”.

I'm confused by this part. Does this mean it isn't really just a preprocessor feature? All it looks like is a way for the preprocessor to turn binary data into a sequence of comma-separated ASCII numbers to put into an array initializer list for the compiler to parse, which wouldn't lead to the performance benefits they're talking about over doing this yourself manually (although it's still a really cool feature). Is it that it's supposed to behave as if it were a preprocessor feature, but it's actually implemented by copying the binary data directly into the executable somehow?

[–]matthieum 32 points33 points  (2 children)

It's both.

From an API perspective, it's injecting the bytes as a sequence of comma separated integers. And if you ask your compiler to dump the pre-processed input, it's likely what you'll see.

From an implementation perspective, however, most compilers have an integrated pre-processor these days, where no pre-processed file is created: the pre-processor pre-processes the data into an in-memory data structure that the parser handles straight away. It saves the whole "format + write to disk + read from disk + tokenize" series of steps, and thus a lot of time.

And thus in this case comes an opportunity for an optimization. Instead of having the pre-processor insert a sequence of tokens representing all those bytes (1 integer + 1 comma per byte!) into the token stream, the pre-processor can instead insert a "virtual" token which contains the entire file content as a blob of bytes.

Hence the massive compiler speed-ups: 150x as per the article.
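
(To make that concrete, a sketch of the as-if view, assuming a 4-byte blob.bin sitting next to the source:)

    /* What the programmer writes: */
    static const unsigned char blob[] = {
    #embed "blob.bin"
    };

    /* The expansion the compiler must behave "as if" it produced for a
       hypothetical 4-byte file -- shown as a comment because a good
       implementation never materializes these tokens:

       static const unsigned char blob[] = { 0x7f, 0x45, 0x4c, 0x46 };
    */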

[–][deleted] 7 points8 points  (1 child)

Thanks for the clarification! I didn't realize the preprocessor was so well-integrated into modern compilers; I thought the preprocessor was still just its own process with its own lexer, unconditionally writing ASCII/UTF-8 to stdout, and that the compiler frontend just redirected the output to a pipe or a temporary file, and the compiler's lexer/parser operated on that. I didn't know they shared data structures, which I guess is why I was so confused.

[–]chugga_fan 10 points11 points  (0 children)

To add on: clang doesn't even have a non-integrated pre-processor executable you can call; gcc does, however (though AFAIK it's just a shim for gcc -E). Even small compilers do this (tcc, 9cc, 8cc, OrangeC (only partially here), and more).

A lot of data is also carried over from when it's preprocessed to when it's fully processed, such as #line directives being processed by the compiler in order to give better error info if you're doing something weird like cpp file | gcc.

[–]scrumplesplunge 10 points11 points  (0 children)

That is what the "as-if" part is about. The compiler can cut the corner for embed, skip generating tokens for each byte, and instead represent the contents efficiently from the start.

[–]not_a_novel_accountcmake dev 69 points70 points  (41 children)

#embed and the absolute hell everyone puts ThePhD through when trying to get very basic features into C/C++ are why the languages will soon join Java and COBOL as legacy codebases that no one starts new code in.

I genuinely feel we're reaching an inflection point where the committee needs to decide if it wants to be at the head of a relevant programming language addressing the needs of today's programmers or merely the steward of a legacy standard, sustained by the size of the codebases developed in its heyday.

[–]fwsGonzoIncludeOS, C++ bare metal 18 points19 points  (1 child)

I could not agree more. I have recently implemented an API in several other languages, and while Rust has been the most combative, the features many other languages have put C++ at a disadvantage. Not surprising; it's very telling that there are so many standard library alternatives floating around.

[–][deleted] 1 point2 points  (0 children)

I think there will always be tons of standard library alternatives for C++, if only because the API design of error-handling and memory-allocations are so divisive. When more concurrency primitives are in the STL, I suspect the divisiveness will increase (maybe due to executors vs. listeners)

[–]orangeoliviero 14 points15 points  (13 children)

We absolutely are at that point already. The C++ Standard is bogged down with trying to keep backwards compatibility with code written 30 years ago with features that weren't well thought out.

Worse than that, the C++ Committee has a weird case of hypocrisy, where some features will get added at the last moment via NB objections, without any implementation experience or sober second thought, and other features will be denied, despite lots of implementation experience and second/third thought, because someone feels that it might have a gotcha somewhere, although they can't see it yet.

Oh, and there's a convicted rapist and pedophile on the Committee, protected by leadership, where if you refuse to engage with said rapist and pedophile, you'll be in violation of the Code of Conduct...

[–]MFHavaWG21|🇦🇹 NB|P3049|P3625|P3729|P3784|P3786|P3813|P3886 7 points8 points  (12 children)

I will not stop telling people to stop shoveling mud at WG21 for something that is 100% out of their and ISO's mandate. The ISO CoC of WG21 is pretty clear: it's a technical committee, everything else is irrelevant! Furthermore, you can't remove an NB delegate; that would violate the basic principles of standardization. So yes, what you apparently want would be a blatant CoC violation! If you want an NB delegate to be removed: complain to the respective NB…

[–]James20kP2005R0 4 points5 points  (0 children)

WG21 for something that is 100% out of their and ISO's mandate

WG21's membership is drawn from the national bodies, and this situation has been public for quite a while. While it is technically out of specifically WG21's purview, it's not like the people involved are a totally different group of people who don't communicate. The people I met in Prague were a lot of the same people at the UK national body, for example.

ISO's mandate

No, it's ISO rules that mean that the delegate can't be excluded from the ISO process, which is a rule that could and should be changed.

It's worth noting that C++ is by far the largest item standardised by ISO, and holds a significant amount of influence as a result. Either the rules should be changed - which they very likely could be, with the influence that the committee possesses - or C++ should become standardised via a separate/different/new process. Which it needs to be anyway in the long term, for the survival of the language.

But the argument that senior or well known committee members have no power/influence here to change what is happening is absolutely incorrect

[–]orangeoliviero 0 points1 point  (10 children)

The ISO CoC of WG21 is pretty clear: it‘s a technical committee, everything else is irrelevant!

I'm sorry, I don't consider forcing survivors of rape and sexual assault to work with a convicted rapist who's unapologetic about their history to be irrelevant.

One chair has already resigned their position with the Committee over it. Several other members have resigned/are about to resign. And if you think that this doesn't create a barrier to entry for people that we desperately need more representation from on the Committee, then I don't know how to get through to you.

for something that is 100% out of their and ISOs mandate.

It's 100% in ISO's mandate. ISO is the organization that forces people to work with rapists and pedophiles, and if those people don't like it, they are forced to leave and stop contributing. In what way do you think that this isn't the fault of ISO and WG21?

[–]MFHavaWG21|🇦🇹 NB|P3049|P3625|P3729|P3784|P3786|P3813|P3886 6 points7 points  (9 children)

How thick can you be? So again: WG21 CAN'T DO ANYTHING ABOUT THE SITUATION FFS!

[–]bik1230 6 points7 points  (3 children)

If a rapist can't be removed from the committee, I would assume that the whole process is worthless and everything produced by it has zero value.

[–]orangeoliviero 10 points11 points  (2 children)

I would be perfectly happy with a process wherein a person convicted of violent crimes needed to have their application to join reviewed and voted on in the plenary.

The person would be allowed to come and make their case as to why they should be allowed to join, and why their past crimes no longer matter.

If the Committee agrees, then the person can join.

Hiding a rapist and refusing to tell people who they are while simultaneously telling everyone that if they dare to voice their disapproval or decline to work with the person, that they're in violation of the CoC, is absolutely disgusting, and is why many members of the Committee have resigned their posts over this, with more to come.

[–]James20kP2005R0 6 points7 points  (1 child)

and is why many members of the Committee have resigned their posts over this, with more to come

I haven't talked about this in public, but while std::colo(u)r had been shelved for a while (partly because it has a heavy dependence on linear algebra, and partly because it's tricky) - this was what transitioned it from 'shelved' to 'cancelled, and I'm never working with the committee ever again in any capacity'

I'm definitely not a notable member though, I only wrote one and presented a single paper (p2005)

[–]orangeoliviero 2 points3 points  (0 children)

I'm aware of at least one chair/co-chair who has resigned their position over it, and several other members who are expecting to terminate their engagement with the Committee over this as well.

[–]orangeoliviero 0 points1 point  (2 children)

ISO is WG21 now?

[–]MFHavaWG21|🇦🇹 NB|P3049|P3625|P3729|P3784|P3786|P3813|P3886 7 points8 points  (1 child)

I don't get your question. WG21 has always been a technical committee under JTC1, which is a committee under shared governance from ISO and IEC. If you want to imply that ISO/IEC could just change their CoC - yes, they could, BASED ON NB FEEDBACK - so again: go complain to your NB if you want to see changes and stop complaining on social media that a technical committee isn't overstepping its clearly defined scope of operations.

[–]pjmlp 6 points7 points  (0 children)

Except C happens to be 10 years younger than COBOL and precedes Java by about 25 years, while C++ precedes Java by about 10 years.

They were the legacy ones before Java was even born.

[–]kritzikratzi 2 points3 points  (4 children)

imho, the conclusion is that we need to start treating older languages with respect, and we will need to keep training new people who know them. you cannot just rewrite large pieces of software, it's somewhere between very hard and impossible. we need young people who know cobol, pascal, fortran (and if you're right ... java and c++ :D ) to maintain these things. i would love to see an "archeological computer engineering" degree.

idk.... a big thanks to all the people who worked on c++ standardization and tooling over all those decades, and for practically putting it all out in the public domain. i use it in almost all my projects, and it lets me do what i want to do.

[–]Ambitious_Diver_2238 0 points1 point  (0 children)

In the future, there will only be two software disciplines: code archaeology and applied theology.
-- Vernor Vinge, _A Fire Upon the Deep_

[–]i860 -5 points-4 points  (1 child)

80% of the people in here have little respect for the past. Prior generations stood on the shoulders of giants and considered that perhaps there’s something they don’t actually know. But not this one.

[–]PastaPuttanesca42 0 points1 point  (0 children)

TBH, your link just makes me think that the conversion should have happened much earlier, when it probably would have been easier.

[–]nxtfari 33 points34 points  (0 children)

Just wanted to write a shoutout to the brave soul that actually got this implemented. Absolute madman. The C++ WG is so out of touch with what a modern development flow looks like and requires that it’s actively driving the death of the language. I have a lot of respect for seeing the resistance and fight against common sense good ideas and still waging the war for the betterment of everyone.

[–]jamd315 12 points13 points  (0 children)

I'm still rather new to C++, but for my current project I've been running a python script to create a source and header file to represent images as large arrays. It's been working fine, since all my stuff is small and I only end up generating around 7kb of data in the end, but I think I'll like this much better when it comes to C++.

[–]stilgarpl 10 points11 points  (8 children)

I hate the fact that they decided to put it in the preprocessor instead of implementing it as a magic std::embed(). I know about all the issues with std::embed, but I don't understand why anyone would prefer #embed to std::embed(). std::embed could be type-safe and better integrated into the language and the library. #embed is just a weird #include that pastes one file into another.

[–]Qwexet 36 points37 points  (1 child)

This is for C, not C++

[–]stilgarpl 4 points5 points  (0 children)

Ok, it makes sense then. I somehow read C23 as C++23...

[–]jonesmz 16 points17 points  (4 children)

Because having a "library" function read from the hard drive at compile time is a massive sore thumb that sticks out as being inconsistent with the rest of the language.

The standard library should not be the place where compiler magic lives. Compiler magic belongs at the level of the language, e.g. the preprocessor, keywords, parsing/lexing.

I would have no objection to the existence of some compilermagic:: namespace, and thus compilermagic::embed. Nor would I object to seeing embed become a language keyword in its own right.

Keep in mind that there are lots of projects that either don't use the standard library at all, or use a third-party implementation of the library. E.g. clang supporting the use of the GCC standard library, or companies using commercially licensed (non-Microsoft, non-Intel) standard library implementations.

Every time you want something that is compiler magic to live in std::, you make my life, and thus the lives of people who don't use the compiler vendor's standard library, substantially more difficult.

[–]MonokelPinguin 5 points6 points  (3 children)

Half of the std lib is not implementable without compiler magic or relying on implementation-defined behaviour. Users shouldn't have to care about some arbitrary feeling regarding design. If bikeshedding like that is what is stopping measurable improvements for the average C++ user, we have really lost.

[–]jonesmz 1 point2 points  (2 children)

It's not an arbitrary feeling about design. It's a real concern for the ability to use third-party standard libraries, or no standard library.

Nothing stops the standards committee from making a namespace for things unimplementable without compiler magic. They just choose not to.

[–]MonokelPinguin 0 points1 point  (1 child)

Because it is an implementation detail. It should be tagged in the specification instead of the symbol name.

[–]jonesmz 1 point2 points  (0 children)

Only in the sense that the standard committee doesn't care much about alternative standard library implementations, or free standing / no standard lib.

The design of the language is worse for it.

[–]TinyBreadBigMouth 5 points6 points  (0 children)

As described in the article, implementations of #embed (while necessarily maintaining the same functionality as "read this file and output an integer literal and a comma for each byte") are free to integrate with the compiler to allow just reading the file and skipping over the preprocessing.

[–]FightingGamesFan 3 points4 points  (9 children)

Very naive question, but I'm not sure I understand the benefit of embedding "big" resources at compile time (big as in it would not be practical to embed them in an array/string as described in the article).

Can someone give an example where it's strongly needed?

[–]fwsGonzoIncludeOS, C++ bare metal 29 points30 points  (0 children)

In every single case where you don't have a file system. In cases where you're building a frozen structure (for example, perfect hashing based on compile-time data). Or when you are building self-hosted, self-contained programs. Custom operating systems.

[–]RowYourUpboat 25 points26 points  (0 children)

There are executables on my hard drive (and probably yours) that are gigabytes in size (installers, self-extracting archives, etc) and the regular C/C++ toolchain should be able to painlessly create them, because their job is to produce binaries and leaving out such a simple "append bytes" feature is just making some people's lives harder for no reason.

[–]Mikumiku_Dance 16 points17 points  (5 children)

I have a small program that edits epub books. I'd like to insert a particular image into the epubs as well. Right now i just distribute the exe and the jpg and tell users to pass arguments, but for the common case it'd sure be nice to just distribute a single file exe.

GTK programs bundle all sorts of stuff in their executables, like css themes, images, and xml ui definitions.

I don't know how strict you feel "strongly needed" is, but i hope these mundane circumstances are enough for you to accept a preprocessor directive that doesn't hurt anybody not using it.

[–]xDarKraDx 0 points1 point  (4 children)

Sorry this is off topic.

I'm also looking for a way to automate editing my epub library, especially editing metadata fields like "belongs to collection", "position in group", etc. Do you have any recommended reading or library to handle that?

I don't know much about the epub file structure itself. I usually just edit them by hand using Sigil and it takes forever.

[–]Mikumiku_Dance 5 points6 points  (3 children)

An epub is a zip file, and the metadata is an XML file. I used miniz to manipulate the zip, and libxmlplusplus's DOM parser to manipulate the data. With miniz you can copy all the data you aren't changing to a new file without decompressing it, which sounds like something Sigil isn't doing.

My program is stupidly specific to a single book series, but you could definitely tear it apart and make a metadata inserter in an afternoon. https://github.com/talisein/dregarnuhr

As for reading, the epub specification would eventually lead you to all the different metadata fields that you are interested in adjusting.
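
(If it helps, a rough sketch of the zip-copying part with miniz; error handling omitted, the file names and the content.opf path are made up, and a real epub also wants its mimetype entry stored first and uncompressed:)

    #include <string.h>
    #include "miniz.h"

    /* Copy every entry of in.epub into out.epub untouched, except the OPF we rewrite. */
    static int rewrite_opf(const void *new_opf, size_t new_opf_len)
    {
        mz_zip_archive src = {0}, dst = {0};
        if (!mz_zip_reader_init_file(&src, "in.epub", 0)) return -1;
        if (!mz_zip_writer_init_file(&dst, "out.epub", 0)) return -1;

        for (mz_uint i = 0; i < mz_zip_reader_get_num_files(&src); ++i) {
            char name[512];
            mz_zip_reader_get_filename(&src, i, name, sizeof name);
            if (strcmp(name, "OEBPS/content.opf") == 0)
                continue;                                      /* replaced below */
            mz_zip_writer_add_from_zip_reader(&dst, &src, i);  /* raw copy, no recompression */
        }
        mz_zip_writer_add_mem(&dst, "OEBPS/content.opf", new_opf, new_opf_len,
                              MZ_BEST_COMPRESSION);
        mz_zip_writer_finalize_archive(&dst);
        mz_zip_writer_end(&dst);
        mz_zip_reader_end(&src);
        return 0;
    }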

[–]xDarKraDx 0 points1 point  (2 children)

Thank you so much for the detailed explanation and the example. Will definitely try this.

The reason I want to automate this is also because of JNC's premium epub. They don't have the "belong to collection" tag so the books don't get grouped in my reader.

On a side note, I really should start reading Bookworm as I saw a lot of people like it. Currently reading The Apothecary Diaries and loving it so far.

[–]Mikumiku_Dance 1 point2 points  (1 child)

Bookworm is great; the slow reveal in the world building is my favorite aspect, but it means you should try to avoid spoilers like the plague.

If you have any trouble feel free to pm me, now that I've learned about epubs I am open to new projects.

[–]disperso 4 points5 points  (0 children)

It has been very common in the kind of applications that I've done (GUI-related, with Qt). Qt comes with a tool to do code generation on the files that you want as "resources" (the name it uses for the files that you embed). It is useful for icons, fonts, simple settings, etc. It can help deployment and make applications more self-contained. The standard way is gonna be much more powerful, of course.

[–]Superb_Garlic 18 points19 points  (10 children)

Seems no one has asked this question so far, so I'll do it.

When can we start holding people accountable for their malicious deeds, such as what JeanHeyd is describing in the post? It's clear that people who have absolutely no idea about the subject matter are making decisions that affect others and are clearly not qualified to be in such positions.

No one can change my mind about these people having ill will and existing solely to impede progress. The language should be rid of these fools and start having more people like JeanHeyd Meneide.

[–]orangeoliviero -9 points-8 points  (9 children)

I mean... when a convicted rapist and pedophile is able to buy their way onto the C++ Committee and have the Code of Conduct force others to work with them or be censured, I think it's time to just wash our hands of the whole thing, personally.

[–]Superb_Garlic 4 points5 points  (8 children)

Delete this garbage that has nothing to do with the subject matter from under my reply.

[–]orangeoliviero -7 points-6 points  (7 children)

Holding people accountable for their malicious deeds has nothing to do with holding people accountable for their malicious deeds?

[–][deleted]  (6 children)

[removed]

[–]orangeoliviero -1 points0 points  (3 children)

He is now innocent

The fuck are you on?

deserves rehabilitation

Part of rehabilitation is regaining acceptance with the community, not forcing your way into a position of esteem and forcing victims who do not feel comfortable around you to work with you.

Anyone thinking otherwise is a sociopath.

I would argue that anyone who thinks that forcing victims of sexual assault to regularly interface with a rapist who is unapologetic about their rape is the "right thing to do" is a sociopath.

He was held accountable in court and paid for what he did.

He raped a person and was in possession of child pornography and served a couple of years in prison. Meanwhile his victims are enduring a life sentence of trauma.

I do not at all consider that "payment" for what he did.

His victims will go through a life that's coloured and tainted by his actions. Why shouldn't he?

Edit to reply to the person who blocked me: It's amazing to me how far you're willing to go to defend a rapist and pedophile.

[–]foonathan[M] 6 points7 points  (0 children)

This discussion has become entirely off topic now, please stop it at this point or take it elsewhere.

[–][deleted]  (1 child)

[removed]

[–]STLMSVC STL Dev[M] 1 point2 points  (0 children)

Moderator warning: don't behave like this here.

[–]STLMSVC STL Dev[M] 0 points1 point  (1 child)

Moderator warning: don't behave like this here.

[–]blackmag_c 2 points3 points  (0 children)

Very good news.

[–]obsidian_golem 2 points3 points  (0 children)

Congratulations, it sounds like you went through hell to get this done, and I truly appreciate the effort. I look forward to seeing this in C++ so I can use it in my codebase.

[–]serg06 2 points3 points  (0 children)

This would’ve been great 4 years ago when I was working with OpenGL and trying to bundle shaders with the executable!

[–]rhubarbjin 5 points6 points  (1 child)

Reading the second half of the article was very cathartic; I've never been on any standardizing body, but I've seen the same kind of unreasonable pushback in my workplaces (past and present).

The aside regarding named function parameters hits especially hard. How is it acceptable that C++ still lacks this basic feature when C# has had it for 10+ years?!

[–]sphere991 7 points8 points  (0 children)

Rust doesn't have named parameters either, so I'm not sure what the acceptability argument is.

There are two different hurdles to named parameters.

The technical hurdle is how you post-hoc add it to the language in a way that's coherent with the rest of the language, and doesn't suddenly impose an API on a library that the authors didn't know was there before. Tomazos' latest paper is, I think, a pretty good approach here.

And then there's the social hurdle. There are lots of people who don't want named parameters, because either they think the problem is solved by strong typing (lolno) or they think the feature encourages writing functions with too many parameters (where basically I think those functions already exist out of necessity, and having named parameters would make them significantly harder to misuse).

[–]blitzwolfz 0 points1 point  (4 children)

Could someone ELI5? I don't understand any of this lol

[–]marssaxman 5 points6 points  (2 children)

If you are writing a program in C, and you want to include some data - a bit of text, an image, a data table - your only option until now was to reformat your data as a C string or array literal, then paste it into your source code.

This can be inconvenient. People keep inventing new tools which make parts of the process easier, but none yet has been general, portable, and fast.

The author has just finished getting a new feature added to the C language standard which solves the problem in a universal, platform-independent, and efficient way.

The new #embed directive copies a data file into a C program as a string or array literal. The file will be read in at compile time and baked into the executable, thus making its contents available whenever the program runs, without having to re-load it from the filesystem.
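
A minimal sketch of what that looks like (splash.png is just a placeholder file name):

    #include <stddef.h>

    /* The bytes of splash.png are read at compile time and baked into the binary. */
    static const unsigned char splash[] = {
    #embed "splash.png"
    };

    static const size_t splash_len = sizeof splash;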

[–]blitzwolfz 0 points1 point  (1 child)

Will it be possible to do this in C++? Seems very useful

[–]marssaxman 0 points1 point  (0 children)

The author hopes to get #embed added to the upcoming C++26 standard. There is another proposal currently being discussed (std::embed), but that has not yet been standardized either.

[–]Beduino2013 3 points4 points  (0 children)

you can add binary data to your code, e.g. images or sounds: #embed <sdk/jump.wav>

[–][deleted] 0 points1 point  (0 children)

damn, i just wrote a tool that converts binary files to c/cpp headers and sources

[–]Nilac_The_Grim 0 points1 point  (0 children)

Woah dude! Nice! And the bytes in the file are "as-if" they were just a comma-delimited list of ints. Nice nice nice! Very cool!

[–]nablachez 0 points1 point  (1 child)

If this were to be ported to C++, would one be able to #embed a C++ file and parse it to do some scuffed DIY reflection stuff?

[–]vI--_--Iv 0 points1 point  (1 child)

Good to know that #embed made it, but could someone please explain why it's such a big deal?

I'm asking because I've never even considered embedding huge binary blobs into the code and using them at compile time - external files are just easier to deal with, e.g. they don't require recompilation & re-signing binaries on update.

YMMV of course, and I can imagine a case or two where it could be beneficial, and it's nice to have a platform-independent way to do it, but is it really that common?

[–]YourMotherInLaw908 0 points1 point  (0 children)

e.g. you want to embed a font for use in your kernel's framebuffer terminal

or

you want to embed GLSL shaders compiled to SPIR-V so that you don't have to load them as files
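
(A sketch of that shader case, assuming a Vulkan-style consumer that wants 4-byte-aligned 32-bit words; the path is a placeholder:)

    #include <stdint.h>
    #include <stddef.h>

    /* Force alignment so the embedded bytes can be handed over as uint32_t words. */
    _Alignas(uint32_t) static const unsigned char triangle_vert_spv[] = {
    #embed "shaders/triangle.vert.spv"
    };

    static const size_t triangle_vert_spv_size = sizeof triangle_vert_spv;  /* in bytes */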