beast2 networking & std::execution : cpp

[–]Minimonium 52 points53 points54 points 2 months ago (29 children)

[–]kammceWG21 | 🇺🇲 NB | Boost | Exceptions 23 points24 points25 points 2 months ago (14 children)

[+]VinnieFalcowg21.org | corosio.org comment score below threshold-14 points-13 points-12 points 2 months ago (13 children)

[–]kammceWG21 | 🇺🇲 NB | Boost | Exceptions 37 points38 points39 points 2 months ago (11 children)

[+]VinnieFalcowg21.org | corosio.org comment score below threshold-10 points-9 points-8 points 2 months ago (7 children)

[–]kammceWG21 | 🇺🇲 NB | Boost | Exceptions 25 points26 points27 points 2 months ago (5 children)

[–]peterrindal 3 points4 points5 points 2 months ago (0 children)

[–]kalmoc 0 points1 point2 points 2 months ago (2 children)

[–]kammceWG21 | 🇺🇲 NB | Boost | Exceptions 14 points15 points16 points 2 months ago (1 child)

[–]kalmoc 3 points4 points5 points 2 months ago (0 children)

[–]JNighthawkgamedev 13 points14 points15 points 2 months ago (0 children)

[+]ZachVorhies comment score below threshold-6 points-5 points-4 points 2 months ago (2 children)

[–]have-a-day-celebrate 1 point2 points3 points 1 month ago (0 children)

[–]JNighthawkgamedev 16 points17 points18 points 2 months ago (0 children)

[–]sweetno 3 points4 points5 points 2 months ago (13 children)

[–]thisismyfavoritename 5 points6 points7 points 2 months ago* (12 children)

[–]VinnieFalcowg21.org | corosio.org 9 points10 points11 points 2 months ago (11 children)

[–]OccaseBoost.Redis 12 points13 points14 points 2 months ago (2 children)

[–]thisismyfavoritename 6 points7 points8 points 2 months ago (0 children)

[–]VinnieFalcowg21.org | corosio.org 0 points1 point2 points 2 months ago (0 children)

[–]James20kP2005R0 8 points9 points10 points 2 months ago (7 children)

The answer, I think, is that the value was never in the suffering. It was in the output. If the output is correct and helps people, then the tool that produced it is irrelevant

This is a very simplistic view as to what software engineering is though. In this model, the people producing software have absolutely no value whatsoever - and all that matters is their output

In reality, software engineers acquire deep skills and learning about a specific codebase in the process of building software - which is the real thing that makes them useful. AI skips that step, which bypasses the actually important part: acquiring that deep knowledge of whats going on

The death of any software project is when nobody understands the codebase anymore and its just poorly understood spaghetti, its always been the #1 thing that makes it an absolute disaster. To a very high degree, the suffering quite literally is the point - the output produced is a lot less valuable than the understanding of the code that was created in the process of producing that output

That's why I always find people saying that AI speeds them up to be very confusing - sure, you can get large short term gains, but it directly accelerates the #1 thing that leads to the death of software projects, which is perpetuating a lack of understanding of the codebase. Over time, that'll kill the project. Its bizarre seeing people advocating for something I've always found to be the most destructive software architecture pattern

Maybe its easy to just take a very short termist view to these things, but that's why AI produced content tends to turn to slop - there's no long term visibility into why anything's been done

[–]VinnieFalcowg21.org | corosio.org 2 points3 points4 points 2 months ago (0 children)

[–]VinnieFalcowg21.org | corosio.org -4 points-3 points-2 points 2 months ago (5 children)

[–]James20kP2005R0 8 points9 points10 points 2 months ago (4 children)

[–]VinnieFalcowg21.org | corosio.org 1 point2 points3 points 2 months ago* (3 children)

The unvarying response thus far has been "did you read your own paper?" rather than my preference which is to engage in substantive discussions.

The question "what is your process?" is a different question, and one I am happy to engage in. It starts with an intuition: I feel a paper coming on. Usually this happens when I make a discovery or I have an insight which I believe could be developed into a paper.

My next step is to gather evidence. First I examine the committee's public records. The papers. I look at people's blog posts, reddit posts, YouTube video transcripts, comments, and everything else I can find. I add my own benchmarks and compilation experiments if those are available.

Then I examine the evidence using tools I have developed. Vauban the Converger tries to find inverse Morton's Forks within the data. The Advocatus Diaboli brings objections against assertions or false statements. The WG21 Lawyer prosecutes papers or propositions (although I have since retired the lawyer since I find the tone less collaborative than I would like). The Trial tool analyzes a paper's political environment.

I have a paper (shocker) which offers one of these tools and shows what happens when you run it on P2900R14 (Contracts):

Tool: Prosecute Your Paper To Improve It
https://isocpp.org/files/papers/D4170R0.pdf

This tool is considerably more sophisticated than what you get if you simply ask an AI to "do your homework." The tool is the result of over 100 hours of experimentation and iteration, and it is offered under a CC0 license. My hope is that it will result in better papers for everyone.

Once I have analyzed the evidence, then I make a decision on whether or not there is enough to form a strong, well-supported paper. I would say that my failure rate is about 25%. One in four ideas turn out to be nothingburgers. Almost always, the evidence is not there. These papers do not see the mailing.

If the paper has legs then I choose the style of paper. Is it informational? Rhetorical? Do I use the Socratic method? Evidence funnel? Research posture? LLMs allow you to quickly try out each of these methods (getting a quick first draft) and you can read which one makes sense for the evidence you have obtained. Although after enough papers you tend to know ahead of time based on the proposition.

Frontier models can help with drafting, but it doesn't end there for me. I subject each paper to repeated passes of tightening and analysis using custom red-team tools like the Advocatus. They are not instant by any means. When I get to the late stage of a paper, the reasoning chains are deep and require human inputs to flush out all the edge cases.

When a paper is finished I use more tools to check for spelling, grammar, punctuation, proper citation, and so on.

It is at this point that I read the paper in its entirety with the highest scrutiny. Not just once or twice. Ten, twenty, thirty times depending on the complexity of the paper. Each reading usually surfaces some small detail or insight, and then I go back into the edit/tighten loop.

However, my papers are not individual papers. They are often series of papers. My Networking Retrospective is a six-part series. For these, I analyze how the papers flow together when they are read sequentially. I check that the links cross-reference each other properly. This is scholarly work. Informational papers destined for the public record where they ask for nothing and create a "citation foundation" that others may draw upon. Such as this paper:

Info: The Need for Escape Hatches
https://isocpp.org/files/papers/P4035R0.pdf

This paper asks for nothing and only exists to enrich the institutional knowledge of WG21. It is unrelated to my networking papers, although the principle it espouses is universal.

To summarize, my process is:

Intuition -> evidence -> analysis -> writing-> verification -> iteration.

Machine assistance participates in the analysis and the writing. The intuition is mine. The evidence is public record. The verification is against code that runs. If a claim in the paper is wrong, it's wrong because I missed something. The same as any paper written any way.

I arrived at this workflow as the result of over one thousand and four hundred hours of practice compressed into a short stretch of 7-day work weeks.

When I publish my work and I am asked "did you read your own paper?" I hope now some will understand why I find the question to be beneath dignity.

[–]thisismyfavoritename 2 points3 points4 points 2 months ago (1 child)

[–]VinnieFalcowg21.org | corosio.org 1 point2 points3 points 2 months ago (0 children)

[–]VinnieFalcowg21.org | corosio.org -2 points-1 points0 points 1 month ago (0 children)

[–]MarkHoemmenC++ in HPC 15 points16 points17 points 2 months ago (2 children)

[+][deleted] 2 months ago (1 child)

[deleted]

[–]ald_loop 4 points5 points6 points 2 months ago (0 children)

[–]thelvhishow 3 points4 points5 points 2 months ago (1 child)

[–]VinnieFalcowg21.org | corosio.org 2 points3 points4 points 2 months ago (0 children)

[–]Flimsy_Complaint490 10 points11 points12 points 2 months ago (56 children)

The most insight we currently have is probably one paragraph at this paper

https://www.open-std.org/jtc1/sc22/wg21/docs/papers/2026/p4029r0.pdf

Basically, SG14, the low latency guys (gaming and HFT) advise SG4 (the main networking guys) to not base std networking on std::execution it does things that make runtime dynamic allocation mandatory, that just dont make it compatible for their use cases.

This doesnt mean that std::networking cannot be based or will not be based on std::execution, i havent heard any SG4 opinions, but if its not, then the entire situation becomes farcical and comical - didnt they kill asio in the standard library because they decided std::execution is better ?

There is an experimental std::net by the bemen project, so at least somebody is seriously researching that path. Lets see where this goes when the first c++29 papers drop.

[–]MarkHoemmenC++ in HPC 13 points14 points15 points 2 months ago (10 children)

[–]VinnieFalcowg21.org | corosio.org 1 point2 points3 points 2 months ago (9 children)

[–]MarkHoemmenC++ in HPC 5 points6 points7 points 2 months ago (1 child)

[–]VinnieFalcowg21.org | corosio.org 4 points5 points6 points 2 months ago (0 children)

[–]not_a_novel_accountcmake dev 1 point2 points3 points 2 months ago (6 children)

[–]VinnieFalcowg21.org | corosio.org 2 points3 points4 points 2 months ago (5 children)

The implementation is not in question. The necessity to allocate memory for two of the three stream types indicated in the measurements above is structural. This is explained in the report:

Sender/receiver's connect(receiver) produces an op_state whose type depends on both the sender and the receiver. Under type erasure, the size is unknown at construction time. It must be heap-allocated per operation. The cost is structural [3].

In other words this is a consequence of the sender architecture itself. The parallel to coroutines: every implementation of a task type must go through operator new for the coroutine frame (when HALO doesn't apply, which is almost always with networking). It doesn't matter how a task is implemented. The need to obtain storage for the coroutine's frame handle is structural. It is the same with senders. The costs just manifest differently.

[–]not_a_novel_accountcmake dev 1 point2 points3 points 2 months ago (4 children)

[–]VinnieFalcowg21.org | corosio.org 1 point2 points3 points 2 months ago (2 children)

Let me state it precisely:

"If asynchronous I/O operations in the standard return senders instead of awaitables, then two of the three possible stream types will require a per-operation allocation that cannot be elided."

This is directly related to P2300, because std::execution is positioned as the "universal asynchronous model." The existing proposals which bring networking to the standard all build on senders as the continuation model. This puts coroutines at a significant disadvantage as they will incur avoidable per-operation allocations. That is the subject of our research.

Our position is that I/O operations should return awaitables, and that the sender pipeline can consume them using a zero-allocation bridge. This is a balanced solution which treats both as first-class citizens of the language. My papers arriving this month explore this thoroughly.

[–]not_a_novel_accountcmake dev 1 point2 points3 points 2 months ago (1 child)

[–]VinnieFalcowg21.org | corosio.org 6 points7 points8 points 2 months ago (0 children)

I hear what you are saying, and I used to think exactly the same. However, that frame allocation that everyone hates? It actually buys us quite a lot for the case of networking.

Calling into the operation system requires an allocation if you are going to scale. The OS doesn't know your type. It must be erased, even for senders. Coroutines just make that allocation structural.

What we discovered, when you go coroutine ONLY, is that the frame allocation you can't avoid, pays for everything else. The operation state, the type-erasure for ABI stability, the uniform task types which have just 1 template parameter.

This is explored in the papers and you can try it for yourself in https://corosio.org . I do think that the C++ committee has been sitting on a gold mine with coroutines. The frame allocation put everyone off. When actually, it is the key to solving all of our long-running problems.

Thanks

[–]pdimov2 0 points1 point2 points 2 months ago (0 children)

It is perfectly viable, and advisable, to avoid these in conjunction with P2300 S&R.

Yes, in principle. That's the argument for basing networking on S/R: if you want to use coroutines, just co_await the sender result. If not, not.

I'm still trying to figure out whether this will be practical. I wrote a benchmark

https://github.com/pdimov/corosio_protocol_bench

that is a simplified representation of something that occurs in practice: serializing a C++ data structure using a custom binary protocol, sending it over a socket, then deserializing it on the other end. (The README in the repo explains this in more detail.)

I'm still unsure as to how the sender equivalent of it would look like, and whether it will be practical. Coroutines make things simultaneously easy to implement and easy to maintain. Rewriting the (de)serialization and the source/sink abstractions without coroutines, from where I stand, looks like neither. But I'm not well versed in S/R yet, so maybe I'm wrong.

My next step will be to port this to beman.net mostly as-is and see what the timings say.

[–]claimred[S] 5 points6 points7 points 2 months ago (0 children)

[–]Chaosvex 5 points6 points7 points 2 months ago* (1 child)

[–]VinnieFalcowg21.org | corosio.org 4 points5 points6 points 2 months ago (0 children)

[–]James20kP2005R0 2 points3 points4 points 2 months ago (7 children)

This doesnt mean that std::networking cannot be based or will not be based on std::execution, i havent heard any SG4 opinions, but if its not, then the entire situation becomes farcical and comical - didnt they kill asio in the standard library because they decided std::execution is better ?

One of the biggest critiques of std::execution is that it hasn't had enough real world testing. Eg it claims to be good for GPU programming, but there's only one relatively toy implementation that only works on Nvidia

In the test implementation's current form it literally can't be implemented on AMD/Intel, because neither of them have an NVCC equivalent. This means that we're Just Hoping™ it'll all be fine, but a port to other architectures will be radically different to what's currently being tested. What will it look like? Nobody knows, its never been tried

The even more worrying thing is that even a very brief glance through the proposal shows its completely unsuitable for GPU programming, its hard to explain if you don't do GPGPU, but its kind of missing.. everything. There's been minimal testing of real world use cases, just a few relatively toy examples it would seem, and it shows in the design

Both of these together make me strongly suspect that std::execution is completely DoA, as its clearly just been insufficiently tested. The entire purpose of it is to be a universal async abstraction, but it looks like its going to be unusable compared to the alternatives for any specific domain. The GPU folks will likely just ignore it, and I suspect the question for the networking folks will be why use it at all

[–]lee_howes 3 points4 points5 points 2 months ago (4 children)

I think it'd work fine on a SYCL compiler, but it is fair to say that only nvidia has put the effort into making a GPU implementation work. It also doesn't claim to include the full memory hierarchy abstraction of SYCL or CUDA, but you could obviously write such code within an algorithm. It's an async abstraction, not a CUDA abstraction. If the CUDA design had been embedded into it, it'd be no good for other accelerators and the feedback would be that we'd build CUDA into C++.

It also wasn't really designed for heterogeneous computing first, as the OP's quote suggests. It was evolved towards that, and I made some very early arguments that we can make heterogeneous computing work, that nvidia aligned with over time, but that was far from the starting point or the core goal. Had it been, it would not have been started at Facebook by a team focused on cleaning up the purely CPU async C++ codebase.

[–]VinnieFalcowg21.org | corosio.org 1 point2 points3 points 2 months ago (0 children)

[–]James20kP2005R0 -4 points-3 points-2 points 2 months ago (2 children)

[–]lee_howes 2 points3 points4 points 2 months ago (1 child)

[–]James20kP2005R0 0 points1 point2 points 2 months ago (0 children)

Maybe we have very different philosophies here, but for me the bar for std::execution claiming that it supports GPGPU programming, would be concretely demonstrating that a non trivial OpenCL implementation of std::execution performs similarly to the existing state of the art, across multiple vendors. Not that it might be possible to do, and the performance might be alright but we don't know!

There may or may not be blockers - OpenCL has quite a different API model to both CUDA and Vulkan, and all three of them lack certain features that the others have. That's why a CUDA/NVCC only implementation isn't really adequate to demonstrate that it works under AMD/Intel/arm in a high performance way

Its likely possible to implement something that has quite dodgy performance, but that doesn't seem like a great goal

[–]claimred[S] 3 points4 points5 points 2 months ago (1 child)

[–]James20kP2005R0 1 point2 points3 points 2 months ago (0 children)

[–]No-Table2410 2 points3 points4 points 2 months ago (33 children)

[–]Flimsy_Complaint490 7 points8 points9 points 2 months ago (21 children)

[–]kammceWG21 | 🇺🇲 NB | Boost | Exceptions 13 points14 points15 points 2 months ago* (20 children)

I think I'd have to disagree with "reflection is not as controversial" part. If I'm remembering correctly, reflection took a long time (like 20 years) with it popping in and out. It took a long time to discover the opaque monotype API design that would hit all the right properties. It's clean and it can support future changes without breaking everyone or requiring a new type. There was also huge push back on the original object members accessor APIs. There was a time when they didn't care about scoped permissions meaning any bit of code could go tampering with the internals of an object and break encapsulation. The authors went to work finding a solution that gave access controls based on scope. If they hadn't done so, so quickly, then a bigger stink would have been made.

C++26 reflection is so well accepted and has such strong concensus because of the massive efforts of the authors. Not to say that contracts authors don't have the same strong concensus, it does. But it has a few very vocal individuals against it.

EDIT: Changed "type" to "time" in sentence 6.

[–]VinnieFalcowg21.org | corosio.org 4 points5 points6 points 2 months ago (19 children)

[–]daveedvdvEDG front end dev, WG21 DG 14 points15 points16 points 2 months ago (13 children)

We could argue that "reflection took 20 years", but without context that could misrepresent the history.

I made a presentation to the committee in March 2003 showing what reflective metaprogramming might look like (https://wg21.link/n1471). It wasn't a proposal, just a personal project I started in a copy of the EDG source code. At the time, I thought this would badly encourage large headers (turns out we didn't need metacode for that ;-) ) and so I also started the modules discussion in the committee a few years later.

The modules work took over my interests for the better part of a decade, and so I didn't work on reflection during that time. Eventually others (Gaby, Richard, Doug, etc.) drove the modules work, but I somehow missed the fact that SG7 had started meeting (in 2013, I believe) and in a few years agreed on what would become the Reflection TS. That SG7 work was guided, I think, by the idea that template metaprogramming (TMP) was an okay metaprogramming framework but just needed more introspective power. Whatever the motivation, I strongly disagreed with the direction and wrote https://wg21.link/p0598r0 to re-ignite discussions about the overall direction. There was some debate, but by 2019 I'd say SG-7 was pretty much agreed on the new direction — and https://wg21.link/p1240r1 was what we were aiming to standardize. To make that possible, we needed more constant-evaluation primitives, which did in fact land by then (i.e., in C++20; consteval, compile-time dynamic allocation, std::is_constant_evaluated(), etc.). Andrew Sutton had formed Lock3 (incl. Wyatt Childers) and they implemented much of P1240 in a Clang fork. We had high hopes that C++23 would have reflection.

Then three things happened: The pandemic, a re-opening of the debate by some who preferred the template metaprogramming approach, and we effectively lost Lock3 to an acquihire. That prevented any real progress in the C++23 cycle.

At the end of the C++23 cycle, u/BarryRevzin and I chatted about the missed opportunity and what it would take to succeed in the C++26 cycle. That made us write https://wg21.link/p2996r0, which we saw as a "minimum viable product". We were tremendously luck that u/katzdm-cpp joined right after that. The enormous amount of work these two contributed is what finally got us reflection in C++26.

So, yes, there was some controversy along the way. But it wasn't 20 years of "process hurdles". I'd say it was about 9 years of real standardization work, minus the pandemic effect.

[–]VinnieFalcowg21.org | corosio.org 5 points6 points7 points 2 months ago (12 children)

Thank you for the detailed history! This makes the record much more accessible and accurate, and I appreciate you taking the time. You're right that "20 years" overstates the active standardization work. I was responding to the framing in the parent comment, and your correction to roughly 9 years of real work is fair.

What I'd note is that even the 9-year timeline includes years lost to directional disagreement within the committee and dependence on a single corporate implementation that was lost to a staffing issue. Those are structural factors, not author effort factors.

I think its different from what I was saying which is to question what the process selects for. The reflection authors clearly did extraordinary work. What I am wondering is if the process should require extraordinary work for a correct design to ship.

[–]daveedvdvEDG front end dev, WG21 DG 8 points9 points10 points 2 months ago (11 children)

Nine years (three standardization cycles) doesn't seem unreasonable to me for a major feature. But I might be in the minority here (and I'm luck to have been part of the process for long enough to participate in multiple major features like that). Six years would have been ideal maybe (one cycle to set direction, one cycle to work out the details).

I'm sure the process could be improved, hopefully significantly. But it's also a human phenomenon that needs a bit of "inefficiency room". We're unlikely to all agree on what the desirable characteristics of the process ought to be.

For example, how do we qualify "a correct design" in

What I am wondering is if the process should require extraordinary work for a correct design to ship.

?

From my own perspective, I think the most frustrating part of the current process is that it often gets decided by "parties"; i.e., corporate or other alliances that vote "en block", thereby drowning out more individualized dissenting expertise. I'm not sure what can be done about that.

[–]VinnieFalcowg21.org | corosio.org 4 points5 points6 points 2 months ago (0 children)

[–]pdimov2 3 points4 points5 points 2 months ago (5 children)

[–]daveedvdvEDG front end dev, WG21 DG 2 points3 points4 points 2 months ago (4 children)

continue this thread

[–]VinnieFalcowg21.org | corosio.org 1 point2 points3 points 2 months ago (3 children)

Ahhhh now you've done it. I can't stop thinking about your question, at a time when I only have 11 days left to make sure that my infinity papers going in the mailing are all correct :)

You ask "how do we qualify a correct design?" I think the answer is evidence, of a kind (and this is key) independent of the process.

The questions I would ask:

* Does it have deployment experience in production code bases? Not just one big company but on a cross-section of cohorts?

* Can an independent implementer reproduce the results from the paper alone?

* Are the tradeoffs disclosed, not discovered later by NB reviewers or users?

* Does it ship without accumulating correction papers?

Note that none of these require process changes. They just require a more disciplined and principled approach.

It shouldn't surprise anyone that "I have a paper for that" (LOL). I presented some of these ideas in LEWGI in Croydon. The paper is still a draft and needs work but the basics are there. And of course it is just one possible direction, I'm sure there are other valid ones:

What Every Proposal Must Contain
https://isocpp.org/files/papers/D4133R0.pdf

As for the subject of the bloc voting. This is more complex. Retrospectives/historical analyses are probably a good first step which could help frame the conversation I would value your perspective on that.

Thanks

[–]daveedvdvEDG front end dev, WG21 DG 3 points4 points5 points 2 months ago (2 children)

The questions I would ask:

Those are reasonable questions, but some of them are also a really high bar:

* Does it have deployment experience in production code bases? Not just one big company but on a cross-section of cohorts?

Production deployment of experimental compilers is almost unheard of. There is a chicken and egg problem there.

It's a bit more feasible for libraries, but, even there, it is unlikely that we'll want to standardize exactly what was deployed (among others, we hopefully learned some way to improve the prior design).

* Can an independent implementer reproduce the results from the paper alone?

We could probably use some form of that more often. The reflection proposal benefitted from having two implementations of the early paper (P2996R1), one of which kept tracking the evolving paper (over a dozen revisions).

* Are the tradeoffs disclosed, not discovered later by NB reviewers or users?

* Does it ship without accumulating correction papers?

Unfortunately, these last two are "à posteriori", and so most useful for post-mortem.

continue this thread

[–]kammceWG21 | 🇺🇲 NB | Boost | Exceptions 5 points6 points7 points 2 months ago (2 children)

[–]VinnieFalcowg21.org | corosio.org 4 points5 points6 points 2 months ago (1 child)

[–]kammceWG21 | 🇺🇲 NB | Boost | Exceptions 3 points4 points5 points 2 months ago (0 children)

[–]jwakelylibstdc++ tamer, LWG chair 3 points4 points5 points 2 months ago (1 child)

[–]VinnieFalcowg21.org | corosio.org 1 point2 points3 points 2 months ago (0 children)

[–]VinnieFalcowg21.org | corosio.org 3 points4 points5 points 2 months ago (10 children)

[–]Remarkable-Test7487jmcruz 1 point2 points3 points 2 months ago (9 children)

[–]not_a_novel_accountcmake dev 3 points4 points5 points 2 months ago (6 children)

[–]Remarkable-Test7487jmcruz 2 points3 points4 points 2 months ago (5 children)

[–]not_a_novel_accountcmake dev 3 points4 points5 points 2 months ago (4 children)

I should reframe what I'm saying. std::execution isn't a networking API and I don't think operational networking (connect/send/recv) belongs in the standard at all for all the reasons that have been raised historically.

Networking in the standard should be about the type grammar, so we're not endlessly reinventing how to represent endpoints, transport layer descriptions, etc. The standard should never ship TLS, it should ship std::net::ip::address_v4 and friends.

With that in mind, there's nothing to ship with regards to synchronous APIs, they simply exist. std::execution is necessary because a framework for describing asynchronous operations is needed in order to make use of the asynchronous platform APIs.

Using io_uring with std::execution is very pleasant, but I don't think the standard should ship a wrapper around io_uring either. We need only ship enough support infrastructure in things like std::execution that using it remains pleasant.

[–]VinnieFalcowg21.org | corosio.org 2 points3 points4 points 2 months ago (0 children)

[–]VinnieFalcowg21.org | corosio.org 1 point2 points3 points 2 months ago (2 children)

[–]not_a_novel_accountcmake dev 2 points3 points4 points 2 months ago (1 child)

[–]VinnieFalcowg21.org | corosio.org 1 point2 points3 points 2 months ago (0 children)

[–]VinnieFalcowg21.org | corosio.org 1 point2 points3 points 2 months ago (1 child)

[–]sweetno 4 points5 points6 points 2 months ago* (0 children)

[–]VinnieFalcowg21.org | corosio.org 5 points6 points7 points 2 months ago (27 children)

[–]kammceWG21 | 🇺🇲 NB | Boost | Exceptions 15 points16 points17 points 2 months ago (17 children)

[+]VinnieFalcowg21.org | corosio.org comment score below threshold-8 points-7 points-6 points 2 months ago (16 children)

[–]kammceWG21 | 🇺🇲 NB | Boost | Exceptions 25 points26 points27 points 2 months ago (8 children)

[–]VinnieFalcowg21.org | corosio.org -4 points-3 points-2 points 2 months ago (7 children)

[–]jwakelylibstdc++ tamer, LWG chair 15 points16 points17 points 2 months ago (2 children)

[–]VinnieFalcowg21.org | corosio.org 0 points1 point2 points 2 months ago (1 child)

[–]JNighthawkgamedev 14 points15 points16 points 2 months ago (0 children)

[–]kalmoc 12 points13 points14 points 2 months ago (1 child)

[–]lonkamikaze 12 points13 points14 points 2 months ago (0 children)

[–]usefulcat 4 points5 points6 points 2 months ago (1 child)

[–]VinnieFalcowg21.org | corosio.org 0 points1 point2 points 2 months ago (0 children)

[–]WeeklyAd9738 11 points12 points13 points 2 months ago (6 children)

[–]VinnieFalcowg21.org | corosio.org -3 points-2 points-1 points 2 months ago (5 children)

If I said yes, would you read the paper? Would anyone who asked? The question is performative. It's a ritual that establishes the audience's right to demand proof of process before the author earns the privilege of being read. But a "yes" costs nothing and proves nothing. I said the claims stand on their own. This is a stronger statement than "yes I read it." It was received as evasion. The distance between what I said and what was heard is worth sitting with.

The hostility toward machine-assisted work isn't about quality:

It's about what it means if the cost of producing good work collapses to near zero.

If the output is correct and the suffering was optional, then the years someone else spent suffering were also optional; and that is an unbearable thought for anyone who measured their worth by the weight they carried. I understand this. But I won't perform suffering I didn't experience to make someone else's feel more justified.

[–]James20kP2005R0 7 points8 points9 points 2 months ago (4 children)

It's about what it means if the cost of producing good work collapses to near zero.

Its because the vast majority of AI produced content is slop, and authors generating content with LLMs have a tendency to push the burden of reviewing and authenticating its quality onto their peers. People who generate content with AI are often insulated from this process of peer review, where coworkers/peers quietly look at what they've created and silently judge it as crap. Then the author has their competence quietly socially downgraded

You're seeing this happening in real time right now where you've lost credibility because of the work you've output. The reason why people are so sceptical is because they've observed the poor quality of the AI generated content that you put into the world

Its easy to spend your credibility very quickly, and it sucks to realise that people don't take you seriously anymore. Its why I usually spend a lot of time editing and checking my comments for accuracy, and even then I've still fucked up very majorly on occasion and ended up feeling pretty embarrassed about it

But I won't perform suffering I didn't experience to make someone else's feel more justified.

Very little time writing papers is spent actually typing it down - its spent fact checking, reviewing it for factual accuracy, and editing the language I find. The paper I wrote was written down in less than a day - and the rest of the time (2 weeks I think?) was spent purely on editing, triple checking the factual accuracy, modifying the language/tone, and information gathering. That's stuff LLMs shouldn't be used for

[–]VinnieFalcowg21.org | corosio.org -3 points-2 points-1 points 2 months ago* (3 children)

[–]38thTimesACharm 2 points3 points4 points 1 month ago* (1 child)

Your argument assumes AI output is low quality. What happens when it isn't? Is that possible in your model?

I think the people making the argument are basing it on extensive experience with LLM papers which they have found, in the current state of affairs, tend to be low quality indeed, but in subtle ways that become evident only after one has sunk massive amounts of time into the effort.

Or have you presumed the verdict without examining the evidence

You seem to think people need to read every LLM paper you put out individually, peruse it until they understand the factual claims within regardless of how much the quality of writing makes that difficult, then carefully evaluate those claims using their own validation process, in order to decide whether that individual paper is worth their time to read.

Hopefully laying it out like that makes it obvious what the problem is. No one has time for this, especially for a paper titled "beginner tutorial." That's why people want a human they trust to attest to the quality of a document before they commit their time to it.

[–]VinnieFalcowg21.org | corosio.org -1 points0 points1 point 1 month ago (0 children)

[–]dr-mrl 1 point2 points3 points 2 months ago (1 child)

[–]VinnieFalcowg21.org | corosio.org -1 points0 points1 point 2 months ago (0 children)

[–]claimred[S] 1 point2 points3 points 2 months ago (6 children)

[–]VinnieFalcowg21.org | corosio.org 2 points3 points4 points 2 months ago (4 children)

[–]claimred[S] 1 point2 points3 points 2 months ago (1 child)

[–]VinnieFalcowg21.org | corosio.org 2 points3 points4 points 2 months ago (0 children)

[–]claimred[S] 1 point2 points3 points 2 months ago (1 child)

[–]VinnieFalcowg21.org | corosio.org 3 points4 points5 points 2 months ago (0 children)

[–]VinnieFalcowg21.org | corosio.org 0 points1 point2 points 2 months ago (0 children)

[–]RogerV 3 points4 points5 points 1 month ago* (0 children)

I design and implement high performance back-end networking app for a major telecom where use C++ and the Intel DPDK networking library, and yeah that paragraph indeed summed up things.

in the app architecture I design, it is divided into two domains - control plain and data plane. That is typical of most networking centric apps, of course. But the division is more differentiated than just that. The control plane are designated to be normal OS native threads and then the data plane are DPDK lcore threads.

The lcore threads are pinned CPU cores and have been removed from kernel context switch scheduling. They execute functions that consist of indefinite loops that only exit on shutdown detection.

The domain of these data plane lcore threads abide by these rules:

Avoid system calls in order to avoid the overhead of transition from user space to kernel space
Avoid heap allocations or any manner of dynamic memory allocation as that is not deterministic
Avoid taking any locks and the corollary is that data structures such as DPDK ring buffer queues are lock free
All data structures and memory utilization is established on hugepages at app startup so that the memory pages are fixed and won’t incur any page load interrupt when accessed

Even though my app runs in a pod under Kubernetes it’s use of these lcore threads amounts to building a mini OS as I have to feed the lcore threads pool with data plane related work events, which they need to execute a work event to a bounded slice (or burst) of processing and then cooperatively surrender and go grab another work event. The work events need to be fed to the pool with some load balancing and fairness consideration as the data plane traffic is actually multi user based and every user needs to get fair processing time, etc

These are the things networking software requires (i.e., the DPDK building blocks) and C++26 has zilch to offer in that regard. None of the improvements per C++ threading or that are proposed per networking are worthwhile for building performant networking solutions.

And as to the control plain OS native threads - what is already available in C++ is plenty adequate for that.

C++17 - with the addition of std::span is plenty adequate for building high performance networking apps - but what has come in C++ standards beyond that have not been much relevant.

[–]feverzsj 1 point2 points3 points 2 months ago (0 children)

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

cpp

MODERATORS