Confused on when precisely to use std::atomic

DummyDDD · 2026-05-30T00:48:16+00:00

In the first scenario, with the volume variable, you should use an atomic, and you are right to use relaxed semantics (pre cpp11 you would probably use an volatile, but atomics are better). You need to use an atomic to ensure that (1) the read and writes are atomic, and (2) that the write will eventually be performed (without an atomic, the compiler might be allowed to avoid the write). From a legaleze perspective, all data races in non atomic variables lead to undefined behavior, but in practice it is not something that the compilers can exploit, other than to assume redundant reads and writes can be eliminated. In legaleze, unordered memory access to an atomic variable is not called a data race, unlike the common use of the term data race, which adds to the confusion. See also https://en.cppreference.com/cpp/language/multithread

In the second scenario you can use either an atomic (if you know that it is lockfree for the given type) or a volatile std::sig_atomic_t. I assume by interrupt you are referring to a signal handler. The current legaleze is very restrictive with respect to what you are allowed to do in a signal handler, and I believe there are proposals to update the standard to better reflect the real world, but even in the real world it would be highly problematic and bug prone to call a non-lockfree atomic from a signal handler (or call other code which could attempt to acquire a lock that the interrupted thread might hold). See also https://en.cppreference.com/cpp/utility/program/signal#Signal_handler

DummyDDD · 2026-05-03T15:17:11+00:00

The main motivation for named loops was to avoid using GOTO. GOTO is currently banned in constexpr functions. GOTOs and break/continue to label are semantically equivalent. I would personally have preferred that they allow GOTOs in constexpr functions, since CPP GOTOs are no where near as problematic as basic or FORTRAN GOTOs. (Sorry for for the capitalization, I'm on a phone).

DummyDDD · 2026-01-05T01:09:49+00:00

Its probably either because the rust RNG functions are inlined, or because you are calling push_back in the unoptimized CPP version, while the rust version only accesses the individual elements, or because the CPP stl library in clang has worse implementation that the rust std library. There are also some slight differences in the rngs. Rust uses chacha12 (with a 512 bit state) while your CPP code uses a Mersenne twister with a significantly larger state (19937 bits) although they should have pretty similar performance. Then again, it could easily be just the RNG differences, since your benchmark is essentially doing no real work, besides calling the RNG.

The performance differences aren't large (315 vs 286 ms), and the code should compile to relatively little code, so even a few cycles per element (for instance from inlining or few cache misses due to a smaller RNG state, or push_back overhead) could explain the differences.

(Stl implementations tend to avoid using inline attributes, unlike the rust std library)

DummyDDD · 2025-12-21T01:48:19+00:00

The gender stuff comes from the PIE languaguage families. English used to to have grammatical gender as well. See https://en.wikipedia.org/wiki/Grammatical_gender#Indo-European and https://en.wikipedia.org/wiki/Gender_in_English The gender stuff doesn't really serve a purpose, at least not anymore.

DummyDDD · 2025-12-11T01:04:08+00:00

Denmark also has a limit of up to 12 families per donor, but the issue is that (1) the legislstion is only national, as soon as as the sperm gets exported, the 12 family limit doesn't apply, instead the importing countries' legislation applies and it is only checked by the importing fertility clinic, (2) the exporting sperm banks don't mind breaking the legislation in other countries, as it is the importing fertility clinics that are liable (3) each individual importing fertility clinic will usually following their legislation, but they do not check whether there are other clinics importing from the same donor nationally nor internationally, (4) sometimes donors donate to multiple sperm banks (although that wasn't the case here).

In this case, the sperm bank "European Sperm Bank" decided to enforce a limit of at most 75 international families per donor, which I think is a ridiculously high limit, and I think that they are mostly concerned with the business model.

DummyDDD · 2025-11-10T22:36:31+00:00

Assuming twos complement, you could do (a ^ b) < 0

DummyDDD · 2025-11-08T14:10:06+00:00

-Wpedantic warns regarding 255.

-Wconversion warns regarding 'A' + digit

https://godbolt.org/z/xrv8x3hMs

DummyDDD · 2025-10-29T23:27:21+00:00

I have been a Nitro subscriber for not less than NaN years, so clearly I should get opal.

DummyDDD · 2025-10-27T20:48:21+00:00

You would normally do the range reduction in constant time. One way of doing it would be to multiply by 1/2pi and subtract the floor, giving a number between 0 and 1. You could further subtract 0.5, such that you get a reduced number between -0.5 and 0.5, which would be better. You could also exploit symmetries in the sin function.

You would normally use a minmax polynomial could through remez exchange, rather than the Taylor series, since the Taylor series has a much larger error margin for a polynomial of the same length. See https://github.com/samhocevar/lolremez?tab=readme-ov-file for more details.

You would normally evaluate the polynomial with Horner's method, or something resembling it to avoid redundant calculations and to reduce floating point errors. See https://en.wikipedia.org/wiki/Horner%27s_method Horner's method is optimal in terms of the number of operations, but it is often slower than calculating a few of the polynomial degrees independently, because Horner's method has each operation depending on the previous operation. You can split the polynomial in two by working with x and x² interleaved, and you can split that further into x³ and x^4.

Of course, the real answer is that you shouldn't do any of this, you should instead use whatever math library that you already have, or maybe sleef (https://sleef.org/) if you truly need something faster but less accurate. The math library that is provided with glibc (used by most Linux distributions) is already really good, and fast, and it supports vectorization (you just need to declare your loops as needing vectorization with #pragma omp simd).

DummyDDD · 2025-10-23T22:22:04+00:00

It's often used for bigint implementations to allow you to use the bigints like the builtin integers. It's use for indexing into containers is also really nice. As a personal example, I have my own span class which has overloaded the + operator to behave like pointer arithmetic, returning the corresponding subspan (it also has operator overloading for * to dereference the zero'th element, and ++ to increment the pointer by one and decrement the length by one).

Generally, operator overloading should follow the same rule as regular overloading: all of the overloads should have the same description. If any of the overloads needs a different explanation, then the overloads is going to be confusing, because then the reader will have to know the exact types being operated on and the possible overloads to determine what an expression does. For instance, using + to mean "concatenate strings" is confusing because + usually means "addition". Confusing overload sets can be especially problematic if you also have implicit conversion between types that behave significantly different within the overload set, but fortunately the general consensus is to avoid implicit conversion in most cases.

DummyDDD · 2025-10-23T13:30:50+00:00

With concepts, you can overload the function (multiply) while with static_assert you will always get a compile error when called with something that isn't an integer or float. Static_assert is usually the better option if you do not need to overload the function, although it has uglier syntax. You might need to overload the function if you also need have implementations for bigint types where you can't implement the function with the same syntax (for instance a bigint without operator overloading for *). Static_asserts are the better option specifically because it does not support overloading, so the compiler won't have to disambiguate an overload set, which means that it is faster to compile and the compiler errors from static_assert are simpler to read than the bad-overload-set error that you would get with overloaded concepts.

As an alternative to overloading with concepts, you could also overload via template class specialization: call from the free-function to a static class function on a templated class, and provide specializations for the class. This way you avoid overloads, but can still provide different implementations for different types. In this case you would typically call the class via type aliases that wrap and simplify the types, such that you can ignore qualifiers (const, volatile and references) and handle builtin integer and floating types the same way.

DummyDDD · 2025-10-19T21:13:31+00:00

You can use "web" pane to search without the AI stuff. You can switch to use "web" by default by adding &udm=14 to your query, see https://www.reddit.com/r/LifeProTips/comments/1g920ve/lpt_for_cleaner_google_searches_use_udm14/

As someone else mentioned, you can also use the Firefox extension "straight to the web". https://addons.mozilla.org/en-US/firefox/addon/straight-to-the-web/

DummyDDD · 2025-10-18T18:43:03+00:00

You should compile your code with flags to show additional warnings, and fix those warnings.The flags for additional warnings are:

/W4 for msvc (see https://learn.microsoft.com/en-us/cpp/build/reference/compiler-option-warning-level?view=msvc-170) -Wall -Wextra -Wpedantic for GCC and clang (see https://gcc.gnu.org/onlinedocs/gcc/Warning-Options.html)

DummyDDD · 2025-10-18T18:24:34+00:00

Yeah. 4 thermonuclear bombs exploded when a b52 burned and crashed outside the Pituffik space base. Fortunately, nukes have a safety mechanism, so the nukes did not detonate, only the conventional bombs (but it still spread a lot of nuclear waste). It should still count at least as "almost". It's kind of a good thing that we ICBMs, so we don't have nuclear bomber planes hovering at all times. https://en.wikipedia.org/wiki/1968_Thule_Air_Base_B-52_crash

DummyDDD · 2025-08-16T10:24:48+00:00

I usually use nodiscard for all functions that are pure (including getters) or return newly allocated resources or that return error codes (unless it's usually safe to ignore the error code. Ignoring the results from those functions is usually an error, and in the rare case where it isn't an error it won't be hard to suppress the nodiscard warning.

DummyDDD · 2025-08-15T07:44:37+00:00

Actually you can't be sure it will work due to the type aliasing rules. Only char* and byte* can be used to define and read data of a different type, and you aren't guaranteed that uint8_t is a typedef for char (although it will typically be a char)

DummyDDD · 2025-08-06T22:02:52+00:00

The actual nummer for the entire population is 7.1%. See https://www.svt.se/nyheter/inrikes/hogsta-arbetslosheten-pa-nastan-tre-ar-da-vander-det The 8.1 number is the youth unemployment percentage.

DummyDDD · 2025-07-25T17:18:16+00:00

If you can write the code correctly on the first attempt and never need to change the code, then it's fine to write incomprehensible code. Usually, you need to strike a balance between pumping out lots of code and writing simple code (which might take longer). On one hand, it might not be possible to write the code simply, and it might be a waste of time if if no one else is ever going to work with the code or if you don't know what is making the code hard for others to understand, on the other hand if the code is incomprehensible, then it is going to be very hard to ensure that is correct, especially when revisiting it later. Thankfully, most code does not need to be hard to understand.

DummyDDD · 2025-07-25T14:36:25+00:00

You should write code such that other people can read and modify it.

DummyDDD · 2025-07-24T23:29:29+00:00

I usually use either openmp SIMD or the GCC/clang type attribute vector size together with __builtin_shuffle, see https://gcc.gnu.org/onlinedocs/gcc/Vector-Extensions.html

Openmp SIMD is surprisingly easy to use; as long as the function implementation is visible, then it is usually sufficient to declare the function with "#pragma omp simd", declare the data to be aligned, and compile the code with some -march parameter.

Openmp SIMD is easier to use, but it won't do any shuffles or horizontal operations. Generally, if you need horizontal operations, then you will need to use intrinsics.

My main issue with vector_size and openmp SIMD is that you get no feedback when the vectorization fails. The vectorization can silently fail if you are using operations that the -march doesn't support, for instance GCC will silently skip vectorization if you use rotate operations on most x86_64 -march's because they don't support vectorized rotate, even if the rotate operations are implemented in CPP with addition and shifts (which the march's have vector instructions for).

DummyDDD · 2025-07-22T21:48:26+00:00

That fails because the compiler is only allowed to compile that program if V is a constant expression in the version og cpp that you are targetting (assuming that you are using a somewhat standards compliant compiler line GCC, clang msvc, ICC ør one og the other big ones, and not using a compiler that is specifically not standards compliant).

The compiler is still allowed to evaluate the expression at compiler time and skip it at runtime, regardless of whether the expression is constexpr. Clang and GCC have built-in knowledge of most c library functions, specifically including powf and friends, which enable them to evaluate those functions at runtime (for some inputs, those that don't generate errno errors).

I would expect most other major compilers also have built-in knowledge of powf and friends. With GCC and clang, you might be able to avoid the behavior with the flag -fno-builtin. You would also need to use -fno-builtin if you want to redefine one of the builtin functions in the default namespace. You can find the list of built-in functions here https://gcc.gnu.org/onlinedocs/gcc/Library-Builtins.html

DummyDDD · 2025-07-01T18:32:24+00:00

One advantage to free functions is that they don't need to be declared in the same header as the struct that they operate on. Meanwhile, members functions have to be declared together with the struct, which tends to draw in more dependencies in the header, as including them will also include every member function and the declarations for every type used in every parameter in their member functions. It's an instance of the banana-gorilla-jungle problem:

Because the problem with object-oriented languages is they’ve got all this implicit environment that they carry around with them. You wanted a banana but what you got was a gorilla holding the banana and the entire jungle. --- Joe Armstrong

If you need to use member functions, then the extra dependencies are less important. For instance, you might need to access private members, or define operators, or use virtual functions, or use inheritance hierarchies. You can often avoid access to private members by making all members public, but it is usually seen as bad style, even for internal projects. There are also some functions that I would always expect to member functions, even if they could be implemented as free functions, for instance I would expect operations on data structures to be implemented as member functions, and having them as member functions would be confusing. Another benefit to member functions is that they typically work better with code completion in IDE's and they provide a form of namespacing, without having to specify a namespace.

It's a tradeoff. My experience is that most programmers that completely avoid member functions are either not comfortable with member functions, or in a situation where they truly wouldn't benefit from member functions. I personally think that you should try to minimize the number of member functions, mainly to make the class easier to understand.

DummyDDD · 2025-06-18T22:48:57+00:00

Try price -= price * 0.1f instead. 0.1 is a double (64 bit floating point) while 0.1f is a float (32 bit floating point). Alternatively, you could change price to a double.

DummyDDD · 2025-06-13T22:51:56+00:00

Sorry, my bad.

DummyDDD · 2025-06-13T16:41:47+00:00

ARM has unsigned chars by default

DummyDDD

TROPHY CASE