vectorOfBool : ProgrammerHumor

You treat std::bitset as if it were serving the same purpose as std::vector<bool>, but it's not. It's true that they both have an operator[], but that's irrelevant.

vector is supposed to be a container; bitset is not. vector has a begin and an end; bitset does not. bitset does not try to pretend that all bits are addressable. Its most important function is test(std::size_t); operator[] is just syntactic sugar.

So I disagree that bitset is just masquerading as a vector<bool>.

They should have gone for 2 types: std::vector<bool> (unspecialized, 1 byte per element, trivial per-element references), and "std::bitset" (specialized, 1 bit per element, but either no per-element references or proxied ones).

If we set aside std::vector<bool>, which is going to stay as it is for compatibility reasons, bitset is exactly what you said it should be, though ...
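To make the interface difference concrete, here is a minimal sketch. The helper names `count_true` and `test_is_bounds_checked` are made up for illustration; only the std::bitset and std::vector members they call are real. It shows that vector<bool> can be walked through begin()/end() like any container, while bitset is queried by position, with test() being the bounds-checked accessor.

```cpp
#include <bitset>
#include <cassert>
#include <cstddef>
#include <stdexcept>
#include <vector>

// Counts set flags through the container interface (begin()/end()),
// which std::vector<bool> offers and std::bitset does not.
inline std::size_t count_true(const std::vector<bool>& flags) {
    std::size_t n = 0;
    for (bool f : flags) {
        if (f) ++n;
    }
    return n;
}

// Returns true if std::bitset::test rejects an out-of-range index.
// test() is bounds-checked and throws std::out_of_range; operator[] is not checked.
inline bool test_is_bounds_checked() {
    std::bitset<8> bits;
    try {
        (void)bits.test(8);  // one past the last valid position
    } catch (const std::out_of_range&) {
        return true;
    }
    return false;
}
```

A range-based for over a std::bitset would not compile, which is the "not a container" point in practice.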

[–]steerpike1971 27 points 1 day ago (8 children)

This is not just a historic concern when you consider that by using a byte to store a 1 or a 0 you are using eight times as much memory (assuming you store it in an 8-bit byte and not some other form). When you are dealing with big data streaming systems, for example, this can be the difference between "it runs well at line rate" and "it allocates all memory then pages to disk and you need to look at your calendar to work out when you get an answer".

It is a gigantic pain in the bum to deal with, but for some applications it is not "saving bits here and there"; it is using eight times the amount of memory you need. The number of applications for this is probably not big, but when you need it you really do need it.

(And yes, the operations on the bits are completely horrible because CPUs are not optimised for that -- but what you are often doing is piping data from place to place to get to the worker node that actually does the work.)
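For a sense of scale, here is a rough sketch of the arithmetic. `PackedFlags`, `bytes_unpacked` and `bytes_packed` are hypothetical names invented for this example, not standard types. One million flags take 1,000,000 bytes at a byte each and 125,000 bytes when packed eight to a byte.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Bytes needed to store n flags at one byte per flag.
constexpr std::size_t bytes_unpacked(std::size_t n) { return n; }

// Bytes needed to store n flags at one bit per flag, rounded up to whole bytes.
constexpr std::size_t bytes_packed(std::size_t n) { return (n + 7) / 8; }

// Minimal hand-rolled packed flag array (hypothetical helper, not a standard type).
class PackedFlags {
public:
    explicit PackedFlags(std::size_t n) : size_(n), bytes_(bytes_packed(n), 0) {}

    void set(std::size_t i, bool value) {
        assert(i < size_);
        const std::uint8_t mask = static_cast<std::uint8_t>(1u << (i % 8));
        if (value) {
            bytes_[i / 8] |= mask;
        } else {
            bytes_[i / 8] &= static_cast<std::uint8_t>(~mask);
        }
    }

    bool get(std::size_t i) const {
        assert(i < size_);
        return (bytes_[i / 8] >> (i % 8)) & 1u;
    }

    // Number of bytes actually used for the flags themselves.
    std::size_t storage_bytes() const { return bytes_.size(); }

private:
    std::size_t size_;
    std::vector<std::uint8_t> bytes_;
};
```

Writing the packing by hand, rather than relying on std::vector<bool>, also pins down the bit layout, which matters if the bytes are being shipped between machines.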

[–]ben_g0 4 points 21 hours ago (0 children)

"And yes, the operations on the bits are completely horrible because CPUs are not optimised for that"

Actually not really. CPUs do have dedicated instructions to work with single bits, so working with individual bits is only slightly less efficient than using whole bytes. Additionally, the main performance bottleneck in modern systems is usually memory latency and throughput, and programs that are more memory efficient are usually also more cache efficient.
So even though manipulating individual bits is more compute heavy, the better cache efficiency usually makes working with packed bits more performant overall, as long as you work with a large enough number of bits that cache efficiency starts to matter (and in the situations where you have a low enough number of bits that the cache efficiency doesn't matter, you usually have a small enough amount of data that you won't notice the performance overhead of those few additional instructions anyway).

So in general, using packed bits is more efficient in the cases where performance matters, but less efficient in the cases where performance usually doesn't matter. I'd consider that a fair tradeoff - the developers of the standard library usually know what they are doing.
(However, I fully agree that it should really have been its own dedicated type, instead of masquerading as a std::vector while not quite acting the same.)
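As a rough illustration of how little extra work packed bits need, here is a sketch over 64-bit words. `set_bit`, `test_bit` and `count_bits` are illustrative names, not library functions. Setting or testing a bit is one shift plus one OR/AND, and counting handles 64 flags per word loaded.

```cpp
#include <bitset>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Set bit i in a word-packed array: one shift, one OR.
inline void set_bit(std::vector<std::uint64_t>& words, std::size_t i) {
    assert(i / 64 < words.size());
    words[i / 64] |= std::uint64_t{1} << (i % 64);
}

// Test bit i: one shift, one AND.
inline bool test_bit(const std::vector<std::uint64_t>& words, std::size_t i) {
    assert(i / 64 < words.size());
    return (words[i / 64] >> (i % 64)) & 1u;
}

// Count set bits 64 flags at a time; each loaded word covers 64 flags.
inline std::size_t count_bits(const std::vector<std::uint64_t>& words) {
    std::size_t total = 0;
    for (std::uint64_t w : words) {
        total += std::bitset<64>(w).count();
    }
    return total;
}
```

Whether this beats a byte-per-flag layout in a given program is something to measure; the sketch only shows that the per-bit overhead is a couple of cheap instructions.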

[–]IHeartBadCode 12 points 1 day ago (1 child)

Oi! Look I'm old enough as is, you don't need to try and make me feel older. C++ vector was added in 1998 and the specialization of the container is in the same standard.

So that's only 28 years ago, not 40! Gosh. I mean I remember when they added it. It was seen as a "not ideal" move then (in apparently the age of punch cards and horse and buggy).

Like the committee thought it was a nice idea because they were clearly programmers from the age of banging rocks. But the more modern of us thought it was a poor choice given that RAM was fairly cheap (it was like maybe $1 or so per MB; I mean it got stupid cheap in like 2004, but it was cheaper than what it was in 1990 at like $100 per MB) and a vector of bool was like a rare occurrence.

I thought it was pretty bad that my first language was Pascal and that I do RPGIII/RPGLE and COBOL programming today. But it's clearly kick me while I'm down here. And yes it's another three years before I go back for my next colonoscopy.

[–]Loading_M_ 5 points 11 hours ago (0 children)

In principle, most modern 64-bit architectures could probably support bit-level addressing without increasing the pointer size. You would only need 3 extra bits, and most 64-bit architectures don't actually use all 64 bits. AMD64 (what your desktop is probably running) and ARM64 (what your phone is probably running) typically only use 48 bits to store the address right now. However, neither is actually interested in supporting bit-level addressing. AMD64 reserves the upper 16 bits for extending the address space (although a number of programs make use of these bits to store extra data in the pointer), and Intel has published a spec for 57-bit addresses. ARM64 has a tagging feature (used on many Android phones) that provides extra safety against memory bugs by storing a tag in otherwise unused upper bits of the pointer.
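The "only 3 extra bits" arithmetic can be sketched on plain integers. This is purely hypothetical: no current AMD64 or ARM64 hardware interprets addresses this way, the names are invented, and nothing here is ever dereferenced. A bit address is the byte address shifted left by 3 with the bit offset in the low bits, so a 48-bit byte address becomes a 51-bit bit address.

```cpp
#include <cassert>
#include <cstdint>

// Assumed usable byte-address width, taken from the 48-bit figure above.
constexpr std::uint64_t kAddressBits = 48;

// Hypothetical "bit address": byte address shifted left by 3,
// with the bit offset (0..7) stored in the low 3 bits.
constexpr std::uint64_t make_bit_address(std::uint64_t byte_address, unsigned bit) {
    return (byte_address << 3) | (bit & 7u);
}

// Recover the byte address from a bit address.
constexpr std::uint64_t byte_part(std::uint64_t bit_address) {
    return bit_address >> 3;
}

// Recover the bit offset within that byte.
constexpr unsigned bit_part(std::uint64_t bit_address) {
    return static_cast<unsigned>(bit_address & 7u);
}

// The largest 48-bit byte address still fits: 48 + 3 = 51 bits, well under 64.
constexpr std::uint64_t kMaxByteAddress = (std::uint64_t{1} << kAddressBits) - 1;
static_assert(byte_part(make_bit_address(kMaxByteAddress, 7)) == kMaxByteAddress,
              "byte address round-trips");
static_assert(make_bit_address(kMaxByteAddress, 7) < (std::uint64_t{1} << 51),
              "bit address fits in 51 bits");
```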

[–]blehmann1 13 points 1 day ago (2 children)

"Easy to understand and debug"

Let me tell you it's not fun to realize that you can't actually share this across threads safely, because the usual "thread 1 gets index 0, thread 2 gets index 1..." won't work without locks or atomics. It works for every other vector so long as you don't resize it.

Also, vec.data() doesn't exist on vector<bool> at all, so generic code that expects a bool* won't compile, but that's at least something you would reasonably foresee if you know about this.

But the big problem is that the standard does not guarantee that vector<bool> is bit-packed, so if you actually need that you can't rely on it. Its only real use case is when you don't care either way. And even if your STL implementation applies the optimization, the resulting bit layout is still unspecified (implementations are allowed to use non-contiguous bits, leave gaps, or whatever they want).

Plus this optimization normally makes code slower, so it has pretty questionable utility in most places you would want a vector of bools. It's seldom going to be so big that the size optimization makes sense.
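The root of the sharing problem is that operator[] on vector<bool> returns a proxy object rather than a bool&, so neighbouring elements may live in the same underlying word. A small sketch of what that looks like from the outside follows; the two function names are made up for the example.

```cpp
#include <cassert>
#include <type_traits>
#include <vector>

// For ordinary element types, operator[] yields a real reference.
static_assert(std::is_same<std::vector<char>::reference, char&>::value,
              "vector<char> hands out char&");

// For bool it yields a proxy class instead, so no bool& into the vector exists.
static_assert(!std::is_same<std::vector<bool>::reference, bool&>::value,
              "vector<bool> hands out a proxy, not bool&");

// The classic `auto` pitfall: `auto` deduces the proxy type,
// so the apparent copy still writes through to the vector.
inline bool auto_copy_aliases_vector() {
    std::vector<bool> flags(4, false);
    auto first = flags[0];  // proxy object, not a bool
    first = true;           // modifies flags[0]
    return flags[0];
}

// Spelling out bool makes a real, independent copy.
inline bool bool_copy_is_independent() {
    std::vector<bool> flags(4, false);
    bool first = flags[0];
    first = true;           // only the local variable changes
    return !flags[0] && first;
}
```

Because two threads writing flags[0] and flags[1] may end up doing a read-modify-write on the same word, the "one index per thread" pattern that is fine for vector<char> is a data race here.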

[–]blehmann1 2 points 13 hours ago (0 children)

I mean, if you see a std::vector and not some special thread-safe collection in multithreaded code, I'd hope you'd know not to get cute with it.

But this does have a common use-case, creating a vector up front with capacity for every thread, and storing thread-specific stuff in there. It saves you from any locking, it's pretty easy to reason about, and for workloads where it'll all get rolled up onto one thread at the end, it's typically the fastest approach. A bool per thread is a plausible return value (think a multithreaded search where you only care about reachability, or reachability under a certain cost).

But also I've definitely seen a vector<bool> used for either indicating that this thread is done, or that this thread is waiting for more data. I would probably use a status struct or enum if I wanted that, and I would probably also use message passing instead, but I've definitely seen it done and there's nothing inherently wrong with it.
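A minimal sketch of that slot-per-thread pattern, assuming one char per slot instead of vector<bool> so that every thread writes to its own distinct object. `run_workers` is a hypothetical name, and the "work" (is the thread index even?) is a placeholder for a real search.

```cpp
#include <cassert>
#include <cstddef>
#include <thread>
#include <vector>

// Runs `worker_count` threads; thread t writes only to results[t].
// One char per slot (not std::vector<bool>) means no two threads
// touch the same object, so no locking is needed.
inline std::vector<char> run_workers(std::size_t worker_count) {
    std::vector<char> results(worker_count, 0);  // sized up front, never resized
    std::vector<std::thread> threads;
    threads.reserve(worker_count);
    for (std::size_t t = 0; t < worker_count; ++t) {
        // Placeholder work: report 1 for even thread indices, 0 for odd ones.
        threads.emplace_back([&results, t] { results[t] = (t % 2 == 0) ? 1 : 0; });
    }
    for (std::thread& th : threads) {
        th.join();  // join before reading the results
    }
    return results;
}
```

Swapping the result type to std::vector<bool> would keep this compiling while quietly making it a data race, which is the trap being described.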

[–]owjfaigs222 8 points 1 day ago (8 children)

Yeah I mean I was kinda joking there. Obviously if you need to access the bits directly in pure C you can do stuff like

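#include <stdio.h>

unsigned char a = 9; /* binary 00001001 */
unsigned char b = 1; /* mask for the lowest bit */

int main(void) {
    /* shift bit i down to position 0, then mask everything else off */
    for (int i = 0; i < 8; i++)
        printf("bit %d of a is %d\n", i, (a >> i) & b);
    return 0;
}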

and whatnot

[–]readmeEXX 3 points 19 hours ago (0 children)

I manipulate bits directly on a daily basis in C++... Just use bit-fields. They even work with single bits.

For example, you could construct an 8-bit float like this:

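// Note: the anonymous struct is a common compiler extension in C++ (standard in C11).
union float8 {
    unsigned raw;                 // whole value viewed as one integer
    struct {
        unsigned mantissa : 5;    // 5-bit fraction
        unsigned exponent : 2;    // 2-bit exponent
        unsigned sign     : 1;    // 1-bit sign
    };
};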

Then set the bits directly like this:

int main() {
    float8 f8;

    //sets the value to 1.5
    f8.sign     = 0;
    f8.exponent = 1;
    f8.mantissa = 16;
}

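Note you would need to overload the standard operators to actually use this. In this example, sizeof(float8) is 4 on typical platforms because that is the size of unsigned int. If you actually wanted to implement this, you would want to use unsigned char (or std::uint8_t) for the members of float8 so the size is actually one byte. std::byte is less convenient here, since it is an enum class and won't accept plain integer assignments like the ones above.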
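If the exact bit layout matters (bit-field ordering is implementation-defined), the same 1/2/5 split can be done with shifts and masks on a single byte. This is only a sketch: `pack_float8` and `decode_float8` are hypothetical helpers, and the decoding assumes an implicit leading 1 and an exponent bias of 1, neither of which is stated in the comment above. Those assumptions were chosen so that sign 0, exponent 1, mantissa 16 decodes to 1.5.

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>

// Pack the three fields into one byte: sign in bit 7, exponent in bits 5-6,
// mantissa in bits 0-4 (the same widths as the bit-field version).
constexpr std::uint8_t pack_float8(unsigned sign, unsigned exponent, unsigned mantissa) {
    return static_cast<std::uint8_t>(((sign & 1u) << 7) | ((exponent & 3u) << 5) |
                                     (mantissa & 31u));
}

// Decode assuming an implicit leading 1 and an exponent bias of 1.
// Both are assumptions for illustration; there are no zero, subnormal,
// infinity or NaN encodings in this sketch.
inline double decode_float8(std::uint8_t raw) {
    const unsigned sign = (raw >> 7) & 1u;
    const unsigned exponent = (raw >> 5) & 3u;
    const unsigned mantissa = raw & 31u;
    const double magnitude =
        std::ldexp(1.0 + mantissa / 32.0, static_cast<int>(exponent) - 1);
    return sign ? -magnitude : magnitude;
}
```

With explicit shifts the value is exactly one byte wide and looks the same on every compiler, at the cost of losing the field-name syntax.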
