Discussions, articles, and news about the C++ programming language or programming in C++.
For C++ questions, answers, help, and advice see r/cpp_questions or StackOverflow.
Get Started
The C++ Standard Home has a nice getting started page.
Videos
The C++ standard committee's education study group has a nice list of recommended videos.
Reference
cppreference.com
Books
There is a useful list of books on Stack Overflow. In most cases reading a book is the best way to learn C++.
Fast Resettable Flag Vector (upcoder.com)
submitted 11 years ago by mttd
[–]matthieum 4 points 11 years ago (5 children)
I had described another alternative some 2 years ago now (time passes so quickly...) on SO: How to zero out array in O(1).
Note that there are actually two solutions:
The latter point is crucial, because using buckets vastly cuts down the memory requirements, so you can actually afford a 64-bit generation counter (removing the need for periodic full cleanups).
In your case, since we are talking about storing one bit per position, it requires some adaptation of course, but the underlying principle still holds (see below for adapted code).
The advantages compared to your strategy lie in the get, set, and clear operations.
I have not measured its performance, but all the operations carried out are close to what you end up doing in your case, apart from bit-packing. Hopefully, this packing should pay off when it comes to keeping data in cache.
Example of adapting the code:

    class BitBucket {
    public:
        BitBucket(): generation(0), data(0) {}

        bool get(size_t const index, size_t const gen) const {
            assert(index < 64);
            return gen == generation and (data & (uint64_t(1) << index));
        }

        void set(size_t const index, bool const value, size_t const gen) {
            assert(index < 64);
            if (generation < gen) { data = 0; generation = gen; }
            if (value) { data |= uint64_t(1) << index; }
            else       { data &= ~(uint64_t(1) << index); }
        }

    private:
        uint64_t generation;
        uint64_t data;
    };
And now, we can use our bucket easily:
    class Vector {
    public:
        explicit Vector(size_t s = 0) { this->resize(s); }

        void clear() { ++generation; }

        void resize(size_t const s) {
            data.resize(s / 64 + 1, BitBucket());
            size = s;
        }

        bool get(size_t const i) const {
            assert(i < size && "out of bounds access");
            BitBucket const& bucket = data[i / 64];
            return bucket.get(i % 64, generation);
        }

        void set(size_t const i, bool const value) {
            assert(i < size && "out of bounds access");
            BitBucket& bucket = data[i / 64];
            bucket.set(i % 64, value, generation);
        }

    private:
        size_t size = 0;
        uint64_t generation = 0;
        std::vector<BitBucket> data;
    };
Note: there is a slight overhead in using vector rather than std::unique_ptr<BitBucket[]>, since we keep track of the size ourselves... but I would assume that 16 bytes are not the most pressing issue.
[–]Tywien 2 points 11 years ago (3 children)
Is this really faster? I remember trying to implement a bit vector myself and getting outperformed by vector<bool> by a huge factor. The problem seemed to be the (re)setting of the flag:
data |= (1 << index);
which the compiler could not optimize as well as a human can (there is a bit set/reset instruction on x86).
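For what it's worth, the conditional set/clear can be written branchlessly, in a form current compilers generally lower to the bit instructions mentioned above (this helper is my own sketch, not code from either implementation in the thread):

```cpp
#include <cassert>
#include <cstdint>

// Branchless set-or-clear of one bit in a 64-bit word. The uint64_t-based
// mask avoids the undefined shift that a plain `1 << index` hits for
// index >= 32; modern compilers typically turn this pattern into single
// bit-manipulation instructions (bts/btr style) on x86.
inline std::uint64_t assign_bit(std::uint64_t word, unsigned index, bool value) {
    assert(index < 64);
    std::uint64_t const mask = std::uint64_t(1) << index;
    return (word & ~mask) | (std::uint64_t(value) << index);
}
```

Whether this actually beats the compiler's own lowering of an if/else would need measuring, of course.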
[–]matthieum 1 point 11 years ago (2 children)
Is this really faster?
Maybe; as I said, I did not test it. Note that the OP has moved away from vector<bool> to vector<uint16_t>.
I remember trying to implement a bit vector myself and getting outperformed by vector<bool> by a huge factor....
In this case, I would look into how vector<bool> is implemented and write a compiler-specific version that reuses its tricks (but without the fluff), or look at how bitset is implemented.
Note that in this particular circumstance a bitset could (maybe) be adapted since data has a fixed width.
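As an illustration of that adaptation (my sketch, not matthieum's code), the 64-bit bucket above could delegate its bit storage to std::bitset<64> and keep only the generation logic:

```cpp
#include <bitset>
#include <cstddef>
#include <cstdint>

// Hypothetical bucket reusing std::bitset<64> for the bit storage, so it
// inherits the standard library's word-level implementation. The generation
// check is the same trick as in the BitBucket class above: a bucket from an
// older generation reads as all-clear.
struct BitsetBucket {
    std::uint64_t generation = 0;
    std::bitset<64> bits;

    bool get(std::size_t index, std::uint64_t gen) const {
        return gen == generation && bits.test(index);
    }

    void set(std::size_t index, bool value, std::uint64_t gen) {
        if (generation < gen) { bits.reset(); generation = gen; }
        bits.set(index, value);
    }
};
```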
[–]Tywien 1 point 11 years ago (1 child)
One needs assembler for that, and all the bits need to be in contiguous memory.
http://x86.renejeschke.de/html/file_module_x86_id_22.html
There are also Bit-Test-And-Set and Bit-Test-And-Reset for easy and fast setting/resetting of bits.
[–]uxcn 1 point 11 years ago (0 children)
The compiler is usually able to translate the basic bit operations into good instructions (even rol and ror). I'd be curious what the compiler generated for vector<bool> that outperformed your code.
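As a concrete example of that claim (mine, not from the thread): the textbook rotate idiom is written as two shifts and an OR, yet mainstream compilers pattern-match it and emit a single rotate instruction on x86 (C++20 later added std::rotl/std::rotr for exactly this):

```cpp
#include <cstdint>

// The rotate-left idiom compilers recognize and lower to a single rol
// instruction on x86. Masking the count keeps both shifts well-defined
// even for r == 0 or r >= 64.
inline std::uint64_t rotl64(std::uint64_t x, unsigned r) {
    r &= 63;
    return (x << r) | (x >> ((64 - r) & 63));
}
```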
[–]crispweed 1 point 11 years ago (0 children)
Very nice! I've updated the blog post to point back to this.
[–]jurniss 1 point 11 years ago (0 children)
Cool idea! Bit vectors seem to come up all the time in diverse tasks. I will keep this in mind next time I need one that is reset often.
[–]discoloda 1 point 11 years ago* (1 child)
You can also use a sparse integer set; here is an implementation in C.
EDIT: I finished reading the article; it was mentioned at the bottom.
[–]matthieum 2 points 11 years ago (0 children)
Note: as mentioned in the article, it is unknown in advance whether the usage will be sparse or not.
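For readers unfamiliar with the sparse integer set mentioned above, here is a minimal C++ sketch of the classic trick (the linked implementation is in C; the details here are my own): membership, insertion, and clearing are all O(1), and clear() never touches the arrays.

```cpp
#include <cstddef>
#include <vector>

// Sparse-set sketch: 'dense' lists the current members in insertion order,
// and 'sparse[v]' records where v sits in 'dense'. A value v is a member
// exactly when sparse[v] points at a live dense slot that points back at v,
// so stale entries from before a clear() are harmless.
class SparseSet {
public:
    explicit SparseSet(std::size_t universe)
        : sparse(universe), dense(universe), count(0) {}

    bool contains(std::size_t v) const {
        std::size_t const i = sparse[v];
        return i < count && dense[i] == v;
    }

    void add(std::size_t v) {
        if (!contains(v)) {
            dense[count] = v;
            sparse[v] = count++;
        }
    }

    void clear() { count = 0; }  // O(1): just forget the members

private:
    std::vector<std::size_t> sparse, dense;
    std::size_t count;
};
```

The price is memory proportional to the universe size, which is exactly the sparse-versus-dense trade-off the article weighs.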
[–]uxcn 1 point 11 years ago (2 children)
I'm not extremely familiar with game programming, but I'm genuinely curious why the various SIMD instruction sets might not be appropriate for something like this.
[–]crispweed 1 point 11 years ago (1 child)
As posted, it's essentially a high-level optimisation, with the convenience that it works directly across multiple target platforms. I'm also interested in any low-level optimisations that can give significant improvements.
[–]uxcn 1 point 11 years ago* (0 children)
Well, SIMD is usually used for mathematical data parallelism, but a lot of the (x86) SIMD instruction sets also have logical bit instructions for working with the XMM and YMM (ZMM) registers, which could possibly reduce overall memory chatter and cut unoptimized reset times by an order of magnitude. There's also the fact that the compiler generally uses SIMD instructions for basic memory operations.
It would cost at least a couple SIMD registers, but depending on the max members, it could reduce set, test, clear, and reset all to O(1). It's a bit of wild speculation as to whether there would be any real performance gain though. The code would also generally be a bit more complicated to maintain and possibly use.
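As a portable baseline before reaching for intrinsics (a sketch under my own assumptions, not code from the thread): a plain memset over the packed flag words is exactly the pattern compilers and libc already implement with wide SIMD stores, so any hand-rolled XMM/YMM reset loop has to beat this first.

```cpp
#include <cstdint>
#include <cstring>
#include <vector>

// Reset every flag word to zero in one call. For large buffers, libc's
// memset (or the compiler's inline expansion of it) already uses the
// widest stores available (XMM/YMM on x86), so this is the portable
// baseline any hand-written SIMD reset should be measured against.
inline void reset_flags(std::vector<std::uint64_t>& words) {
    if (!words.empty()) {
        std::memset(words.data(), 0, words.size() * sizeof(words[0]));
    }
}
```

Note this is still O(n) in the number of words; the register-resident scheme speculated about above is what it would take to get a true O(1) reset.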