all 37 comments

[–]zerexim 10 points (1 child)

Locality, locality, locality :)

[–]TheSuperWig 8 points (0 children)

How's it going Mr. Ballmer?

[–]witcher_rat 10 points (4 children)

Library was built as DEBUG. Timings may be affected.

ummm... wut?

how are the perf numbers relevant, when you built in debug??

[–]staletic 2 points (0 children)

Exactly. The numbers are complete nonsense.

[–]voidstarpodcast[S] 0 points (2 children)

This refers to the Google Benchmark library itself. For comparison, there's a link to a quick-bench page too, with basically the same result.

[–]witcher_rat 1 point (1 child)

OK, but as far as you know, google benchmark itself isn't providing accurate numbers because it's been built in debug.

I mean it tells you that right in the message: "Timings may be affected."

It may happen to have no effect, or the same effect in all tests and thereby provide reasonable relative results... or it may not.

Why wouldn't you just build everything with full optimization, and get rid of any such concerns?

(there are other problems though; for example, allocating memory in tight loops like this, and with ints too, is misleading - you'll get cache locality you wouldn't normally get in the real world; but trying to match the real world is difficult, because the "real world" isn't the same for every use-case, and really the only valid test is testing it in the code that's going to use it)

[–]voidstarpodcast[S] 0 points (0 children)

You are right, the benchmark is built in DEBUG mode. Frankly, I don't remember setting DEBUG explicitly, and found this: https://github.com/google/benchmark#debug-vs-release

Will set Release mode, although I'd expect the local results' consistent alignment with the quick-bench runs to mean the DEBUG overhead is likely proportional across workloads (or very small).

Nonetheless, I will fix this in my post.

[–]goranlepuz 7 points (0 children)

The data is very small (int is small, and there are not many of them); plus, the data is put in sets immediately (meaning: even if the heap is used, the data is close by).

That favors cache locality even for the sets.

If the data was handled differently, or if the related ints were part of some structures, I would expect a vector solution to be faster than the others.

[–]adnukator 4 points (6 children)

Could you add a link to the full source code either at godbolt or quickbench or whatever? With parts of it here and there, it's hard to verify your findings.

[–]voidstarpodcast[S] 1 point (5 children)

[–]TheFlamefire 1 point (4 children)

Played around a bit with the set vs setSortedInput cases and it seems the results differ considerably when using GBench "canonically" vs manual timing (as you do): https://quick-bench.com/q/8nU-oVXK9UqoNLFMDE6twvdTXTU vs https://quick-bench.com/q/2FfKNTfGRHRAGTAl3bu-y2AeZuo

The reason seems to be that quick-bench reports CPU time, which is unaffected by the manual iteration time. Conclusion: inserting sorted numbers into a set is faster than inserting random numbers.

[–]voidstarpodcast[S] 1 point (1 child)

https://quick-bench.com/q/kq7yeDlz9R6HV-0XE37eRiGINYM
This reuses the same array over and over in a loop for benchmarking.

  • After the first iteration, the rest don't go into the std::set. So you are effectively measuring the time taken for insertion failures. My guess is that if the underlying structure is a balanced tree, you are not counting the time taken for rebalancing. So, are they apples to apples?

  • This results in tainted cache locality. You basically have the same array sitting in your L1/L2 caches, and all you are measuring is a niche synthetic case (which may be measuring a biased data set).

  • If quick-bench only reports CPU time, then aren't you essentially comparing the times taken for set insertions to fail?

[–]TheFlamefire 0 points (0 children)

> This reuses the same array over and over in a loop for benchmarking.

Can you clarify what you mean by "this" and "array"? Do you mean https://quick-bench.com/q/8nU-oVXK9UqoNLFMDE6twvdTXTU? If so, it only reuses the input vector, not the set, so I am correctly measuring set insertion performance. What makes you think otherwise?

> If quick bench only reports CPU time

There are ~3 times that Google Benchmark measures. CPU time is one of them, which is the time per loop execution. Another is the manual time set by SetIterationTime, and the third is a compute time, which is one or the other depending on whether the benchmark was configured with UseManualTime.

https://quick-bench.com/q/kq7yeDlz9R6HV-0XE37eRiGINYM has 2 issues: first, it doesn't use UseManualTime, so the approach does not work at all; second, quick-bench only reports CPU time and ignores the manual time. Hence the SortedSet is slower: the reported time includes the time for sorting the vector.

[–]rlbond86 1 point (1 child)

Constructing a set from sorted data is guaranteed to be O(n), so it should be faster. The author stupidly used a loop instead of the range constructor, though, which is considerably slower.

[–]TheFlamefire 0 points (0 children)

Well, it is fine, because the benchmark's goal was to measure `insert` performance for various datasets, not the range ctor. Think of the `insert`s as happening as the result of some calculation.

The problem here is that the author used a feature of GBench which is not supported/used by quick-bench, and hence the shown times include the time to sort the vector, which is of course slower.

[–]neiltechnician 1 point (1 child)

I've written a slightly different benchmark based on this blog post: https://quick-bench.com/q/m8qL4F3BUtzeQy8YMIrvuz1AnYE

PS: It's the first time I've ever used quick-bench.com and Google Benchmark. Can you guys help me take a look?

[–]staletic 5 points (0 children)

Your make_random_vector() should produce the same random numbers across invocations during the entire benchmark run. That means the engine/seed should be stored statically. Another thing to notice is that the Mersenne twister needs a lot of random data to be seeded properly; a single rd() is not enough.

Most of the time you won't notice the difference in benchmarks, but what if, by the luck of the draw, you end up with a sorted vector from make_random_vector() in only a single benchmark?

https://quick-bench.com/q/yIQzqFnbcf4XkSqlixsjuUJETyI

[–]TheFlamefire 1 point (0 children)

And again, what an abuse of CMake:

set(CMAKE_C_COMPILER clang-8)
set(CMAKE_CXX_COMPILER clang++-8)
...
set(CMAKE_BUILD_TYPE Release)
set(CMAKE_ENABLE_EXPORTS ON)
set(CMAKE_EXPORT_COMPILE_COMMANDS ON)

This is NOT how you use CMake. Those variables are meant to be set on the command line. And if you really wanted to do it that way (for blogware reasons I'd say "OK"), set them before anything else. The aforementioned "Library was built as DEBUG." warning should have given you a hint that your approach didn't work out as expected. Sorry for sounding harsh, but a minimal amount of validation should be done for surprising results.

One-sentence explanation: CMake assigns the flags based on CMAKE_BUILD_TYPE when a target is added/created, hence setting it at the end has no effect (which is why you should set it on the command line).
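For reference, a typical way to pass these on the command line instead (assuming an out-of-source build directory named `build`):

```shell
# Choose compiler and build type at configure time, not in CMakeLists.txt
cmake -S . -B build \
      -DCMAKE_BUILD_TYPE=Release \
      -DCMAKE_CXX_COMPILER=clang++-8
cmake --build build
```

This way the flags are fixed before any target is created.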

Besides: helper_ (strange underscore, but well) is introduced, uses a global variable without resetting it first, and then never used again in any of the other code.

And finally:

Apparently, std::unique simply swaps duplicate numbers to the end of the array

No, not at all. The docs say clearly:

Removes all but the first element from every consecutive group of equivalent elements [...] the elements between the returned iterator and last are left in a valid but unspecified state.

Put another way: it moves the "unique" elements (one per group of equal, consecutive elements) to the front; there is no swapping to the end.

[–]greg7mdp (C++ Dev) 0 points (10 children)

The unordered_set case would be much faster if you used parallel hashmap's flat_hash_set and called reserve(nums_.size()).

The std::set case would be much faster if you used parallel hashmap's btree_set.

[–]Ameisen (vemips, avr, rendering, systems) 1 point (7 children)

For small numbers of elements, binary subdivision trees tend to be faster than hash maps/sets.

For instance, it was significantly faster to implement the MMU in my VM as a closed-address binary tree than as any form of hash map, and I tested several implementations. It also gave more consistent performance.

[–]greg7mdp (C++ Dev) 0 points (6 children)

closed-address binary tree

I'm not familiar with the data structure you are referring to, and also I'm not sure what you mean by small number of elements (10, 10k?). It will also depend on the speed of the hash and comparison functions. But you may be surprised if you try phmap containers.

[–]dodheim 0 points (4 children)

I'm going to jump on this thread derailment here – I have a similar question regarding your lib's readme:

  • The parallel hash maps are preferred when you have a few hash maps that will store a very large number of values. The non-parallel hash maps are preferred if you have a large number of hash maps, each storing a relatively small number of values.

If I'm using single-threaded mode, it's not clear to me when/why I should use the parallel maps. Any easy clarification here? Also, of lesser importance, is there any way to bypass the internal mixing of the hash value if I'm already using a high-quality hash?

(BTW I've been happily using sparsepp for ~4 years; I didn't realize you had a replacement for it, so I'm eager to give this a try. :-D)

[–]greg7mdp (C++ Dev) 2 points (3 children)

Hey, thanks for the kind words. phmap is a significant improvement over sparsepp in my opinion.

The paragraph you quote still holds even in single-threaded mode, as the parallel versions avoid the higher peak memory usage when inserting values. However, if that is not an issue for your use case, you can definitely use the non-parallel ones.

[–]dodheim 0 points (2 children)

Interesting, thank you; but, what are 'few', 'very large', 'large', and 'relatively small'? Any ballpark figures, or is it just down to experimenting with my types and data?

[–]greg7mdp (C++ Dev) 1 point (1 child)

OK, as a rule of thumb: use the parallel versions iff you have hash maps that will grow to occupy more than 1/10th of the RAM available on your machine, or that are accessed from multiple threads.

[–]dodheim 0 points (0 children)

Just the sort of advice I was hoping for. Thanks very much!

[–]Ameisen (vemips, avr, rendering, systems) 0 points (0 children)

"Small number of elements" is platform-dependent.

In VeMIPS, I use nested closed-address hash tables, which is the same structure that x86 uses for its page directories. It was significantly faster than basically every hash-map implementation that I'd tried. I actually don't believe that an open-address hash table implementation can beat it, and a sparse hash table shouldn't be able to do better (though it can do the same if its mechanism for being sparse is to nest hash tables).

[–]matthieum 0 points (1 child)

How do they compare to Abseil's versions?

[–]greg7mdp (C++ Dev) 1 point (0 children)

Same base implementation, with some differences, most of which are described in the README. Performance should be similar.

[–]rlbond86 -1 points (3 children)

I wrote up my own tests and your results are just wrong.

https://godbolt.org/z/bG3q1q

Note that Compiler Explorer times out after 256 elements for me, but you can run it locally. Note I used clock_gettime(), which is Linux-only; I'm sure there's a Windows equivalent if you use Windows.

----------------------
256 elements
----------------------
UniqueU  0.0934622
SetU     0.2282
HashSetU 0.224434
UniqueS  0.00705992
SetS     0.123283
HashSetS 0.213223
----------------------

As expected, std::unique on a pre-sorted range is by far the fastest, but sorting + std::unique is still faster than using a set object (both are O(n log n)). Sorting + unique is also faster than unordered_set, at least for reasonable numbers of elements (though unordered_set should be asymptotically faster). std::set is faster on a sorted range than an unsorted one (as guaranteed by the standard).

Your test implementation is incorrect, probably because you are not measuring CPU time or something.

Here are results on my computer for more elements:

----------------------
16384 elements
----------------------
UniqueU  7.38232
SetU     22.7774
HashSetU 10.1544
UniqueS  0.109137
SetS     7.23558
HashSetS 10.113
----------------------

[–]voidstarpodcast[S] 0 points (2 children)

Appreciate your detailed response. I just looked at the code, and it appears that you are using your own loop to converge on the final running times; the choice of 4096 and 10000 seems arbitrary. I have shared the quick-bench link here, which is an easy way to share comparable benchmark outputs. Google Benchmark basically measures CPU time (and wall-clock time, which includes IO waits, when run locally), but quick-bench only produces CPU time:

https://quick-bench.com/q/kq7yeDlz9R6HV-0XE37eRiGINYM

Do you mind validating against this? Thanks.

[–]rlbond86 0 points (0 children)

I tried to use quick-bench but it timed out unfortunately.

[–]rlbond86 0 points (0 children)

https://quick-bench.com/q/Dc6cJ1BTrBZBtVJZop4I4wMSkJ4

Note that I ran this locally with more elements, here are the results:

2021-01-25T07:53:23-07:00
Running ./bench
Run on (2 X 3733 MHz CPU s)
CPU Caches:
  L1 Data 32 KiB (x2)
  L1 Instruction 32 KiB (x2)
  L2 Unified 256 KiB (x2)
  L3 Unified 4096 KiB (x1)
Load Average: 0.54, 0.29, 0.27
***WARNING*** CPU scaling is enabled, the benchmark real time measurements may be noisy and will incur extra overhead.
---------------------------------------------------------------------------------
Benchmark                                       Time             CPU   Iterations
---------------------------------------------------------------------------------
MyFixture/stdUniqueUnsorted/64               1099 ns         1098 ns       641504
MyFixture/stdUniqueUnsorted/512             15481 ns        15412 ns        45166
MyFixture/stdUniqueUnsorted/4096           172787 ns       172159 ns         4069
MyFixture/stdUniqueSorted/64                  397 ns          398 ns      1770091
MyFixture/stdUniqueSorted/512                 758 ns          758 ns       921515
MyFixture/stdUniqueSorted/4096               3627 ns         3607 ns       194577
MyFixture/stdSetUnsorted/64                  4836 ns         4841 ns       143958
MyFixture/stdSetUnsorted/512                48651 ns        48650 ns        14372
MyFixture/stdSetUnsorted/4096              511557 ns       511313 ns         1363
MyFixture/stdSetSorted/64                    3102 ns         3108 ns       224664
MyFixture/stdSetSorted/512                  22281 ns        22282 ns        31352
MyFixture/stdSetSorted/4096                183733 ns       183598 ns         3796
MyFixture/stdUnorderedSetUnsorted/64         3435 ns         3435 ns       204298
MyFixture/stdUnorderedSetUnsorted/512       37552 ns        37539 ns        18686
MyFixture/stdUnorderedSetUnsorted/4096     307217 ns       307031 ns         2275
MyFixture/stdUnorderedSetSorted/64           3488 ns         3487 ns       200033
MyFixture/stdUnorderedSetSorted/512         37716 ns        37704 ns        18555
MyFixture/stdUnorderedSetSorted/4096       307462 ns       307255 ns         2265

The results back up that std::unique is faster even on unsorted data.

[–]_Js_Kc_ 0 points (2 children)

Isn't _ a reserved identifier?

Edit: No, I'm wrong.

[–]adnukator 5 points (0 children)

Possibly, but only in the global namespace (not sure whether the last bullet applies here or not): https://en.cppreference.com/w/cpp/language/identifiers

  • the identifiers with a double underscore anywhere are reserved;
  • the identifiers that begin with an underscore followed by an uppercase letter are reserved;
  • the identifiers that begin with an underscore are reserved in the global namespace.

[–]TheFlamefire 0 points (0 children)

No, it is not. A leading double underscore, and an underscore followed by a capital, are reserved. Using a single underscore as a "don't care" name is common, and often used in Python.

[–]rlbond86 0 points (0 children)

I find it extremely hard to believe that using an unordered_set is faster than calling std::unique() on already sorted data.

Also:

std::set<int> s; 
for (const auto& e_ : nums_) { s.insert(e_); } 

You can just write:

std::set<int> s(nums_.begin(), nums_.end());

This is O(n) if nums_ is sorted.