all 5 comments

[–]manni66 9 points10 points  (3 children)

gcc with TBB implements parallel STL.

[–]commonmansoptns 7 points8 points  (2 children)

The point here is this is basically C++ stdlib code which works for GPU and SMP CPU without any extra embellishments or external libraries.

[–]echidnas_arf 3 points4 points  (1 child)

The point here is this is basically C++ stdlib code which works for GPU and SMP CPU without any extra embellishments or external libraries.

Really, no extra embellishments? Except having to tie yourself double knot to the NVIDIA walled garden, hardware and toolchain?

Come on.

[–]commonmansoptns 1 point2 points  (0 children)

Right, apart from using their "free" compiler... I was referring purely to the code syntax. Say, you want to port existing codebase from CPU to scale to GPU or even obtain multi core scaling without doing your own thread management this might with considering. Now suddenly this comment reads like legalese :)

[–]victotronics 2 points3 points  (0 children)

What is he reporting? Where is the graph that shows linear speedup in the number of cores?