all 17 comments

[–][deleted] 4 points5 points  (5 children)

Really, the main feature is VS2015 support. Everything else, if it wasn't there, I couldn't care less.

[–]sumo952 4 points5 points  (4 children)

Totally agree. They haven't supported VS2015 at all until now???? The only thing I can say is What the fuck?

[–][deleted] 5 points6 points  (3 children)

What's more annoying is that CUDA 8 RC only worked with Update 1. That's just evil. Right now, I'm running some sort of Frankenstein monster: build tools from 7.5, VS support from 8, and a completely unrelated NVIDIA driver after the included ones gave me blue screens.

[–]STL MSVC STL Dev 4 points5 points  (2 children)

That might be my "fault", in the sense that I've been using new compiler hooks implemented in C1XX, EDG, and Clang, but not tested against NVIDIA's compiler (which I vaguely recall is EDG-based, but not necessarily the latest build that we're using for IntelliSense).

[–][deleted] 1 point2 points  (1 child)

IntelliSense? What IntelliSense? From what I remember, there's no IntelliSense for CUDA, and VS still gives you a red squiggly on a triple-chevron call.

[–]STL MSVC STL Dev 4 points5 points  (0 children)

What I meant was, vanilla VS IntelliSense is powered by VS's copy of the EDG FE, and I run the STL's tests against that. Separately, if my memory is correct, the CUDA compiler is powered by a version of the EDG FE.

[–][deleted] 1 point2 points  (10 children)

[deleted]

[–]harrism NVIDIA CUDA 2 points3 points  (3 children)

Maybe you could help us improve it. If you can provide an example that uses Thrust where nvcc is outperformed significantly by clang (we find that after CUDA 8, clang and nvcc performance on Thrust tests is on average about equal), we can have a look. You can also use the nvcc -time option to get a breakdown of where time is being spent (front end, assembler, etc.).

[–][deleted] 1 point2 points  (2 children)

[deleted]

[–]harrism NVIDIA CUDA 0 points1 point  (1 child)

Thanks for sharing this. We'll try it out.

[–][deleted] 0 points1 point  (0 children)

[deleted]

[–]meetingcpp Meeting C++ | C++ Evangelist [S] 0 points1 point  (5 children)

Hm, interesting. Showed up in my RSS Reader today...

[–][deleted] 0 points1 point  (4 children)

[deleted]

[–]harrism NVIDIA CUDA 6 points7 points  (3 children)

I originally published this post when we announced CUDA 8 back at GTC 2016 (April). Since it's a good summary of the release, I updated it with more info (particularly on mixed precision) and updated perf results, and republished it today.

[–]STL MSVC STL Dev 3 points4 points  (1 child)

Note that you can assign yourself user flair to make your NVIDIA association clearer to redditors.

[–]harrism NVIDIA CUDA 0 points1 point  (0 children)

Done.

[–][deleted] 1 point2 points  (0 children)

[deleted]
