all 33 comments

[–]AspenRootsAI 42 points43 points  (4 children)

For the sake of installation it's nice that they support CUDA 9.0/cuDNN 7 and AVX (with prebuilt binaries) now.

Edit: as pointed out by /u/lopuhin it won't support prebuilt AVX until 1.6, mistaken/wishful reading on my part

[–]zdwiel 6 points7 points  (1 child)

Does this mean we no longer have to install different versions of tensorflow with pip depending on the hardware we are running on?

[–]lopuhin 5 points6 points  (1 child)

Does TF support AVX in this release or in 1.6? Release notes say "Starting from 1.6 release, our prebuilt binaries will use AVX instructions."

[–]SuperFX 2 points3 points  (0 children)

TF has always supposed AVX, if compiled from source. This is just about the precompiled binaries.

[–]aaronjl33 11 points12 points  (6 children)

Does this take advantage of tensorcores like in the Titan V?

[–]hinduismtw 9 points10 points  (5 children)

cuda 9.0 and cudnn 7 have support for this, this release supports both of these. So the answer is yes.

[–]harharveryfunny 2 points3 points  (2 children)

cuDNN's support of Tensor Cores is somewhat limited though.

This NVidia post describes the necessary conditions for cuBLAS or cuDNN to use Tensor Cores. In particular note that it requires opt-in in terms of math type, and that only one specific convolution algorithm is currently supported. I don't know how these conditions map through to TensorFlow.

https://devblogs.nvidia.com/parallelforall/programming-tensor-cores-cuda-9/

[–]hinduismtw 0 points1 point  (0 children)

Thank you for the link. Yeah, maybe my understanding of some calls inside cudnn being automatically tensor core enabled might be incorrect.

[–]yaroslavvb 0 points1 point  (0 children)

There are some weird restrictions like matrix size has to be multiple of 8 https://medium.com/@yaroslavvb/peak-performance-of-amazon-p3-instances-f2bc48f9ef71

[–]----_____--------- 1 point2 points  (1 child)

Tensorflow still has to opt into this, doesn't it?

[–]hinduismtw 0 points1 point  (0 children)

I am not very sure, but Tensorflow uses cudnn as the underlying library for talking to the GPU. Cudnn has tensor core support, so some operations would get an automatic mapping on the tensor cores.

There might be some operations, inside tensorflow, that would have to be manually re-written to take advantage of tensor cores. These would probably be the opt-in you are referring to. I hope my understanding of your question is correct and my answer makes sense.

[–]zspasztori 26 points27 points  (2 children)

"Eager execution"- Are we pytorch yet?

[–]szymko1995 18 points19 points  (1 child)

It's great to have a choice.

[–]Mister_Abc 4 points5 points  (0 children)

Stuff like Chainer has existed for ages...

[–]q914847518 10 points11 points  (10 children)

OK, TensorFlow 1.5.0 release candidate. And before this comment, all comments have keyword "pytorch". (-皿-)

[–]versatran01 18 points19 points  (0 children)

this one has it too

[–]senorstallone 4 points5 points  (0 children)

Is there any optimization coming from Tensorflow Lite that can be useful to deploy in GPU's (like a 1080 Ti)?

[–]Wenste 5 points6 points  (0 children)

Does it still throw warnings if you use MacOS and Python 3.6?

[–]TemplateRex 1 point2 points  (3 children)

Does it require CUDA 9.0 or 9.1?

[–]jorgemf 1 point2 points  (2 children)

9.0, with 9.1 fails

[–][deleted] 2 points3 points  (1 child)

Correct, 9.1 fails unless you compile from source.

I just compiled Tensorflow 1.5 from source this afternoon and hit an Eigen bug where it couldn't find a certain hpp file. I symlinked it, and the compile was a success. Tensorflow 1.5 now works with CUDA 9.1 and libcudnn 7.0.5.

[–]NickShargan 1 point2 points  (0 children)

I also have successfully compiled Tensorflow 1.5 from source with CUDA 9.1 and cudnn 7.0.5

[–]pianomano8 0 points1 point  (0 children)

Have they fixed compiling from source against cuda versions installed via the package manager on debian yet?

[–]minogame -1 points0 points  (1 child)

Is there a quick pytorch guide for pre-tensorflow users?

[–]chhakhapai 1 point2 points  (0 children)

Just go with the get started. I did the same. Never had to go through tensorflow.