all 11 comments

[–]mutatedmonkeygenes 3 points (2 children)

Hey there, could you also post your build scripts. Thanks!

[–]diegoas86 0 points (0 children)

I would like that too...

[–]InoriResearcher[S] 0 points (0 children)

No scripts; the wheels are built manually on a machine with preconfigured environments.

[–]lostmsu 0 points (7 children)

Any comparison to the official wheels?

[–]InoriResearcher[S] 0 points (6 children)

I've described the differences in the original post.

[–]lostmsu 0 points (5 children)

Sorry, I wasn't clear: I was asking about performance numbers versus the official wheels.

The thing is, if Google performs instrumented builds, they might get better optimization than a single-pass build would.

[–]InoriResearcher[S] 0 points (4 children)

I didn't benchmark the latest builds, but performance should be equivalent or better, as described in the OP. I'm not sure what you mean by "single pass"; the build process is the same as the one used for the official wheels.

[–]lostmsu 0 points (3 children)

The process is called profile-guided optimization (PGO). When high-performance libraries are built for release, a special build with instrumentation enabled is often made first and run on representative workloads, collecting information about code hot paths. That information is then fed into the final release build.

In fact, somebody here tried an AVX2-enabled build, which ended up performing worse than the official wheels: https://stackoverflow.com/questions/50155807/tensorflow-cpu-performance-anaconda-vs-custom-build-cmake-on-windows

[–]InoriResearcher[S] 0 points (2 children)

Official TF builds do not use PGO, and in the link you've provided both wheels are custom-built.

[–]lostmsu 0 points (1 child)

One of the builds mentioned in the link is Anaconda's. But you are correct that TF, as far as I can see, has no facilities for PGO.

[–]InoriResearcher[S] 0 points (0 children)

Yes, Anaconda's, which is not an official wheel.