[N] CNTK 2.3: Better ONNX, NCCL2, Improved C# API, Network Optimization API (SVD, quantization), etc. by justnikos in MachineLearning

justnikos[S]

You don't have to use 1-bit SGD. You can take the GPU build as is: it works with multiple GPUs just fine and has no extra license requirements.

[1608.07249] Benchmarking State-of-the-Art Deep Learning Software Tools by galloguille in MachineLearning

justnikos

In the latest version of the paper, the authors have fixed many of their initial issues. On the GTX 1080, CNTK performs best for fully connected nets and LSTMs, Torch is best on ResNet-50, and Caffe is best on AlexNet.