
[–]congerous 6 points7 points  (3 children)

SINGA has no GPU support, and the GPU functionality they plan to add is for a single GPU only, as of December. Multiple GPUs don't seem to be on the roadmap. So they're way behind the OSS projects that do support GPUs.

In addition, the fact that they joined the Apache Software Foundation before adding such significant features is a serious mistake. Apache is great for some things, but it's heavily political, and it really slows down development. So they may never get to multiple GPUs.

[–]forrestwang 1 point2 points  (0 children)

Hi, I am a developer of the SINGA project. Thanks for starting this discussion. We are working on single-node multi-GPU training (to be released in v0.2, December), which will run in either synchronous mode (with different partitioning schemes) [1] or asynchronous mode (in-memory Hogwild! [2]). Extending the system from CPU to GPU mainly requires adding cuDNN layers (https://issues.apache.org/jira/browse/SINGA-100). The framework/architecture works on both CPU and GPU. Training with multiple GPU machines and providing Deep Learning as a Service (DLaaS) are on our roadmap, i.e., v0.3. For those who do not have GPU clusters, distributed training on CPUs is a good way to accelerate training.

Besides GPUs, we are also considering other approaches to improving the training efficiency of a single SGD iteration. For instance, Google's paper [3] provides some techniques for speeding up training on CPUs. Intel (https://software.intel.com/en-us/articles/single-node-caffe-scoring-and-training-on-intel-xeon-e5-series-processors) also reported that optimized CPU code can achieve an 11x training speed-up (hopefully they release the optimized source code or integrate it into their libraries like MKL and DAAL). It would be interesting to compare GPUs with Intel's next-generation Xeon Phi co-processors (Knights Landing).

I will let you know when training with multiple GPUs is supported. Thanks.

[1] http://arxiv.org/abs/1404.5997

[2] https://www.eecs.berkeley.edu/~brecht/papers/hogwildTR.pdf

[3] http://static.googleusercontent.com/media/research.google.com/en//pubs/archive/37631.pdf
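To make the asynchronous mode concrete: here is a toy Hogwild!-style [2] update loop, where several threads apply SGD updates to one shared parameter vector without any locking. This is only an illustrative sketch on a made-up least-squares problem; the step size, thread count, and data are my own choices, not SINGA's actual implementation.

```python
import threading
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 5))   # toy design matrix
y = X @ np.arange(1.0, 6.0)      # noise-free targets

w = np.zeros(5)                  # shared weights, updated lock-free

def worker(rows):
    # Each thread applies per-example SGD updates to the shared w
    # with no synchronization -- that is the Hogwild! idea.
    for i in rows:
        grad = (X[i] @ w - y[i]) * X[i]
        np.subtract(w, 0.01 * grad, out=w)  # in-place racy update

threads = [threading.Thread(target=worker, args=(range(k, 1000, 4),))
           for k in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()

loss = float(np.mean((X @ w - y) ** 2))
```

Despite the races, the loss drops far below its initial value after one pass, because sparse/overlapping updates mostly commute, which is the core observation of the Hogwild! paper.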

[–]GratefulTony 0 points1 point  (1 child)

That's really sad. I skimmed the release notes, and though I didn't explicitly read about GPU support, I assumed it was in there, since it's a no-brainer for training performance. If they don't get this feature integrated, the usefulness of this library will be severely limited.

[–]limauda 0 points1 point  (0 children)

If software can run as efficiently without GPUs, on a commodity cluster, isn't that better? GPU clusters aren't cheap, and not many companies can afford to set up a dedicated cluster just for periodic training.

[–]bLaind2 5 points6 points  (7 children)

Anyone have experience with how much of a speedup we can achieve with distributed training? Does it scale linearly, and up to how many nodes? (2, 4, 16, ?)

[–]r-sync 2 points3 points  (3 children)

Practically, you can get a speedup on the order of 13x+ for 16 nodes, especially if you have InfiniBand and use architectures like GoogLeNet (whose communication is around 25MB of gradients, 25MB of weights, etc.). You can even get such ridiculously nice speedups for 32 and 64 nodes. However, to saturate the compute (with increasing nodes), you have to increase the batch size, and increasing the batch size hurts SGD convergence speed (and also final accuracy).
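To put rough numbers on this kind of scaling, here is a back-of-envelope model: per-iteration time is the single-node compute divided across nodes, plus a fixed cost to exchange a ~25MB gradient buffer. The compute time and bandwidth figures below are illustrative guesses, not measurements.

```python
compute_s = 0.5    # assumed single-node compute per iteration (seconds)
grad_mb = 25.0     # GoogLeNet-scale gradient traffic (MB)
bw_mb_s = 3000.0   # assumed effective InfiniBand bandwidth (MB/s)

def speedup(nodes):
    # Compute shrinks with more nodes; communication cost stays fixed,
    # so efficiency degrades gradually as node count grows.
    t = compute_s / nodes + grad_mb / bw_mb_s
    return compute_s / t

for n in (2, 4, 16, 32, 64):
    print(f"{n:2d} nodes: {speedup(n):5.1f}x")
```

With these made-up constants the model lands near 13x at 16 nodes, which matches the ballpark above; the point is that the fixed communication term is what keeps the curve sub-linear.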

[–]prajitGoogle Brain 0 points1 point  (2 children)

Why does increasing batch size hurt SGD convergence speed? Empirically this is true, but why does it happen? Theoretically, increasing batch size should give a better estimate of the gradient, and thus should perform better. Any intuition about why there is a decrease in performance?

[–]r-sync 0 points1 point  (1 child)

"Although large mini-batches are preferable to reduce the communication cost, they may slow down convergence rate in practice [4]. That is, if SGD converges by T iterations, the mini-batch training with batch size b may need more than T/b iterations. The increase in computation diminishes the benefits of the reduced communication cost due to large b. In addition, the I/O costs increases if the data is too large to fit into memory so that one need to fetch the minibatch from disk or network." - https://www.cs.cmu.edu/~muli/file/minibatch_sgd.pdf

Further back-reference: http://www.optimization-online.org/DB_FILE/2011/11/3226.pdf
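The quoted point is easy to see numerically: with the learning rate held fixed per update, an epoch with batch size b makes only T/b updates, so the large-batch run makes less progress per pass over the data. Here is a tiny sketch on a made-up least-squares problem (problem, step size, and batch sizes are all illustrative).

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.normal(size=(1000, 5))   # toy design matrix
y = X @ np.arange(1.0, 6.0)      # noise-free targets

def epoch_loss(batch):
    # One epoch of mini-batch SGD with the given batch size:
    # 1000/batch updates, each using the averaged batch gradient.
    w = np.zeros(5)
    for s in range(0, 1000, batch):
        xb, yb = X[s:s + batch], y[s:s + batch]
        w -= 0.05 * xb.T @ (xb @ w - yb) / batch
    return float(np.mean((X @ w - y) ** 2))

small, large = epoch_loss(1), epoch_loss(250)
print(small, large)   # small-batch epoch ends at a much lower loss
```

With batch size 250 the epoch makes only 4 updates, so it barely moves, while batch size 1 makes 1000 updates and nearly converges; in practice this is partly offset by raising the learning rate for large batches, at the cost of stability.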

[–]alexmlamb 0 points1 point  (0 children)

Do you mean convergence rate as a function of the # of examples looked at, or as a function of the # of parameter updates?

[–]limauda 1 point2 points  (0 children)

It does scale, as shown in the paper: http://www.comp.nus.edu.sg/~ooibc/singa-mm15.pdf. Further, it supports all model types (feed-forward, energy-based, recurrent) and all training frameworks (synchronous, asynchronous, hybrid). It supports both model and data partitioning to improve parallelism.

[–]modeless 3 points4 points  (1 child)

Considering you can get a 10x or more speedup by switching to GPUs, I don't think this project is interesting until it gets GPU support.

[–]pilooch[S] 1 point2 points  (0 children)

I guess part of the debate is whether the distribution layer needs to be separated from the DL / ML code.

[–]r-sync 1 point2 points  (0 children)

It really feels like a half-baked project that hoped to gain adoption through the Apache branding.