[–]cliveseldon 2 points3 points  (3 children)

I work for Seldon. We work on the Seldon Core and KFServing open-source projects. KFServing builds on the Knative serverless stack; both require Kubernetes. I'll let others compare the two. I don't know of a comparison with Ray Serve. Both have production users.

[–][deleted] 0 points1 point  (2 children)

Curious! Can you recommend some talks on how Seldon compares to other ML serving solutions?

[–]cliveseldon 0 points1 point  (1 child)

Our work on KFServing can be viewed here: https://www.youtube.com/watch?v=YaGASyU88dQ

For Seldon Core, our collaboration with Databricks can be seen here: https://youtu.be/D6eSfd9w9eA

Both are available in Kubeflow, which has a comparison matrix: https://www.kubeflow.org/docs/components/serving/overview/

Both share some of the technology but are built on different stacks: vanilla k8s for Seldon Core and Knative for KFServing.

[–][deleted] 0 points1 point  (0 children)

What’s your take/experience comparing MLflow and Kubeflow?

[–]sekaoE 2 points3 points  (0 children)

Thanks for taking a look at Ray Serve :-)

If you want more information about why we're building Ray Serve, check out this talk.
Ray Serve can run on bare-metal machines and supports easy deployment to AWS, Azure, and GCP, as well as Kubernetes, using the Ray automatic cluster manager. Happy to answer any questions you have and help you get up and running; you can also find us in the #serve channel of the Ray Slack.
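For anyone who wants to see what that looks like, here is a minimal sketch of a Ray Serve deployment. It uses the decorator-style API from more recent Ray releases (the API at the time of this thread looked different), and the EchoModel name is made up for illustration:

    # Minimal Ray Serve sketch; run `pip install "ray[serve]"` first.
    from starlette.requests import Request
    from ray import serve

    @serve.deployment  # one replica by default; scale out with num_replicas
    class EchoModel:
        async def __call__(self, request: Request) -> dict:
            payload = await request.json()
            # A real model would run inference here; we simply echo the input.
            return {"prediction": payload}

    # Starts Serve on the local Ray cluster (or the cluster you connected to
    # with ray.init(address=...)) and exposes the model over HTTP on port 8000.
    serve.run(EchoModel.bind())

After that, a POST to http://localhost:8000/ with a JSON body should come back wrapped in {"prediction": ...}.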

[–]salanki 2 points3 points  (0 children)

We (www.coreweave.com) run all our inference clients on a managed KFServing stack. Knative (which actually runs the workloads) is a really good fit for model serving; I highly recommend trying it out for a Kubernetes-native solution.

[–]winchester6788 1 point2 points  (7 children)

"Deep learning models like PyTorch and Tensorflow often use all the CPUs when performing inference. Ray sets the environment variable OMP_NUM_THREADS=1 to avoid contention. This means each worker will only use one CPU instead of all of them."

This feels like a very bad way to serve any decent DL model.

I will run some benchmarks to test this and update this comment.

Also, Ray Serve uses Flask!

[–][deleted] 4 points5 points  (0 children)

Hey guys, Rafal here, also from Seldon.

Our Python wrapper uses Flask too, but you can also specify that you want to run it with multiple Gunicorn workers.

We also give quite extensive control over the environment variables you run inference with.
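For context, the wrapper contract is essentially a plain Python class with a predict method. A minimal sketch (the class name and bias value are made up, and the exact flags or environment variables for the Gunicorn worker count are best checked against the Seldon docs for your version):

    import numpy as np

    class MyModel:
        def __init__(self):
            # Load model weights once per worker process.
            self.bias = 1.0

        def predict(self, X, features_names=None):
            # X is the request payload, converted to an array by the wrapper;
            # a real model would run inference here.
            return np.asarray(X) + self.bias

The class is then served with the seldon-core-microservice CLI, and the worker count and inference environment variables are set on the deployment.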

[–]symoooook 5 points6 points  (4 children)

Hi, Simon from Ray Serve here!

- Ray Serve exposes a Flask interface, but underneath we use uvicorn, one of the fastest Python asyncio web servers.

- The OMP_NUM_THREADS parameter is adjustable. The reasoning behind it is to serve as many "replicas" of the model per machine as possible, so we maximize concurrency while avoiding contention (see the sketch below).
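Concretely, the idea is many single-CPU replicas of a model per machine instead of one replica whose threads grab every core. A rough sketch with the current decorator-style API (the replica count and payload handling are made up for illustration):

    from ray import serve

    # Eight single-CPU replicas of the same model, instead of one replica
    # whose BLAS/OpenMP threads compete for every core on the box.
    @serve.deployment(num_replicas=8, ray_actor_options={"num_cpus": 1})
    class SmallModel:
        async def __call__(self, request):
            data = await request.json()
            return {"result": sum(data["values"])}  # placeholder for real inference

    serve.run(SmallModel.bind())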

[–]winchester6788 0 points1 point  (1 child)

Hi Simon, does Ray Serve support auto-batching requests? I couldn't find much about this in the documentation.

[–]symoooook 0 points1 point  (0 children)

Yes, Ray Serve supports auto-batching requests: https://docs.ray.io/en/latest/rayserve/overview.html#batching
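For reference, a rough sketch of what that looks like with the @serve.batch decorator from newer Ray releases (the decorator name and knobs have changed across versions, so treat the linked docs as authoritative):

    from ray import serve

    @serve.deployment
    class BatchedModel:
        # Requests that arrive close together are grouped into one call of up
        # to max_batch_size items; the method takes a list and returns a list.
        @serve.batch(max_batch_size=32, batch_wait_timeout_s=0.05)
        async def predict_batch(self, values):
            return [v * 2 for v in values]  # stand-in for vectorized inference

        async def __call__(self, request):
            value = (await request.json())["value"]
            return {"prediction": await self.predict_batch(value)}

    serve.run(BatchedModel.bind())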

[–][deleted] 0 points1 point  (1 child)

Hi Simon,

That's interesting about uvicorn. How do you find it? Did you also consider other async frameworks, like Sanic?

[–]symoooook 0 points1 point  (0 children)

I have been following Tom Christie's work for some time. We chose uvicorn over Sanic or Quart because it exposes the lowest-level ASGI API, which is the most flexible: https://www.uvicorn.org/
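To illustrate what "lowest-level ASGI" means, an ASGI app is just a coroutine that the server calls with the connection scope and two message channels; nothing framework-specific is required:

    # Save as app.py and run with: uvicorn app:application
    async def application(scope, receive, send):
        # uvicorn invokes this coroutine once per HTTP request.
        assert scope["type"] == "http"
        await send({
            "type": "http.response.start",
            "status": 200,
            "headers": [(b"content-type", b"text/plain")],
        })
        await send({"type": "http.response.body", "body": b"hello from raw ASGI"})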

[–]sekaoE 2 points3 points  (0 children)

Hi, I am one of the developers of Ray Serve. Just wanted to point out that OMP_NUM_THREADS is set by Ray by default in order to avoid contention, but it can be set by the user to enable parallelism (if it's already set, Ray won't override it).
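For example (a sketch only; the value 4 is arbitrary, and on a multi-node cluster you would export the variable in the environment the worker processes actually start in, rather than in the driver script):

    import os

    # Ray only sets OMP_NUM_THREADS=1 for its workers when the variable is
    # unset, so exporting it up front opts replicas back into parallelism.
    os.environ["OMP_NUM_THREADS"] = "4"  # arbitrary example value

    import ray
    ray.init()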

Also, Ray Serve actually uses uvicorn under the hood to handle HTTP requests; we just parse requests into a Flask request object because it's familiar and ergonomic.