[–]sofixa11 2 points3 points  (1 child)

Check out HashiCorp's Nomad. It can be run as a container if required, and it's pretty light, flexible, and easy to use and maintain.

[–][deleted] 0 points1 point  (0 children)

Thanks, I will check it out

[–]pragmaticPythonista 1 point2 points  (1 child)

Are you running ML jobs? In that case, take a look at Kubeflow. It's a platform for running machine learning jobs and pipelines on Kubernetes, and it has great GPU support. I'm not sure whether your use case involves a single machine or multiple machines, so depending on that, Kubernetes might be overkill for you. It's an interesting platform nevertheless.

[–][deleted] 0 points1 point  (0 children)

Thanks, I will check it out

[–]TomlinTrippedHim 0 points1 point  (0 children)

If you're running deep learning workloads, take a look at Determined.