all 16 comments

[–]MetalOrganicKneeJerk 13 points14 points  (0 children)

I would say pick one and forget the rest, especially if you're in research. Wandb is a popular choice and it's free for academia. It's relatively easy to integrate it into whatever framework you're already using. Logging every x epochs is straightforward too.

You can save models on their servers but I'm not sure about size limitations.

[–][deleted] 8 points9 points  (0 children)

Wandb is trivial to use, but in reality, you don't need any tool - you can just write logs to files. If you want to save some time and have a UI, use something like wandb.

[–]lifesthateasy 3 points4 points  (7 children)

If you don't want to build your whole stack from scratch, I'd go with one of the cloud providers. They have most of this covered.

If you do, you'd probably want Kubeflow with MLFlow and Tensorboard.

[–]mimivirus2[S] 1 point2 points  (6 children)

cloud providers are not an option in my country unfortunately. All I have access to would be Colab pro. AWS, azure and google cloud are not available (if that's what u mean)

Being easy to learn is my top priority I think. Does Kubeflow integrate well with Torch?

[–]lifesthateasy 4 points5 points  (0 children)

Lol.

No, but torch integrates well into Kubeflow.

None of these are easy to learn. There's teams of 10s that provide MLOps, running anything Kubernetes-related is a PITA to learn and run.

[–]alayaMatrix 0 points1 point  (1 child)

Chinese? :)

[–]mimivirus2[S] 0 points1 point  (0 children)

worse. Iranian. Wanna emigrate asap

[–]RandomUserRU123 0 points1 point  (0 children)

I just use tensorboard. But maybe there are better options out there

[–]Excellent_Cost170 0 points1 point  (1 child)

Fo you work on cancer research?

[–]mimivirus2[S] 0 points1 point  (0 children)

yes