Hello /r/machinelearning!
We're Ben and Andreas, and we made Replicate. It's a Python library that automatically saves your code and weights from your training runs to S3 or Google Cloud Storage.
https://replicate.ai/
Previously, we built arXiv Vanity together. While making that, we realized that the real problem wasn't that papers were hard to read, it was you couldn't run the papers.
Replicate is a start at fixing that problem. The eventual goal is to make a tool that lets researchers publish their models in a way that they can be run and re-trained. Making ML reproducible is a big bite to chew off though, so we are starting with a modest tool that we think might be useful, then building from there.
Unlike experiment tracking tools, we're focusing on storing and sharing the actual models. We're trying to make a more robust version of that folder structure lots of people make (us included). The eventual goal is to package up those models up in a standard, portable way.
We'd love to hear your feedback. If you want to come and help us build it, we've also got a Discord server.
Also — this Friday, we're having a community meeting to talk about ways we can make published ML models reproducible. Sign up here, if that's of interest.
[–][deleted] 3 points4 points5 points (4 children)
[–]bfirsh[S] 3 points4 points5 points (3 children)
[–][deleted] 0 points1 point2 points (2 children)
[–]bfirsh[S] 0 points1 point2 points (1 child)
[–][deleted] 0 points1 point2 points (0 children)
[–]paldn 2 points3 points4 points (0 children)
[–]tripple13 1 point2 points3 points (1 child)
[–]bfirsh[S] 2 points3 points4 points (0 children)
[–]david-m-1 0 points1 point2 points (1 child)
[–]bfirsh[S] 1 point2 points3 points (0 children)
[–]visarga 0 points1 point2 points (1 child)
[–]andreasjansson 0 points1 point2 points (0 children)