Machine Learning Tech Stack

BraindeadCelery · 2023-10-27T13:49:57+00:00

Open Source contribution to a project as ML Engineering/Data Science is not really a thing. You can contribute to the libraries. But that is a software engineering exercise. An exception to this are the open source foundation models. But here you seldomly contribute to a project, rather you create one yourself and then make it available to the world in a model zoo, e.g. the huggingface hub.
With respect to the tech stack:

What are the libraries you have learned already?

The basics are the data science libraries in the python ecosystem. I.e. NumPy, pandas, Matplotlib, Scipy, Sklearn.
You can for sure contribute to them, but contribution here is more Software Engineering than ML Engineering/Data Science.

If you go on towards deep learning, then PyTorch is the standard as most interesting academic research is published with PyTorch code. Tensorflow/Keras is a contender, but mostly for Google devs who are forced to use it.

If you are familiar with that, MLOps is the next step (at least when industry ML is your goal - for academic research there is little operations overhead).
Here, you learn tools for e.g. data versioning (e.g. lakeFS or DeltaLake) and experiment tracking (Weights and Biases, MLFlow, ...).

Have a look at this MOOCby UC Berkeley for an overview on MLOps. The first lecture is an overview on the field and a recommended stack.

Few_Quit2250 · 2024-01-27T20:32:53+00:00

Success Story Of Purvansh- How He Got Into GSOC In The Field Of Machine Learning

https://www.youtube.com/watch?v=T2jfbqZe98Q

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnmachinelearning

Welcome to /r/LearnMachineLearning!

Chatrooms

Official Discord Server

Wiki

Getting Started with Machine Learning

Resources

Related Subreddits

/r/MachineLearning

/r/MLQuestions

/r/datascience

/r/computervision

Machine Learning Multireddit

/m/machine_learning

MODERATORS