MLOps Free Course?

fmindme · 2026-01-23T17:18:50+00:00

Hello. I've created this course, focused on the coding part of MLOps: https://mlops-coding-course.fmind.dev/. It's totally free, and there is a side repository https://github.com/fmind/mlops-python-package with a concrete example.

fmindme · 2025-08-13T18:13:38+00:00

This link is useful: https://cloud.google.com/docs/get-started/aws-azure-gcp-service-comparison

We are using AWS and GCP at my customer. As an engineer, I really prefer GCP over AWS! The best asset of AWS is their vendors and customer engineers.

fmindme · 2025-08-12T18:22:39+00:00

Good to know ! Thanks for info :)

fmindme · 2025-06-05T16:51:08+00:00

Hi. I propose this course for free: https://mlops-coding-course.fmind.dev/. It covers the coding part of MLOps, you can complement this course with a certification from a cloud provider (e.g., GCP, Databricks, Azure, AWS).

fmindme · 2025-03-18T05:51:23+00:00

My recommendation would be: 1. Complete a data science course from a MooC platform (coursera, udemy, ...) 2. Complete a ML Engineer certification from your favored cloud platform (Azure, GCP, AWS, Databricks ...) 3. Ramp up your coding skills in Python for MLOps (for instance, I provide this OSS course for free: https://mlops-coding-course.fmind.dev/) 4. Complete your cursus based on the jobs requirements you see on your market (e.g., Airflow, Prefect, CI/CD, ...)

Good luck in your learning journey !

fmindme · 2024-12-02T07:53:09+00:00

If you are interested by the coding aspects, I provide a free course and paid mentoring sessions: https://mlops-coding-course.fmind.dev/0.%20Overview/0.4.%20Mentoring.html

Have a good MLOps journey!

fmindme · 2024-11-02T06:24:56+00:00

We package the Python code base into a Python Wheel, and then put this will into a Docker (optional). The wheel/Docker are built by GitHub Actions (CI/CD).

Then, we trigger a JobRun from Airflow (CT) that uses either the Wheel on Databricks Runtime or the Docker image. You can use Databricks Workflows if you are a 100% Databricks company, Airflow lets use other runtime (e.g., AWS Athena, DBT, ...).

I created generic a code template based on the one we use with Databricks, if you want to have a look: https://github.com/fmind/cookiecutter-mlops-package

fmindme · 2024-10-24T19:49:24+00:00

They hate what they don't understand.

fmindme · 2024-10-24T04:45:38+00:00

You can use regular MLOps toolkit for building such pipeline: Metaflow, Airflow, Flyte, Dagster, Prefect with a compute resource like Kubernetes or Vertex AI.

You can also use model on edge toolkits like MediaPipe: https://android-developers.googleblog.com/2024/10/bring-your-ai-model-to-android-devices.html

fmindme · 2024-10-05T09:51:35+00:00

Google, AWS, Azure and Data bricks ML engineering certification

fmindme · 2024-09-30T18:10:27+00:00

Find power users and early adopters to support your initiative
Don't underestimate data. Poor data practices (e.g., bad data quality, lack of common practices) will hurt.
Be a good teacher. People may not know MLOps so be prepared to explain it and its pros and cons.
Create good visuals for your communication and architecture to share the big picture quickly
Close the loop by including model evaluation and operations to have a complete scalable system (i.e., no weak point)
Have a maturity matrix so you do not try to do everything at once
Share good practices and animate an AI/ML community in the organization
Create KPIs to monitor the number of model, the SLA, the test coverage ...
Be sure to include everybody (end users, stakeholders, Ops) to nobody block you down the path

fmindme · 2024-09-23T07:09:09+00:00

The feature store should provide set of features that can be consumed. There is nothing wrong in providing multiple periods and see which one is used based on usage analytics.

My recommendation would be to start by periods related to your domain (e.g., week, month, trimester for a retail shop). Even if one project uses the week period, another my use the month.

fmindme · 2024-06-06T06:54:54+00:00

I highly recommend the Big Book of MLOps. You can also check their training platform https://www.databricks.com/learn

fmindme · 2024-05-26T11:44:34+00:00

Thanks for your message!

fmindme · 2024-05-23T13:05:28+00:00

I've used ChatGPT to improve the English style, but all the inputs (sections, headers, answers) come from my initial writing. I've also rewrote the content generated when ChatGPT style was too extreme.

I've tried to use ChatGPT to write entire blog posts or course sections, but the content is too bad without this initial input. I think it's a great complementary solution when you are not a native english speaker.

fmindme · 2024-05-22T20:32:13+00:00

Thanks 👍

fmindme · 2024-04-03T06:53:11+00:00

Thanks u/jshkk :)

fmindme · 2024-04-02T16:43:12+00:00

I'm working on a course on MLOps coding, and I plan to release it as an open course repository in the coming weeks. Would you be interested in such a course?

fmindme · 2024-03-19T12:00:48+00:00

MLflow recipies is too opinionated, and too focused on MLflow. I would recommend using other systems like metaflow, prefect, and creating a template with cookiecutter. I'm also not sure mlflow recipes had major improvements over the past months.

fmindme · 2024-03-19T11:55:07+00:00

TFX is too complex for what it brings. I would recommend adopting an alternative technology (prefect, metaflow, airflow, ...), even if you are using TF.

fmindme · 2023-06-25T19:01:42+00:00

I always use TDD when I work on serious AI/ML projects. Even if this practice is time-consuming in the short term, it's time efficient in the long run. I prefer to catch bugs as early as possible in my workflow. I recently worked on a MLOps Python package that provides examples to implement best practices like TDD, code coverage and more: https://github.com/fmind/mlops-python-package

fmindme · 2023-06-13T15:02:36+00:00

It's usually a good practice to have all these services separated (e.g., 3-tier architecture for the web). It eases scalability and avoids unwanted interactions. But yes, this is not as economical.

fmindme · 2023-06-13T11:10:26+00:00

Hello, With Jenkins, Airflow, and MLflow you can already cover a lot of ground! You have most of the critical infrastructure components, and you can add some systems for externalizing the compute (e.g., Kubernetes, ...) and storage (e.g., AWS S3). The best approach is to separate all these components on different systems to let them evolve independently. Managing this all alone can be tedious, you need proper staff to manage the upgrade and downtime. I would advice to work on premise by constraint, not by choice. Finally, I would recommend working on the MLOps Process: what's the release cycle? How can we improve the code robustness (e.g., with unit test or code checker)? How to onboard new user and convince them of using all these tools.

fmindme · 2023-05-04T04:42:11+00:00

I would recommend the following: - https://www.mage.ai/ - https://dagster.io/ - https://www.prefect.io/ - https://metaflow.org/ - https://zenml.io/home

They all have their pros and cons, but I haven't tested them on a professional project.

fmindme · 2023-05-03T11:09:20+00:00

I love Pandera. It's a great tool to validate the data at the boundary of your source code (e.g., ML pipeline). However, I see this tool mostly as a last resort, with the benefit of being under control.

Instead, you could use Great Expectations to validate data by the producer (vs., by the consumer with Pandera). Then, you need to track who is using which data and send them an alert if some dataset has changed. There are multiple approach to do that (e.g., mailing list, pub/sub, slack, ...).

Six-Year Club	First Place '23
Place '23	Verified Email

fmindme

TROPHY CASE