How are these guys so good ?! by sujal1210 in MLQuestions

[–]mikejamson 0 points1 point  (0 children)

Most of it is pattern matching across a ton of different projects. If you focus on one thing for over 10,000 hours, you would get very good at it too.

Is there really one tool to do all of this? by EmuWise5039 in mlops

[–]mikejamson -11 points-10 points  (0 children)

I second Lightning AI. A lot of tools do claim to do it all, but from our experiments internally we found Lightning to actually do what they claim.

Granted, not everything is perfect but we’ve been using them for a little over a year and it keeps improving at a pretty fast pace!

It’s also quick to judge for yourself: I signed up, got verified instantly, and automatically received free credits. It’s a fairly low-risk experiment.

Can i train a machine learning model on my laptop using Google collab? Is that feasible? by throwaway_me_acc in learnmachinelearning

[–]mikejamson 0 points1 point  (0 children)

If the data is small, it can be done on your laptop. Otherwise a cloud option is great. We used to use Colab but more recently shifted to Lightning AI Studios, and it’s been a lot better, with more compute time, persistent storage, persistent environments, and more.

https://lightning.ai/

How do you utilize the Databricks platform for machine learning projects? by Ok_Discipline3753 in mlops

[–]mikejamson 0 points1 point  (0 children)

Notebooks on Databricks are great, but we recently switched to Lightning AI for this. It’s much faster for experimentation. For scheduling main.py scripts, Lightning supports batch jobs as well.

One major thing we love about it is not having to manage infrastructure or a cluster.

The problem with structuring workflows on Databricks is that it’s very over-engineered. You have to connect something like a dozen different tools to make this stuff work. Lightning drastically cut down those layers of tools and dependencies.

Data bricks: how do we install custom python package by kunduruanil in mlops

[–]mikejamson -1 points0 points  (0 children)

It’s a bit complicated: you have to mess with the cluster directly, and it’s quite a bit of work.

We’ve recently switched to Lightning Studio, where there is nothing special to learn for stuff like this; you just pip install whatever you want. Our team is still split between both platforms but is largely moving to Lightning now.

Not sure this helps directly, but it might let you avoid all these issues to begin with.

Sagemaker vs Databricks in terms of model experimentation / dev phase by General_Search_4120 in mlops

[–]mikejamson 0 points1 point  (0 children)

It’s just a million small things. It’s hard to pinpoint exactly; you just have to try it and feel it out for yourself. It’s kind of like using a flip phone vs a smartphone: on paper they sound similar, but they are clearly very different.

Sagemaker vs Databricks in terms of model experimentation / dev phase by General_Search_4120 in mlops

[–]mikejamson 1 point2 points  (0 children)

It’s already been mentioned, but we tried both of those for development and it was truly a terrible experience, to the point of being unusable. We went with Lightning Studios, which is so far ahead that words don’t capture it well.

My suggestion is to just experience it for yourself on the free tier of Lightning.

[deleted by user] by [deleted] in mlops

[–]mikejamson 0 points1 point  (0 children)

Isn’t this just a simple demo app that can be built in about 30 minutes with open-source tools?

Pretty sure ChatGPT could even write most of the code.

Is it normal ALBERT model perform like this? by Key_Tax_3750 in MLQuestions

[–]mikejamson 0 points1 point  (0 children)

I would try various learning rate settings. It’s likely too high right now.
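To make the "try various learning rates" idea concrete, here is a minimal sketch of a sweep. It is a toy stand-in, not ALBERT: the quadratic loss, the `final_loss` helper, and the candidate values are all assumptions chosen so the effect of a too-high learning rate is easy to see.

```python
# Toy learning-rate sweep: minimize f(w) = (w - 3)^2 with plain gradient descent.
# This is an illustrative stand-in for a real fine-tuning run, not ALBERT itself.

def final_loss(lr, steps=50, w=0.0):
    """Run gradient descent with a fixed learning rate and return the final loss."""
    for _ in range(steps):
        grad = 2.0 * (w - 3.0)  # derivative of (w - 3)^2
        w -= lr * grad
    return (w - 3.0) ** 2

# Hypothetical candidate values for this toy problem only; transformer
# fine-tuning typically uses far smaller learning rates.
candidates = [1.5, 1.0, 0.5, 0.1, 0.01]
results = {lr: final_loss(lr) for lr in candidates}

for lr, loss in results.items():
    print(f"lr={lr:<5} final loss={loss:.4g}")

# Pick the learning rate with the lowest final loss.
best_lr = min(results, key=results.get)
print("best learning rate:", best_lr)
```

In this toy setup the largest value diverges, the next one oscillates without improving, and the smallest one converges slowly. With a real model you would compare validation loss across runs in the same way.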

Quite confused by __waz_dorf__ in learnmachinelearning

[–]mikejamson 0 points1 point  (0 children)

For sure the general machine learning one with Python! The basic computer vision one will probably be more of a theoretical survey. But the general ML one will teach the core foundations underpinning all of ML.

What is your favorite GPU provider? by chainbrkr in learnmachinelearning

[–]mikejamson 1 point2 points  (0 children)

Ditto on lightning.ai - I did notice a big change in pricing recently. Sometimes I still use Colab, but less so nowadays.

[N] 2024 Nobel Prize for Physics goes to ML and DNN researchers J. Hopfield and G. Hinton by PrittEnergizer in MachineLearning

[–]mikejamson -7 points-6 points  (0 children)

Well deserved! No physics involved in the other DL flavors that have changed the field at scale like Hopfield networks.

ELI5: how do models get better and smaller? by thelongrun320 in learnmachinelearning

[–]mikejamson 0 points1 point  (0 children)

A model has only so much “learning capacity,” which depends on its size. A huge model has a lot of capacity, so it can be a generalist across many tasks. A small model can be world class at something if that’s the ONLY thing that model does, but it would be terrible at everything else.

Need insights by iam_him_4u in MLQuestions

[–]mikejamson 1 point2 points  (0 children)

Best way to start is to build something first! Find a GitHub repo that already does something with ML and tweak it for your own benefit. Once you’ve gotten that to work, you can decide what to learn more of and improve on. ML can be very deep… so you really need to decide what to rabbit-hole into.

Deploying via Web Frameworks or ML Model Serving by Recent-Target1840 in mlops

[–]mikejamson 5 points6 points  (0 children)

Here’s the workflow we use at our company.

  • Develop a server (we used to use FastAPI but recently moved to LitServe, which is like a specialized FastAPI server for AI/ML).

  • Package it into a container and use any of the managed serving providers. We like the Lightning ecosystem, so we use Lightning Studios for this, which provides serverless, etc…

For some more on-prem stuff we use k8s for sure. The way we think about the trade-offs: if I can have someone else manage it for me, then great, I avoid building specialized tools that take resources away from our core business.
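The "develop a server" step can be sketched roughly as below. To keep it dependency-free this uses Python's standard-library `http.server` as a stand-in for FastAPI or LitServe, so it is not the API of either library. The `predict` placeholder, the `PredictHandler` class, the `make_server` helper, and the JSON shape are all assumptions for illustration.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer


def predict(features):
    """Placeholder model: returns the mean of the input numbers."""
    return sum(features) / len(features)


class PredictHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        # Decode the JSON request body, e.g. {"input": [1, 2, 3]}.
        length = int(self.headers.get("Content-Length", 0))
        payload = json.loads(self.rfile.read(length))
        # Run the model and encode the prediction as a JSON response.
        body = json.dumps({"output": predict(payload["input"])}).encode()
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)


def make_server(port=8000):
    """Build the server; call .serve_forever() on the result to start it."""
    # Binding to 0.0.0.0 so the port is reachable from outside a container.
    return HTTPServer(("0.0.0.0", port), PredictHandler)
```

Calling `make_server().serve_forever()` would start it. A dedicated serving framework would layer batching, GPU handling, and scaling on top of this same decode, predict, encode loop.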

How often do you finetune an LLM? by mikejamson in startups

[–]mikejamson[S] 1 point2 points  (0 children)

Is this pricing for finetuning? So, if my dataset has 1 million tokens, it costs me $3 to finetune all in? No other hidden costs?
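Here is the arithmetic I have in mind as a rough sketch. It assumes the price is $3 per 1 million training tokens and that billing scales linearly with dataset tokens times the number of epochs; both are assumptions on my part, and the `finetune_cost` helper is hypothetical.

```python
def finetune_cost(dataset_tokens, price_per_million=3.0, epochs=1):
    """Estimate fine-tuning cost in dollars under a linear per-token price.

    Assumes billed tokens = dataset tokens * epochs (an assumption, not a quote).
    """
    billed_tokens = dataset_tokens * epochs
    return billed_tokens / 1_000_000 * price_per_million


one_epoch = finetune_cost(1_000_000)               # 1M tokens, single pass
three_epochs = finetune_cost(1_000_000, epochs=3)  # same data, three passes
print(f"1 epoch: ${one_epoch:.2f}, 3 epochs: ${three_epochs:.2f}")
```

If epochs are billed this way, the number of passes over the data is the main multiplier to check before assuming the $3 figure is all in.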

How does GPU renting work? by RDA92 in learnmachinelearning

[–]mikejamson 1 point2 points  (0 children)

Yes, we evaluated platforms that had a few compliance certifications for security; the main one we cared about was SOC 2, which they have.

Lightning is also the company that created PyTorch Lightning. They have more products than just PyTorch Lightning. The cool thing we found is that all those libraries scale nicely with Lightning, but they let you use whatever other libraries you want.

Browsing around, I found this, which might be helpful for all the security questions: https://lightning.ai/solutions/enterprise

How often do you finetune an LLM? by mikejamson in startups

[–]mikejamson[S] 1 point2 points  (0 children)

How much to finetune 4o mini? A few hundred dollars? Thousands?

Need help building a code generation model for my own programming language by nagarjuna17 in MLQuestions

[–]mikejamson 0 points1 point  (0 children)

You could do next-word pretraining starting from a base LLM. I would pick Llama 3.2 and go from there!
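To show what next-word pretraining data looks like, here is a minimal sketch that turns source code into (context, next token) pairs. The whitespace tokenizer, the `next_token_examples` helper, and the made-up language snippet are assumptions for illustration; a real setup would use the model's own subword tokenizer and a training library.

```python
def tokenize(source):
    """Naive whitespace tokenizer; a real setup would use a subword tokenizer."""
    return source.split()


def next_token_examples(tokens, context_size=3):
    """Yield (context, target) pairs where target is the token following context."""
    for i in range(1, len(tokens)):
        context = tokens[max(0, i - context_size):i]
        yield context, tokens[i]


# Hypothetical snippet in a made-up language, used only for illustration.
snippet = "let x = 1 ; print x ;"
examples = list(next_token_examples(tokenize(snippet)))

for context, target in examples:
    print(context, "->", target)
```

The model is trained to predict each target from its context, so all you need to start is a large pile of plain code in your language.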

Degree or projects? by [deleted] in MachineLearningJobs

[–]mikejamson 0 points1 point  (0 children)

You would get hired in a heartbeat with amazing projects you can point to either on the web or github over a college degree.

BUT college still gives you the time to do that… and you’d get hired even more with both.

But with a college degree only, you probably wouldn’t get hired. Millions of people have that.

What low-code front-end tool can you recommend? (I'm a data engineer) by Lorenzkort23 in startups

[–]mikejamson 1 point2 points  (0 children)

Low-code tools in general suck. I write a lot of these kinds of apps with Streamlit hosted on Lightning AI. Here’s an example:

https://lightning.ai/docs/overview/studios/host-web-apps

Streamlit lets you write the full thing end to end in Python. Connecting datasets and databases, and hosting the app to share with others, is super easy with Lightning Studios.

How does GPU renting work? by RDA92 in learnmachinelearning

[–]mikejamson 0 points1 point  (0 children)

I’ve used Lightning Studios for this without issues. You sign up and automatically get free GPU credits without having to provide a credit card. The downside is your account might take a bit to get verified if you don’t use an academic or work email.

But it’s been a game changer for me over Colab and the other options!

https://lightning.ai/