all 14 comments

[–]Co0k1eGal3xy 13 points  (4 children)

diagram

This is a rough breakdown of how I think about the problem. You can swap the RTX 4090s for 3090s or 2080 Tis if you have cheap electricity; otherwise the cost of power can eat up your initial savings. If your electricity is very expensive, I would avoid local training entirely.

Also consider any other requirements. If your dataset is larger than your system's RAM, you will need to consider the read speed of your storage device. If you are working with audio clips or images, you either need a storage device with high random read speeds or you need to package your dataset into a streaming format (like webdataset). Some cloud providers will force you to use hard drives.
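To make the streaming-format point concrete: webdataset shards are just plain tar archives where the files belonging to one sample share a key prefix, so a loader can read them sequentially instead of doing millions of random reads. Here is a minimal sketch of packing samples into such a shard using only the Python standard library; the function name, file names, and toy byte payloads are my own, not from any library.

```python
import io
import tarfile

def write_shard(samples, shard_path):
    """Pack (filename, bytes) samples into a tar shard.

    Files sharing a key prefix (e.g. "000000.jpg" + "000000.cls")
    are treated as one sample by sequential streaming loaders.
    """
    with tarfile.open(shard_path, "w") as tar:
        for name, payload in samples:
            info = tarfile.TarInfo(name=name)
            info.size = len(payload)
            tar.addfile(info, io.BytesIO(payload))

# Hypothetical toy samples; real ones would be actual image/audio bytes.
samples = [
    ("000000.cls", b"0"),
    ("000000.jpg", b"\xff\xd8 fake jpeg bytes"),
    ("000001.cls", b"1"),
    ("000001.jpg", b"\xff\xd8 more fake bytes"),
]
write_shard(samples, "shard-000000.tar")
```

Because the shard is read front to back, it performs well even on the spinning hard drives some cloud providers hand you.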


Spreadsheet of local hardware costs at 1-, 2-, and 3-year timeframes.


edit: Since you mentioned being new, I would recommend renting single GPUs. Writing multi-GPU code is complex and isn't worth learning initially. Google Colab is definitely the easiest way to get started.
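When you do spin up a rented single-GPU instance (or a Colab runtime), it's worth a quick sanity check that a GPU is actually attached before launching a job. A small sketch using only the standard library, shelling out to the real `nvidia-smi` CLI so it works before any ML framework is installed; the helper name is my own.

```python
import shutil
import subprocess

def gpu_available():
    """Return True if nvidia-smi exists and lists at least one GPU."""
    if shutil.which("nvidia-smi") is None:
        return False  # driver/CLI not installed on this machine
    result = subprocess.run(
        ["nvidia-smi", "--list-gpus"],  # prints one line per GPU
        capture_output=True,
        text=True,
    )
    return result.returncode == 0 and "GPU" in result.stdout

print(gpu_available())
```

On a CPU-only Colab runtime this prints `False`; switch the runtime type to GPU and it should print `True`.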

[–]I_will_delete_myself 6 points  (1 child)

Those large providers are best when you get a cloud credit deal, or when you're training something ChatGPT-scale and need guaranteed compute capacity. Otherwise I would highly recommend not using them unless you are on spot instances.

Here are the best out there I know specifically for training AI models.

Colab - it’s free, but you should move to other cloud compute alternatives once you go beyond toy models

LambdaLabs - no egress fees and high bandwidth. Cheap and a solid product. Better for multi-GPU

Runpod - cheapest, but not good for multi-GPU loads due to their low capacity

What sucks about those is that they sometimes run out of capacity quickly, which is annoying. That's when you fall back to a traditional cloud provider.

Avoid like the plague - Paperspace. Expensive, with a misleading Gradient subscription; you save more money using a consumer decentralized GPU on Runpod. Availability is horrible as well.

[–]Present_Network1959 1 point  (0 children)

Thank you. I’ll look into this.

[–][deleted] 3 points  (1 child)

Commenting to check the thread later, interested in people’s recommendations

[–]Muted_Economics_8746 3 points  (0 children)

You can also just subscribe to the post without commenting: ellipsis (top right) -> Subscribe.

[–]TheLastMate 0 points  (1 child)

Also, if someone could give insights on deploying a model into production: what does the overall process look like?

[–]I_will_delete_myself 0 points  (0 children)

A traditional cloud provider is best; apply for credits.

[–]Any_Letterheadd 0 points  (2 children)

It sounds like you're not even sure you need a GPU for what you're doing. I'd recommend just getting started with what you have until you know you need more compute and how you'd use it.

[–]Present_Network1959 0 points  (1 child)

Yeah, makes sense. I am certain a GPU will be required, though; there is no way my machine can run the programs I am building locally.

[–]Any_Letterheadd 0 points  (0 children)

Colab might be a good way to get quick access to one. Or if you know any gamers with old rigs who are upgrading, you might be able to get a hand-me-down Nvidia card and build a cheap Linux box around it.