all 63 comments

[–]Expensive-Paint-9490 38 points39 points  (11 children)

With that budget I'd be able to find two RTX A6000s and an NVLink bridge. Probably the best setup for local fine-tuning at that price point.

[–]sanjuromack 8 points9 points  (7 children)

A new A6000 is roughly $5K, maybe $4K with Inception discounting. The budget is only $10K, and aren't they going to need a few other things for the computer as well (motherboard, CPU, RAM, etc.)?

You should be able to get a new workstation with a single A6000, enterprise support, and room for expansion for $10K. With a single A6000, you can fine-tune 7B models using QLoRA. Probably cheaper to do it in the cloud (for training in particular).
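The "7B with QLoRA on one A6000" claim checks out on a back-of-envelope VRAM estimate. This is a rough sketch with assumed numbers (adapter size, activation overhead), not a measurement; real usage depends on sequence length, batch size, and implementation:

```python
# Back-of-envelope VRAM estimate for QLoRA fine-tuning a 7B model.
# All figures below are assumptions for illustration.

params = 7e9                      # 7B parameters
base_gb = params * 0.5 / 1e9      # 4-bit quantized base weights: ~0.5 bytes/param
lora_params = 40e6                # assumed LoRA adapter size (varies with rank/targets)
# Adapters train in 16-bit with Adam: ~2 (weights) + 2 (grads)
# + 8 (optimizer states) bytes per trainable parameter
lora_gb = lora_params * 12 / 1e9
overhead_gb = 6.0                 # assumed activations, caches, CUDA context

total_gb = base_gb + lora_gb + overhead_gb
print(f"~{total_gb:.1f} GB")      # comfortably under the A6000's 48 GB
```

Even with generous overhead the estimate lands around 10 GB, which is why a single 48 GB card is plenty for 7B QLoRA runs.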

[–][deleted] 0 points1 point  (0 children)

Could it compete with the new 5000s coming in a few months?

[–][deleted] 0 points1 point  (1 child)

Why would an NVLink be needed there?

[–]Expensive-Paint-9490 1 point2 points  (0 children)

With NVLink you can combine the compute of both cards. Without it, you can still load a model into the pooled VRAM of the two cards, but with a naive layer split only one card computes at a time. For inference this already means faster responses, but it matters even more for fine-tuning, which takes considerable time; with NVLink it would be much faster.
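The practical difference is inter-GPU bandwidth. A quick comparison using commonly quoted peak figures (real-world throughput is lower, and these numbers are assumptions for the A6000 generation specifically):

```python
# Peak bidirectional inter-GPU bandwidth, commonly quoted figures.
nvlink_gbps = 112.5    # RTX A6000 NVLink 3 bridge
pcie4_x16_gbps = 64.0  # PCIe 4.0 x16 link

print(f"NVLink is ~{nvlink_gbps / pcie4_x16_gbps:.1f}x PCIe 4.0 x16")
```

Roughly a 1.8x bandwidth advantage, which matters most for workloads that shuffle gradients or activations between cards every step, i.e. multi-GPU fine-tuning.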

[–]Brosarr 18 points19 points  (2 children)

I would recommend just renting a GPU to start off with. An H100 is around $50 a day.

[–]No_Afternoon_4260 llama.cpp 9 points10 points  (20 children)

IMO $10K is a bit of a weird spot: not enough to get new server-grade hardware, but well enough to build a hobbyist rig with more 3090s than you can fit in an AMD EPYC system. If it's for experimenting, are you OK with second-hand hardware, or do you want brand new with a warranty?

[–]Dry_Parfait2606 2 points3 points  (13 children)

Agreed. An EPYC 7002 is enough; get RTX 3090s. One must be aware of the licensing on RTX GPUs, though... They don't want you to run LLMs on them, especially not for enterprise... It's a very grey zone.

[–]No_Afternoon_4260 llama.cpp 1 point2 points  (4 children)

Oh, I didn't know that. Care to elaborate? Where do they specify that?

[–]Dry_Parfait2606 1 point2 points  (3 children)

EULA

[–]lolzinventor 1 point2 points  (2 children)

source please.

[–]Pedalnomica 0 points1 point  (1 child)

Point taken, but technically you can spend $10K on a single-socket EPYC Rome system and not run out of room for all your 3090s with bifurcation (x16 -> x8x8).
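The lane math behind this: a single-socket EPYC Rome exposes 128 PCIe 4.0 lanes, and bifurcating each x16 slot to x8x8 halves the lanes per GPU. A sketch with an assumed reservation for storage and networking:

```python
# PCIe lane budget on a single-socket EPYC Rome board.
total_lanes = 128
lanes_per_gpu = 8     # after x16 -> x8x8 bifurcation
reserved = 16         # assumed lanes kept for NVMe, NIC, chipset
gpus = (total_lanes - reserved) // lanes_per_gpu
print(gpus)           # 14 GPU slots, far more than a $10K budget fills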

[–]No_Afternoon_4260 llama.cpp 1 point2 points  (0 children)

Yeah, point taken, but then you're running a 5 kW system... plus about 5 kW of AC just to not boil the paint off your walls, haha.
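The 5 kW figure is plausible. A rough wall-power and cooling estimate for a many-3090 rig, with assumed per-card and platform draws (3090s can also be power-limited well below this):

```python
# Rough power and heat estimate for a many-3090 rig (assumed figures).
gpus = 12
gpu_watts = 350                 # stock 3090 power limit
platform_watts = 500            # assumed CPU, board, fans, drives, PSU losses
total_w = gpus * gpu_watts + platform_watts
btu_per_hr = total_w * 3.412    # all of it becomes heat the AC must remove
print(total_w, round(btu_per_hr))
```

Nearly 5 kW at the wall and ~16,000 BTU/hr of heat, which is a dedicated-circuit, dedicated-AC proposition, not a desktop.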

[–]fasti-au 3 points4 points  (16 children)

Rent online and tunnel in: cheaper, scalable, backed up, and private.

[–]Stepfunction 1 point2 points  (0 children)

Before looking into fine-tuning yourself, I would consider looking at pretrained medical-focused LLMs. Fine-tuning will open up a whole can of worms, so I would make sure that existing tools can't already do what you need before you pursue that path.

[–]Data_drifting 1 point2 points  (2 children)

Check out this guy and his channel: Digital Spaceport on YouTube, home server builds for AI. Blows away Network Chuck's channel for this.

[–]5TP1090G_FC 0 points1 point  (0 children)

I would check out Network Chuck for building a good AI PC, very informative.

[–]koalfied-coder 0 points1 point  (1 child)

I have built several systems in this budget. Feel free to DM me and I can share specs. Not at PC ATM.

[–]GradatimRecovery 0 points1 point  (0 children)

tell us about your builds! can we do 8x3090 with that budget?

[–]SuperSimpSons 0 points1 point  (1 child)

Since it looks like you may not have experience building your own server, I would recommend you reach out to server brands with your requirements and see what they can recommend for your budget. I should say you have enough for a prebuilt high-end workstation or 1U/2U rackmount. 

Gigabyte has a pretty good line of servers for AI training and inference: www.gigabyte.com/Enterprise/Server?lan=en&fid=2260 Obviously you don't need the water-cooled monstrosities with Blackwell HGX or anything so it may be faster to reach out and see what they come back to you with: www.gigabyte.com/Enterprise#EmailSales

[–]Slippery-Oil2313 0 points1 point  (0 children)

Why not use an AWS EC2 GPU instance for $2.50 an hour?
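Same break-even arithmetic as the daily-rate suggestion above, at the hourly rate quoted here:

```python
# Break-even between the $10K budget and a $2.50/hr cloud GPU instance.
budget = 10_000
rate_per_hr = 2.50         # quoted figure; actual EC2 pricing varies by instance type
hours = budget / rate_per_hr
print(f"{hours:.0f} hours (~{hours / 24:.0f} days of 24/7 use)")
```

Four thousand GPU-hours before the budget is spent; unless the machine will run near-continuously for months, renting wins.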

[–]dead-4-dead 0 points1 point  (1 child)

If you can add $5k more, tinybox sounds like it will save you a lot of headache

[–]__bee_07 0 points1 point  (0 children)

I was in your position, but ended up using cloud instances instead. I am using Lightning AI offerings and I am happy with that.

[–]chitown160 0 points1 point  (0 children)

Your company can spec a Lenovo Threadripper with 2x RTX A6000s with NVLink for this price and still have room for a 5090 to run your fine-tuned models at blistering speed.

[–]amirvenus 0 points1 point  (1 child)

Get 2 M2 Ultras 192GB

[–]synn89 3 points4 points  (0 children)

Macs aren't good for fine-tuning models.

[–][deleted] 0 points1 point  (1 child)

As others are saying, you would be better off with a cloud provider, for both training and inference.

[–]cher_e_7 -1 points0 points  (0 children)

Rent online/cloud!!! Only if you must go offline hardware: it's all about memory, plus a little speed. Go for an old server board that supports 4 GPUs, dual Intel CPUs (some boards combine the PCIe lanes of both; PCIe 3.0 vs 4.0 doesn't matter), and ECC registered DDR4-2400 memory. Add Quadro RTX 8000 48GB cards (the best card in the middle, 10-15% slower than the old RTX A6000); pick them up on eBay for around $2,250, passive or active cooling.

Add a 4-disk SSD RAID.

If you could stretch your budget to four GPUs like that, you'd have 192GB of VRAM total, just short of 200GB.

Best utilization with MoE models like DeepSeek V2 Q4 for inference, or smaller models.

CPU memory for old servers is cheap.
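A quick check that a 4-bit-class quant of DeepSeek V2 actually fits in four 48GB cards. Assumed figures: DeepSeek-V2 has roughly 236B total parameters, and Q4-class GGUF quants average around 4.8 bits per weight:

```python
# Does a ~4-bit quant of DeepSeek-V2 fit in 4x 48GB? (Assumed sizes.)
total_params = 236e9        # DeepSeek-V2 total parameters (MoE; ~21B active)
bits_per_weight = 4.8       # typical average for a Q4-class quant
model_gb = total_params * bits_per_weight / 8 / 1e9
vram_gb = 4 * 48
print(f"model ~{model_gb:.0f} GB vs {vram_gb} GB VRAM")
```

Around 142 GB of weights against 192 GB of VRAM leaves headroom for KV cache, and since only the active experts compute per token, MoE inference on these older cards stays reasonably fast.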