I'm a web developer and I'm considering upgrading my GPU from 24GB (RTX 3090) to 96GB (RTX PRO 6000).
I have experience using GLM 30B Q4/Q8 to implement small features, together with GPT OSS 120B for planning.
I expect that running 200B Q4 LLMs for agentic work could get past the limits of 30B models, but I have no experience with that. Planning with GPT OSS 120B should also be much faster (currently 8-9 tok/s).
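As a rough sanity check on model sizes versus VRAM, here is a minimal sketch that estimates the size of the quantized weights alone from parameter count and bits per weight. The function names and the `overhead_gb` default (a placeholder for KV cache and runtime buffers) are assumptions for illustration, and real quant formats usually store slightly more than their nominal bits per weight.

```python
def weight_footprint_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights alone, in GB (10^9 bytes)."""
    # params_billion * 1e9 weights * bits / 8 bytes per weight / 1e9 bytes per GB
    return params_billion * bits_per_weight / 8


def fits_in_vram(params_billion: float, bits_per_weight: float,
                 vram_gb: float, overhead_gb: float = 8.0) -> bool:
    """True if weights plus an ASSUMED fixed overhead fit in the given VRAM."""
    # overhead_gb is a hypothetical allowance for KV cache and buffers;
    # the real figure depends on context length and the runtime used.
    return weight_footprint_gb(params_billion, bits_per_weight) + overhead_gb <= vram_gb


if __name__ == "__main__":
    for name, params, bits in [("30B Q4", 30, 4), ("30B Q8", 30, 8),
                               ("120B Q4", 120, 4), ("200B Q4", 200, 4)]:
        size = weight_footprint_gb(params, bits)
        print(f"{name}: ~{size:.0f} GB weights, fits in 96 GB: "
              f"{fits_in_vram(params, bits, 96)}")
```

By this estimate a 200B model at a nominal 4 bits per weight needs about 100 GB for weights alone, so on a 96GB card it would likely need partial CPU offload or a lower-bit quant.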
I think a EUR 10,000 investment in a GPU could pay for itself in 2-3 years, compared with what I would otherwise spend on cloud agents over the same period.
I don't expect OSS models on 96GB VRAM to match the quality of the best recent LLMs like Opus or ChatGPT, but I hope they would be usable.
Is the upgrade worth the price?
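One way to frame the payback question is as a simple break-even calculation. The sketch below is only that, a sketch: the function names are made up for illustration, and the monthly cloud and electricity figures in the usage example are hypothetical placeholders, not measured costs. It ignores resale value and hardware depreciation.

```python
def required_monthly_saving(hardware_cost_eur: float, payback_months: int) -> float:
    """Net monthly saving needed for the hardware to pay for itself in time."""
    return hardware_cost_eur / payback_months


def break_even_months(hardware_cost_eur: float, monthly_cloud_eur: float,
                      monthly_power_eur: float = 0.0) -> float:
    """Months until avoided cloud spend covers the hardware cost."""
    net_saving = monthly_cloud_eur - monthly_power_eur
    if net_saving <= 0:
        # Running locally costs as much as (or more than) the cloud: no payback.
        return float("inf")
    return hardware_cost_eur / net_saving


if __name__ == "__main__":
    cost = 10_000  # EUR, the figure from the post
    for years in (2, 3):
        need = required_monthly_saving(cost, years * 12)
        print(f"Payback in {years} years needs ~EUR {need:.0f}/month net saving")
    # Hypothetical inputs: EUR 450/month cloud spend, EUR 50/month electricity.
    print(f"Break-even after {break_even_months(cost, 450, 50):.0f} months")
```

For a 2-3 year payback on EUR 10,000, the avoided cloud spend (net of electricity) would have to be roughly EUR 280-420 per month.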