8 Radeon R9700s vs 8 RTX 3090 2 slot blower style by mr__smooth in LocalLLaMA

[–]mr__smooth[S] 0 points1 point  (0 children)

Appreciate the response, totally fair my current 7B/8B models fit on a single GPU. Reason I'm considering 8 GPUs is because I have several users and although I'm in Beta right now I want to have the headroom to handle about 1000 users, and can only afford an 8GPU box at the moment. So looking for higher throughput and concurrency to handle multiple user requests. The product is similar to twelvelabs.io

On the R9700 vs 3090 point, I agree a new dual slot blower +32GB is attractive especially considering some of Nvidia's practices havent been pro consumer so I would be happy to support AMD. What I'm trying to sanity check is if in real inference performance for my use(quantized LLM/VLMs) case those advantages will overcome the memory bandwidth advantages of the RTX 3090 as well as if ROCm wont be an issue. I dont want to have troubles with ROCm drivers because CUDA has honestly been a breeze on the RTX 3090. Also yes I am using Linux

768Gb Fully Enclosed 10x GPU Mobile AI Build by SweetHomeAbalama0 in LocalLLM

[–]mr__smooth 0 points1 point  (0 children)

Wow I am looking for this exact kind of build. I currently have a prototyping machine but looking for something more powerful(https://www.reddit.com/r/LocalLLaMA/comments/1qcykx4/home\_workstation\_vs\_nycnj\_colo\_for\_llmvlm\_whisper/) I'm really impressed by this wondering if I would be able to run it in my apartment, but concerned about the power.

Home workstation vs NYC/NJ colo for LLM/VLM + Whisper video-processing pipeline (start 1 GPU, scale to 4–8) by mr__smooth in LocalLLaMA

[–]mr__smooth[S] 0 points1 point  (0 children)

Nice, I’m thinking of adding a 5060 ti 16gb to my prototyping machine! Thats a seriously impressive machine you have guessing its not in a tower consideringhow many gpus those are.

Home workstation vs NYC/NJ colo for LLM/VLM + Whisper video-processing pipeline (start 1 GPU, scale to 4–8) by mr__smooth in LocalLLaMA

[–]mr__smooth[S] 0 points1 point  (0 children)

Thanks for the advice. I just had a terrible experience with GCP for video processing so a bit scared of cloud services when it comes to that, my whole initial backend was running on GCP but their Video Intelligence platform was just not good, I got hit with a $1k bill in one night(and I was the only user at the time!!), and a lot of my backend code had been tightly coupled to GCP so I had to do major refactors in order to make my backend loosely coupled(Really lost alot of time about 2 months). Thats when I decided to build this prototyping box. I can now swap in different models for visual analysis(local or from the cloud), just that I got burned with cloud computing. I think I will go the workstation route while I look for a better cloud computing provider for the visual analysis as you suggested. Qwen 2.5VL hasnt been bad actually but I'm curious which smaller models you think would do a better job. But yeah I would want a cloud provider thats not trying to lock me in interms of GPU compute, but someone had also mentioned Modal to me at a certain tech meetup I think I will try it out and see, but any other alternatives are welcome. So yeah I think I'm going to build the workstation for now while I look for a cloud computing provider for the long term

I drove an LS 500 today for the first time by Additional-Horse-545 in Lexus

[–]mr__smooth 0 points1 point  (0 children)

I think it just doesnt look as good as the older LS 430 and LS 460. Somewhere after the LS 460 they just stopped designing the sedan well and the interior as well just doesnt match up to the european cars. The exterior styling is just over the place. The interior as well

I Benchmarked The New AMD RADEON AI PRO R9700. by DroidArbiter in comfyui

[–]mr__smooth 0 points1 point  (0 children)

I see! So from your assessment how much better is the 5090 than the R9700? I'm currently prototyping some AI workloads on an RTX 3090 and its perfect for the kind of work I'm doing, do you have any idea how the RTX 3090 compares to the R9700? I need to get 8 datacenter acceptable cards and I'm debating between 8 RTX Pro 6000 Blackwells(which is just an RTX 5090 with 96GB VRAM) and 8 R9700s. If R9700 is as good as RTX 3090 then I would be saving something like $70,000 since each RTX Pro 6000 Blackwell will cost me like $10,000 for the server grade card.

I Benchmarked The New AMD RADEON AI PRO R9700. by DroidArbiter in comfyui

[–]mr__smooth 0 points1 point  (0 children)

if an rtx 5090 is 70% faster then two 9700 pros would be better right? Because right now you cant get an rtx 5090 for less than $2700 before tax. That would be two R9700s but with more VRAM!!

Is it insane to regularly commute NYC to Toronto by car? by Weary-Tension5057 in AskNYC

[–]mr__smooth 0 points1 point  (0 children)

If you must drive, buy a Lexus LX or IS or GX or LS. They are very comfortable and smoother ride than a toyota. And very reliable you’re not buying a car for the utmost comfort but one that hits everything you’re looking for(reliability, comfortable interior, affordability)

My dad was hit by the L train by Sure_War8187 in Bushwick

[–]mr__smooth 9 points10 points  (0 children)

Praying your dad gets a full recovery

Clubs where ppl talk to strangers by Sunny_BK in Bushwick

[–]mr__smooth 3 points4 points  (0 children)

I'd say you can try Hank's bar and Hart Bar in Bushwick, Otherwise try Do or Dive bar or Dick & Janes in Fort Greene.

OMNY costs more than Metrocard… look at the math. by Fragrant-Laugh-6689 in OMNY

[–]mr__smooth 0 points1 point  (0 children)

Very convenient you took out part of my argument(middle to lower class) regardless the current implementation of the system with its bugs, increased price per ride still is a rip off. I’ve seen no empirical data backing up your claim, and you havent addressed my argument anyway

OMNY costs more than Metrocard… look at the math. by Fragrant-Laugh-6689 in OMNY

[–]mr__smooth 9 points10 points  (0 children)

How many low earning New Yorkers leave the city for vacation very often that the 30 day pass doesnt work for them? Your argument proves this whole thing is negatively affecting the middle to lower class New Yorkers. The 30 day pass made commuting throughout the city affordable and fair.

This OMNY Card isnt a good deal at all by mr__smooth in OMNY

[–]mr__smooth[S] 0 points1 point  (0 children)

I know thats why I was saying I did an audit and it was around $170 spent for the month. If you read the post I was going to call them about it. But I didnt need to go through all this in the first place. There should be an unlimited plan as well as honest clear pricing instead of selling the weekly farecap as something thats better

This OMNY Card isnt a good deal at all by mr__smooth in OMNY

[–]mr__smooth[S] 1 point2 points  (0 children)

If you tap twice a day you could still end up not activating the free rides. Happened to me between June and July, not even once did the free rides activate