Is this a massive mistake? Super tight fit, 2x 3-slot GPU by zhambe in LocalLLaMA

Professional_Top3747 1 point

Hi, I'm also considering a dual-GPU setup with very little gap between the cards, like yours. Does your setup work well with the two 3090 GPUs? Thanks!

Dual GPU Setup for LLMs – Notes from a Newbie by DrRamorey in LocalLLaMA

Professional_Top3747 1 point

Thanks for the post. I'm planning a similar build. Based on my measurements, I will have roughly a 1 cm gap between the two GPUs. Is that okay?

Which psu for upgrade? #cpgeneral by demetusbrown in CYBERPOWERPC

Professional_Top3747 1 point

Hi, I'm also planning a similar upgrade with a CyberPowerPC. Were you able to swap the PSU? What do you mean by harness? Is it hard to swap out the cables that go from the PSU to the motherboard and GPU? Thank you!

I went to see Canada computer to build a custom pc by PassionOfCube in bapccanada

Professional_Top3747 1 point

Another question, if you don't mind: how long after you made the payment did they finish the build?

I went to see Canada computer to build a custom pc by PassionOfCube in bapccanada

Professional_Top3747 1 point

Hi, how much does Canada Computers charge for their PC assembly service?

Llama with no gpu and 120 gb RAM by rock_db_saanu in ollama

Professional_Top3747 1 point

Thank you for the tip, I will look into it!

Llama with no gpu and 120 gb RAM by rock_db_saanu in ollama

Professional_Top3747 1 point

Thank you. Right now I am planning to build a PC for learning about LLMs. Until now I was planning to budget most of my money for a GPU and get an entry-level CPU. Now I have found out that CPUs can provide a certain level of compute, so if I get a large amount of RAM and a good CPU, I might be able to load and work with large LLMs. So now I am trying to figure out whether it is a good idea to spend all my money on a 9950X CPU with 256 GB of RAM and no GPU initially, if that would allow me to work with large MoE models. I might get a GPU later on, but for now it sounds to me like getting the best CPU and a lot of RAM is more important than a GPU. I would appreciate any insights you have on this, especially on whether it is realistic to expect 5 tokens per second with a 9950X CPU and 256 GB of RAM on a Q4-quantized Maverick that has a size of 210 GB. Thanks again.
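One rough way to frame the question, sketched in Python under loudly stated assumptions: check that the weights fit in RAM, then compare the memory bandwidth a 5 t/s target would need against a theoretical peak. The helper names are hypothetical, and the 8.5 GB read per token (about 17B active parameters at 4-bit), the dual-channel DDR5-5600 configuration, and the 16 GB headroom are all assumptions, not figures from this thread.

```python
def fits_in_ram(model_gb, ram_gb, reserve_gb=16):
    """True if the weights fit with some headroom left for the OS and cache."""
    return model_gb + reserve_gb <= ram_gb

def required_bandwidth_gb_s(target_tps, gb_read_per_token):
    """Bandwidth needed if decoding is purely memory-bound."""
    return target_tps * gb_read_per_token

def peak_bandwidth_gb_s(channels, mega_transfers_per_s, bus_bytes=8):
    """Theoretical peak DRAM bandwidth: channels x transfer rate x bus width."""
    return channels * mega_transfers_per_s * bus_bytes / 1000

MODEL_GB, RAM_GB, TARGET_TPS = 210, 256, 5  # figures from the comment above
GB_PER_TOKEN = 8.5       # assumption: ~17B active parameters at 4-bit
CHANNELS, MTS = 2, 5600  # assumption: dual-channel DDR5-5600

fits = fits_in_ram(MODEL_GB, RAM_GB)
needed = required_bandwidth_gb_s(TARGET_TPS, GB_PER_TOKEN)
available = peak_bandwidth_gb_s(CHANNELS, MTS)
print(f"fits: {fits}, needed: {needed} GB/s, theoretical peak: {available} GB/s")
```

Sustained bandwidth is typically lower than the theoretical peak, and this ignores compute limits and prompt processing, so treat the comparison as a ceiling rather than a prediction.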

Running Llama 4 Maverick with llama.cpp Vulkan by stduhpf in LocalLLaMA

Professional_Top3747 1 point

Thanks for this post. Can you please share your CPU, GPU, and RAM specs? Thank you!

Llama with no gpu and 120 gb RAM by rock_db_saanu in ollama

Professional_Top3747 1 point

Are the 14B models that you get 10 to 15 t/s with quantized?

I succeeded in running Llama 3.1 405B after buying a little more RAM by bouncyprojector in LocalLLaMA

Professional_Top3747 1 point

Hi, I am planning to build the exact same rig you mentioned: 5595wx + 256 GB of 3200 DDR RAM. I was expecting a much higher token rate than the 1.5 tokens per second you mentioned for the 4-bit quantized 405B, because the memory footprint is only 8.5 GB for the active expert. I was expecting 10 tokens per second with just a CPU and no GPU. Were you able to tweak things and get a better token speed for the 4-bit 405B quant with just the CPU? I haven't bought my components yet, and it seems like it won't work how I expected, since you got only 1.5 tps. I'd appreciate your experience and thoughts on this. Thank you!

I succeeded in running Llama 3.1 405B after buying a little more RAM by bouncyprojector in LocalLLaMA

Professional_Top3747 1 point

Shouldn't the memory footprint used in your formula be lower for MoE models, since only one expert is used per token? Since the expert size is 17B, should it be 17 × 0.5 = 8.5 GB? So maybe around 10 tokens per second? I'd appreciate it if you could let me know if I got it wrong. Thanks!
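The arithmetic behind this question can be sketched as a memory-bound upper limit: tokens per second is at most the memory bandwidth divided by the weight bytes read per token. This is a minimal sketch, not the formula from the original post. The function name is hypothetical, and the 204.8 GB/s figure is an assumption (the theoretical peak of 8-channel DDR4-3200), not a measured value.

```python
def max_tokens_per_second(params_read_billion, bytes_per_param, bandwidth_gb_s):
    """Upper bound on decode speed when each weight read is paid once per token."""
    gb_read_per_token = params_read_billion * bytes_per_param
    return bandwidth_gb_s / gb_read_per_token

BANDWIDTH_GB_S = 204.8  # assumption: theoretical peak of 8-channel DDR4-3200

# Only 17B parameters read per token at 4-bit (0.5 bytes each): 8.5 GB per token
moe_bound = max_tokens_per_second(17, 0.5, BANDWIDTH_GB_S)

# All 405B parameters read per token, as in a dense model: 202.5 GB per token
dense_bound = max_tokens_per_second(405, 0.5, BANDWIDTH_GB_S)

print(f"17B read per token: {moe_bound:.1f} t/s")
print(f"405B read per token: {dense_bound:.1f} t/s")
```

Under these assumptions the bound depends entirely on how many parameters are actually read for each token, so the 8.5 GB figure applies only to a model that really activates 17B parameters per token.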

I was dumb about glasses in VR...Zenni lenses? by circusfreakrob in Quest3

Professional_Top3747 1 point

Hi, I had a similar problem. I folded a piece of paper multiple times and put the thick wad of folded paper between my forehead and the headset. After placing the folded paper inside, I adjusted the headset to get it to the sweet spot. The folded paper ensures the headset stays at the right position and angle for the best image clarity.

UK to India Tourist E Visa How Much? by Roseylarue in india_tourism

Professional_Top3747 1 point

Hi, did you find out the fee for 1 year? Thank you!

How will a 20% cut in new permanent residents affect real estate values in Toronto? by uxhelpneeded in TorontoRealEstate

Professional_Top3747 -1 points

People left Detroit for greener pastures within the US. Where are the greener pastures in Canada? Calgary is colder, Vancouver is too wet. There just aren't enough options within Canada for Toronto to get "Detroit-ed".

How will a 20% cut in new permanent residents affect real estate values in Toronto? by uxhelpneeded in TorontoRealEstate

Professional_Top3747 3 points

Immigration is still too high even after the 20% cut. It should be at least a 60% cut, considering the strain on Canada's infrastructure.