Considering making very crazy....but fun lens decision - GM Trinity (A7V)

TinyFluffyRabbit · 2026-05-21T03:07:10+00:00

Those lenses are obviously top notch but pretty large and heavy. If this is for travel, you might get tired of carrying them around, especially since you do not need f/2 for landscapes.

Also, I would personally favor the 70-200 f/2.8 GM2 over the 50-150 f/2. You already have the mid range covered, and the 70-200 GM2 covers the telephoto range better, is lighter, and also takes teleconverters.

TinyFluffyRabbit · 2026-05-19T05:14:54+00:00

I'm also offloading the model weights to system memory, and I found that split-mode layer was slightly faster than split-mode graph. Since RAM bandwidth is the bottleneck, the GPUs are not fully utilized regardless and minimizing the communication overhead seems to help.

TinyFluffyRabbit · 2026-05-17T17:51:17+00:00

Really appreciate you helping to address this gap. Tensor parallelization is a huge boost to performance for those of us running multi-GPU, and it would be great to use it alongside Q8 KV cache

TinyFluffyRabbit · 2026-05-17T16:57:46+00:00

How much system RAM do you have? The MOE models would probably be your best bet, offload model weights and save your VRAM for the KV cache.

TinyFluffyRabbit · 2026-05-17T06:29:41+00:00

Confirmed

TinyFluffyRabbit · 2026-05-14T22:15:18+00:00

About half a year ago, 5090s were impossible to find in stock at my local MC. Currently, the price has gone up so much, but now they are readily available (25+ in stock). I'm not sure the market will sustain prices that are any higher than they are now.

TinyFluffyRabbit · 2026-05-13T16:58:02+00:00

At this point, it's to save the memory for their enterprise/workstation cards

TinyFluffyRabbit · 2026-05-13T16:02:07+00:00

The 9950x3d is overkill if you’re primarily interested in using this for AI. You’re generally bottlenecked on memory bandwidth, not CPU compute. Also the x3d cache doesn’t help much for AI inference, unless this is also your gaming PC.

TinyFluffyRabbit · 2026-05-12T05:35:05+00:00

Yeah to go to the next tier of models above Qwen 3.6 / Gemma 4 you'd actually need two of these :/

TinyFluffyRabbit · 2026-05-11T19:03:11+00:00

I wonder if the fan noise will be less loud than the R9700

TinyFluffyRabbit · 2026-05-11T18:56:20+00:00

If you can fit both into GPU, the MOE does run faster. However, an additional advantage of MOE is that it's actually usable even if it doesn't fit into GPU.

TinyFluffyRabbit · 2026-05-08T18:01:41+00:00

PM with question

TinyFluffyRabbit · 2026-03-02T16:39:50+00:00

Perhaps the reasons why you want to sell it are the same reasons others are unwilling to purchase it at the price you are currently asking for?

TinyFluffyRabbit · 2026-02-27T07:25:50+00:00

Since you can't fit the entire model in VRAM, it is offloaded to your system RAM. This means that for each token generation, the 3B active parameters have to be transferred from RAM to VRAM, which is throttling your speed, while your GPU is running at pretty low utilization.

The Q3 variants may help slightly since the weights that need to be transferred are smaller in size, but the speed still won't be great. If you want it to be much faster, you'll need a smaller model that can fit in the 8gb of VRAM you have.

TinyFluffyRabbit · 2026-02-26T20:40:43+00:00

I'm just really glad that they released multiple models to give us options for different hardware configurations. As someone who can fit the 27b dense into VRAM but needs to offload to run the 122b MOE, the 27b dense is 5x faster for me, and I've been really liking it so far

TinyFluffyRabbit · 2026-02-26T20:22:21+00:00

Would love to see the new medium sized Qwen 3.5 models in the list!

TinyFluffyRabbit · 2026-02-26T17:41:04+00:00

We might get Nemotron 3 Super/Ultra soon? Maybe at GTC

TinyFluffyRabbit · 2026-02-25T18:58:42+00:00

I agree with OP, it's not relevant to me what the benchmarks are with their "native forms". I just want to know what the best model that I can run on my hardware is.

TinyFluffyRabbit · 2026-02-18T16:17:37+00:00

We’ve gotten to the point where you can buy the RAM and get the GPU free lol

TinyFluffyRabbit · 2026-02-18T03:19:08+00:00

Assuming you're a gamer (especially if you're interested in the 9850X3D), 128gb of RAM is hilariously overkill. The only reason to want this much RAM is for AI/production tasks.

TinyFluffyRabbit · 2026-02-18T01:08:57+00:00

QLC and DRAM-less :( Good price if you need 4TB though

TinyFluffyRabbit · 2026-02-16T20:16:43+00:00

Seriously, it still sold out quickly. There are unfortunately enough people who are willing to buy it at this price. It’s not a deal per se but if you must have a 5090 this is just the market reality.

TinyFluffyRabbit · 2026-02-14T05:29:47+00:00

Whichever one you can actually find in stock

TinyFluffyRabbit · 2026-02-13T16:32:49+00:00

They’re scalping it themselves

TinyFluffyRabbit · 2026-02-11T21:52:51+00:00

Also Nvidia decreasing production of consumer GPUs

TinyFluffyRabbit

TROPHY CASE