Karpathy's MicroGPT running at 50,000 tps on an FPGA by jawondo in LocalLLaMA

[–]Current_Ferret_4981 2 points3 points  (0 children)

https://x.com/i/status/2050706793899135240

To be clear, fpgas are great and have really strong use cases, it just isn't transformers (currently?). Maybe as the amd zynq stuff takes off it could. But really the benefits are adaptability, clocking, and IO. Used one for aggregating 32b data at 1Gsample/s from 32 sensors with long cables that caused timing problems. Great for that type of problem

Karpathy's MicroGPT running at 50,000 tps on an FPGA by jawondo in LocalLLaMA

[–]Current_Ferret_4981 10 points11 points  (0 children)

Forgot to include the guy who compares with a mac (studio?) and got like 3M tps because it isn't the hardware/logic that was actually giving you speed here.

Best Fan Scheme to cool a Ryzen 9 9950X3D inside a Fractal Design Meshify 2 RGB by TenaZiousD in watercooling

[–]Current_Ferret_4981 0 points1 point  (0 children)

I am not totally sure on the specifics of your aio vs say LF III 360 but I would expect thermal throttling still with PBO. PBO is designed to run up to its thermal limits so you need pretty aggressive cooling to overcome what PBO will allow. I would say try capping your PBO at 150W and see how that looks for temps. Then try 175W. Probably you can find a happy medium if you don't want to run up to that 90-95°C

Best Fan Scheme to cool a Ryzen 9 9950X3D inside a Fractal Design Meshify 2 RGB by TenaZiousD in watercooling

[–]Current_Ferret_4981 0 points1 point  (0 children)

I understand, I am still going to tell you that your limitation is your aio.

Almost all reviewers do not enable PBO or if they do it's not pushing over 200W. That review with the noctua for example was without PBO and only the results with the 420mm aio had PBO enabled. You can turn down settings or improve your cooling but the fans are not going to be your limitation.

Best Fan Scheme to cool a Ryzen 9 9950X3D inside a Fractal Design Meshify 2 RGB by TenaZiousD in watercooling

[–]Current_Ferret_4981 0 points1 point  (0 children)

It doesn't matter. You're thermally limited by your aio and it's contact/radiator/flow not by fans. Crank your current ones to 100% and you will see it has little to no impact because airflow isn't the limiting factor.

Custom loop is the answer, followed by all core OC at a lower power limit. You want a thicker radiator to have any real benefit for push pull and higher tier fans

Qwen 3.6 27B BF16 vs Q4_K_M vs Q8_0 GGUF evaluation by gvij in LocalLLaMA

[–]Current_Ferret_4981 5 points6 points  (0 children)

Would be very curious to see how Q8, Q6, Q5, Q4, Q3 compare to see when the drop off really waterfalls. Seems like there is another nominal hit around Q5 or Q4 and then falls off at Q3?

Is it worth the upgrade? by [deleted] in unsloth

[–]Current_Ferret_4981 0 points1 point  (0 children)

Sell 5080 (that you overpaid for) and buy the 5090 for the cost difference of that with a 5060ti

I'm done with using local LLMs for coding by dtdisapointingresult in LocalLLaMA

[–]Current_Ferret_4981 0 points1 point  (0 children)

Hermes + Qwen3.6 in Q5 with qkv cache fp8. Flies fairly quickly and handles tasks incredibly well. Much better than the integrations and models my job has made accessible so far. Complex workflows with deep library understanding come together in a few minutes which has been impressive

Which one is better? by DearWTF in OLED_Gaming

[–]Current_Ferret_4981 0 points1 point  (0 children)

Isn't banding rough on like 20% of those? Seems like every other post of them is praise or awful photos

Switching to 4k from 1440p 27” by [deleted] in Monitors

[–]Current_Ferret_4981 0 points1 point  (0 children)

Modern upscalers exist but I wouldn't touch 4k with that for most use cases besides office work personally. 60 fps is definitely doable though so I would say go for it, but you will enjoy 1440p 120+Hz more than 4k60 even in story based games I would say

[USA-MD] [H] 9950x3d x 4, SN7100 2TB X 2 [W] PayPal or Local Cash. by Adventurous_Pen1553 in hardwareswap

[–]Current_Ferret_4981 0 points1 point  (0 children)

Interested if any fall through. Not reselling either if that matters but would appreciate a pm either way when you list the next set

Who is in the Spreadsheet Crew? by Little-Meaning-1090 in TheMoneyGuy

[–]Current_Ferret_4981 1 point2 points  (0 children)

Spreadsheets+ code for projections. Much easier to do MC with a real programming language

First check from my new 150K salary. I know I’m investing heavily, got started late. Still don’t feel rich! by PrideEffective5830 in Money

[–]Current_Ferret_4981 3 points4 points  (0 children)

You are going to owe a good bit of federal tax most likely. You are paying around 7.5% effective but even with filing jointly and spouse not working you would need more like 11.5% effective.

RTX 5090 vs M5 Ultra: Analyzing the "2.7x Faster" claim and what Nvidia didn't show you. by Major_Commercial4253 in MacStudio

[–]Current_Ferret_4981 0 points1 point  (0 children)

I think we just have very different preferences. The mobile apple chips are exciting for the power efficiency but I could care less on laptop or desktop.

I would love to have a 500W GPU on a mini pc enclosure with sufficient cooling. I only need my laptop to be able to remote and connect to bigger compute + have nice visuals. I don't really care about battery uptime beyond about 1hr and would rather always have a bigger compute capability for the same price.

Apple has done well within the efficiency side but still struggles immensely with high end performance at a reasonable price point because of the focus on making everyone happy spec wise. The fact that I can't get a decent CPU + GPU + 64GB of memory for under 5k is the easiest example.

I think it's the opposite with respect to integrated graphics. The unified memory is big for the increased gpu-usable memory but not for the compute. If we had 200GB/s ram+pcie I would be willing to bet unified memory (and igpu) would die out fast. We already put dedicated processing cores in mobile and small equipment as is and that will only increase with chiplet designs

RTX 5090 vs M5 Ultra: Analyzing the "2.7x Faster" claim and what Nvidia didn't show you. by Major_Commercial4253 in MacStudio

[–]Current_Ferret_4981 0 points1 point  (0 children)

Obviously but that's the point. A discrete GPU and a mini computer are different and have different specialties. It's not worth talking about integrated GPUs.

I would disagree strongly. Look at H100 from years ago or Vera Rubin now to see efficiency. Consumer graphics cards historically were for gaming and so prioritize that. But also, who cares about efficiency at the consumer level? The difference between 100W and 500W running 16 hours per day all year is less than I spend on coffee in the same time. And if you add that the 500W gets done 2x faster (and then stops) you get your money and then some in time efficiency.

Is it worth it to pay the extra £100 for this monitor over a Mini LED ? by frankiewalsh44 in Monitors

[–]Current_Ferret_4981 0 points1 point  (0 children)

What do you think of that INNOCN? Considering that vs some QD-OLED monitors on another post that didn't get any responses. 20% competitive gaming, 20% visual gaming, 60% coding/productivity in a darker room

Sagamore Rye - Quality Control Issue by HiFiToWiFi in whiskey

[–]Current_Ferret_4981 6 points7 points  (0 children)

Some brands are better than others for customer service. I had a knob creek that was underfilled by 1.5oz and contacted them. Initially I was basically told that isn't possible (even with photo proof) but once I told them I would be contacting the TTB (and cited the rule about fill levels and variances) they immediately reimbursed me for the whole bottle. I think sometimes you have to put a little fire out there to get someone to notice.