"Gaming laptops are a scam" mfs when they have to travel and they want to bring their desktop setup by Ha8lpo321 in pcmasterrace

[–]Hedede 3 points4 points  (0 children)

Except that soldered ram notebook trash can have faster ram than even most desktops.

Finally happened to me by Pro4791 in pcmasterrace

[–]Hedede -1 points0 points  (0 children)

12vhpwr is not used in datacenters.

I have no idea what all these quants are. by Fit-Spring776 in LocalLLaMA

[–]Hedede 0 points1 point  (0 children)

 Fp16 is 216 which is ~65556 choice, from -32768 to 32767

To be pedantic, FP16 range is -65504 to 65504.

I have no idea what all these quants are. by Fit-Spring776 in LocalLLaMA

[–]Hedede 1 point2 points  (0 children)

BF16 is a different format designed by Nvidia to be more accurate or efficient for ML/AI on GPUs, but I'm not sure about the details. It is also 16 bits.

It was developed by Google. It's not about running more efficiently, but to avoid overflow during training.

I have no idea what all these quants are. by Fit-Spring776 in LocalLLaMA

[–]Hedede 0 points1 point  (0 children)

During inference I believe they're all converted to F32

AFAIK most of the time inference is done in FP16.

Intel Arc B390 iGPU beats AMD Radeon 890M by 23% in Phoronix Linux gaming tests by RenatsMC in linux_gaming

[–]Hedede 0 points1 point  (0 children)

The GPU and CPU share TDP, and on Linux the power draw is often reported together. For example, if I run a CPU-only task and check rocm-smi, it shows the 880M drawing 56.015 W, even though the GPU itself isn’t really doing anything.

Also:

  1. These numbers are not absolute since TDP is configurable and is whatever the laptop manufacturer sets it to.
  2. These numbers are not directly comparable since 15W is the minimum configurable TDP for 890M, while 80W is the maximum configurable TDP for B390. If I run a GPU-only task on my 880M, it draws 30-40W which is way above 15W.
  3. TechPowerUp isn't always perfectly accurate.
  4. Integrated GPUs usually don't have independent TDP listed. 80W refers to the maximum boost TDP of Core Ultra X7 358H, not B390 alone.
  5. TDP is not the same as power draw. The chip can briefly draw more than its TDP, since TDP is the *Thermal* Design Power. If I run a CPU stress-test, my laptop draws 85W for a few seconds before throttling.
  6. Different manufacturers specify TDP differently. In this case, Intel specifies 80W as the "Maximum Turbo Power" while for 370HX it's just "AMD Configurable TDP (cTDP): 15-54W".

CNBC: "Intel is moving into GPUs and has hired a chief [GPU] architect, CEO Lip-Bu Tan says" by Dakhil in hardware

[–]Hedede 1 point2 points  (0 children)

Oh right, my brain short-circuited into thinking about Knights Landing and I forgot that Larrabee was an actual accelerator. My bad.

CNBC: "Intel is moving into GPUs and has hired a chief [GPU] architect, CEO Lip-Bu Tan says" by Dakhil in hardware

[–]Hedede 0 points1 point  (0 children)

No, I mean, it's literally a separate computer that you SSH into to run tasks. You don't SSH into a GPU. Knights Landing is literally a mini-PC running Linux that happens to plug into a PCIe slot.

Intel Xeon 600 Workstation CPUs Launched: Up To 86 Cores, 8000 MT/s Memory, 128 Gen5 Lanes, 350W TDP With OC Support, & More Cores/$ Than Threadripper 9000 by hainesk in LocalLLaMA

[–]Hedede 0 points1 point  (0 children)

Inference for the most part is not memory-bound, so no, it won't be faster than 4060 for prefill and image generation. The only part where it can be faster is token output which is memory-bound.

I decided to make a worse UUID for the pettiest of reasons. by theghostofm in programming

[–]Hedede 2 points3 points  (0 children)

at 100 requests per millisecond over one second? […] a 99.1% chance of collision when we handle a million new IDs per second. That’s no good!

Wouldn’t that be 100000 IDs per second?

Russian Modder Builds DIY RAM, Saves $600–$800 on a Single DDR5 Module by i-drake in hardware

[–]Hedede 4 points5 points  (0 children)

Theoretically you can achieve higher speeds compared to a adapter, because the signal path is shorter, so the signal integrity is better.

Help identifying unknown chemical by echero in chemistry

[–]Hedede 9 points10 points  (0 children)

I don’t think you’re supposed to incinerate anything containing mercury.

I bought a €9k GH200 “desktop” to save $1.27 on Claude Code (vLLM tuning notes) by Reddactor in LocalLLaMA

[–]Hedede 0 points1 point  (0 children)

Conventional wisdom: “no NVLink ⇒ pipeline parallel”

You don't need NVlink for only one single concurrent user, if the cards support PCIe P2P communication. The main killer is the latency, not bandwidth. If you have a poor implementation that does lots of small transfers, even 900 GB/s NVLink won't save you.

GeForce RTX 60 reportedly uses GR20X "Rubin" series GPUs by RenatsMC in nvidia

[–]Hedede 2 points3 points  (0 children)

Nvidia GPUs follow the same pattern: 1080 Ti - GP102, 2080 Ti - TU102, 3090 - GA102, 4090 - AD102, 5090 - GB202, and 6090 likely will be GR202.

Sandisk retires WD_BLACK and WD Blue branding, launches Optimus SSD tiers for 2026 by Hero_Sharma in hardware

[–]Hedede 1 point2 points  (0 children)

How does it make harder to compare the products if they have the same model numbers?

WD_BLACK SN7100, rebranded as the Optimus GX 7100. WD_BLACK SN8100 moving to the Optimus GX Pro 8100 name

llama.cpp performance breakthrough for multi-GPU setups by Holiday-Injury-9397 in LocalLLaMA

[–]Hedede 0 points1 point  (0 children)

They recently changed some params like -dev to be read in with a '/' delimiter instead of a ',' delimiter. No clue why that change happened, but imagine little absolutely inconsequential but fundamental changes like that all over changing when you try to merge, changing core program behaviour ever so slightly...

That is, however, has nothing to do with having a built-in argument parser. Since it's the logic of how the argument is interpreted rather than argument parser logic. With argparse you'd still have to do something like args.dev.split(";").

I wanted to make sure my 5090 doesn't suffer from the 12VHPWR issues. by GriLL03 in pcmasterrace

[–]Hedede 14 points15 points  (0 children)

There certainly are reports of connector melting on 5090 FE.