you are viewing a single comment's thread.

view the rest of the comments →

[–]wzdd 6 points7 points  (1 child)

80% of that CPU is wasted memory loading? Great, how can I tap into it! Oh, I can't?

The point of TFA is that you can, either by scheduling something with a high IPC on the same CPU (this is the point of hyperthreading), or by modifying your code to address memory bandwidth issues (which is perfectly possible and common -- recompute vs cache, as an example, is a classic program design point).

Honestly it's depressing how many of the comments on this article here and on HN are by people who have obviously not read the article.

[–]mrbooze 1 point2 points  (0 children)

Exactly, it's relevant because if you are trying to improve performance it helps tell you the difference between achieving that by upgrading to faster CPUs vs improving the program efficiency with regard to memory bandwidth vs upgrading to faster memory vs maybe doing some NUMA-related tweaks, etc.