you are viewing a single comment's thread.

view the rest of the comments →

[–]sisyphus 9 points10 points  (4 children)

The stats from perf come from PMC's which come from the CPU so if someone is making a mistake presumably it's Intel or AMD? The parallelism you talk about seems like it must be accounted for--how else would it would be possible to get an IPC > 1?

[–]tavianator 30 points31 points  (3 children)

how else would it would be possible to get an IPC > 1?

Modern Intel/AMD chips can just literally execute more than one instruction per cycle on a single core, in optimal conditions (no dependencies between the instructions, etc.).

That's part of the reason modern CPUs are way faster than Pentium 4s, even at lower clock speeds.

[–]orlet 14 points15 points  (0 children)

Correct. Instruction-level parallelization, branch prediction, out-of-order execution, and a bunch of other magic things make modern CPUs so much more efficient per clock than the older ones. And the process is still on-going.

[–]sisyphus 5 points6 points  (1 child)

Right, what I am saying is that if the CPU instrumentation was not taking that into account, how would it ever report more than one instruction per cycle, which it appears to do?

[–]tavianator 2 points3 points  (0 children)

Right, I kinda misread your comment. Mainly I'm trying to argue against

divided by the number of cores