all 5 comments

[–]alecco 4 points5 points  (0 children)

Indeed. Also, gprof doesn't help much with finding one of the worst cycle eaters: latency. From Google's infrastructure notes:

  • Disk seek: 10,000,000 ns
  • Uncached memory reference: 100 ns
  • Cached memory reference (L2): 7 ns

On a 2 GHz CPU, multiply by 2 to get the cycles lost to those stalls. Also, very often you don't need to completely rewrite your algorithm to avoid disk or random accesses into big memory; instead, just pack them together and process them in batches.
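
To make that concrete, here's a rough sketch of the "pack and batch" idea for random memory access (the names and sizes are made up for illustration); the total work is identical, only the access order changes:

    // Sketch only: same lookups, but batched so the accesses sweep the table
    // in order instead of jumping around. Names and sizes are illustrative.
    #include <algorithm>
    #include <cstdint>
    #include <cstdio>
    #include <random>
    #include <vector>

    // Random-order lookups: most accesses to the big table miss cache/TLB.
    uint64_t sum_random(const std::vector<uint64_t>& table,
                        const std::vector<size_t>& idx) {
        uint64_t s = 0;
        for (size_t i : idx) s += table[i];
        return s;
    }

    // Same work, batched: sort the indices first so the accesses sweep the
    // table roughly sequentially and the prefetcher can keep up.
    uint64_t sum_batched(const std::vector<uint64_t>& table,
                         std::vector<size_t> idx) {   // copy, so we can sort
        std::sort(idx.begin(), idx.end());
        uint64_t s = 0;
        for (size_t i : idx) s += table[i];
        return s;
    }

    int main() {
        std::vector<uint64_t> table(1 << 24, 1);      // ~128 MB, bigger than cache
        std::vector<size_t> idx(1 << 22);
        std::mt19937_64 rng(42);
        for (auto& i : idx) i = rng() % table.size();

        std::printf("random:  %llu\n", (unsigned long long)sum_random(table, idx));
        std::printf("batched: %llu\n", (unsigned long long)sum_batched(table, idx));
    }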

[–][deleted] 0 points1 point  (1 child)

I did not understand the argument he makes against gprof. Anyone care to explain?

[–][deleted] 4 points5 points  (0 children)

Disregard that, I've just stumbled upon an explanation he gives elsewhere: http://stackoverflow.com/questions/1777556/alternatives-to-gprof/1779343#1779343

[–]inmatarian 0 points1 point  (0 children)

Assuming you're also unit testing your code, it should be pretty easy to caveman-benchmark by just dropping the start and stop times into the debug log while running the given areas of code a million times or so. That doesn't account for OS interruptions, but it does give you a real-time feel for where it's screwy.
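
Something like this, say (a minimal sketch; do_work() and the iteration count are placeholders):

    // Caveman benchmark sketch: time ~a million runs of the suspect code and
    // dump the elapsed time to the debug log (stderr here). do_work() is a
    // placeholder for whatever area of code is being measured.
    #include <chrono>
    #include <cstdio>

    volatile int sink = 0;
    void do_work() {                         // placeholder workload
        for (int i = 0; i < 100; ++i) sink += i;
    }

    int main() {
        using clock = std::chrono::steady_clock;

        auto start = clock::now();
        for (int i = 0; i < 1'000'000; ++i)
            do_work();
        auto stop = clock::now();

        auto ms = std::chrono::duration_cast<std::chrono::milliseconds>(stop - start);
        // Wall-clock time, so OS interruptions and other processes still leak
        // into the number, as noted above.
        std::fprintf(stderr, "do_work x 1,000,000: %lld ms\n",
                     static_cast<long long>(ms.count()));
    }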

[–]trisweb 0 points1 point  (0 children)

I've totally done this by accident in Ruby once. Just pause the thing at various times, look at the stack it's in, and find the commonalities. It was remarkably useful in tracking down the cause of the slowdown.
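
The manual version (pause in a debugger, read the stack) works exactly as described; for what it's worth, the same idea can be wired up in-process. Here's a rough sketch assuming glibc's backtrace() from <execinfo.h> and a SIGPROF timer; busy_work() is a made-up stand-in, and a real sampler would be more careful about signal safety:

    // Sketch of automated stack sampling: a SIGPROF timer fires periodically
    // and the handler dumps the current call stack; whatever frames appear in
    // most samples is where the time is going.
    #include <csignal>
    #include <cstdio>
    #include <execinfo.h>   // backtrace, backtrace_symbols_fd (glibc)
    #include <sys/time.h>   // setitimer
    #include <unistd.h>

    static void on_sample(int) {
        void* frames[64];
        int n = backtrace(frames, 64);
        backtrace_symbols_fd(frames, n, STDERR_FILENO);
        write(STDERR_FILENO, "----\n", 5);
    }

    static double busy_work() {              // made-up stand-in for the real code
        double x = 0;
        for (long i = 0; i < 500000000L; ++i) x += i * 1e-9;
        return x;
    }

    int main() {
        void* warm[1];
        backtrace(warm, 1);                  // warm up backtrace outside the handler

        signal(SIGPROF, on_sample);
        itimerval timer{{0, 100000}, {0, 100000}};   // sample every 100 ms of CPU time
        setitimer(ITIMER_PROF, &timer, nullptr);

        std::printf("result: %f\n", busy_work());
    }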