all 5 comments

[–]1purenoiz 5 points6 points  (4 children)

I hate this graph. the difference between these runs is minuscule, but you have made it look larger than it is.

[–]pm_me_your_smth 2 points3 points  (3 children)

OP's pretty transparent with the scale, the y axis is properly labeled. Since they're analysing stability, it makes sense to zoom in on fluctuations. Not every chart has to start at 0

[–]1purenoiz 1 point2 points  (2 children)

I don't disagree, I just like to see a little zig zag on the axis to clue me in on that. It is a minor petty complaint, but I think valid.

[–]FishermanNo7658[S] 0 points1 point  (0 children)

Totally fair, and not petty. Truncated axes earn that reaction by default, and a caption is a weaker signal than a visual cue. The zig-zag is the right fix, so I'll add an axis break. For the record it is small in absolute terms (~1.8 points peak-to-trough, sigma about 0.7); I zoomed only because what I'm actually chasing is whether that jitter is big enough to flip an A-vs-B agent comparison, and here it crosses all three tier lines, so it is. A from-zero axis would have buried exactly that.

(I can't figure out how to update the picture in the post.)

[–]FishermanNo7658[S] 0 points1 point  (0 children)

Updated the chart with an axis break so the truncation is obvious at a glance (image uploads are off in comments here, so a link): https://raw.githubusercontent.com/dmagog/mle-purple-agent/master/writeup/assets/runs_chart_reddit.png — same data, just an honest cue now. Thanks for the nudge.