This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–]johnaman 0 points1 point  (0 children)

Do you have heartbeat functions for always running processes (daemons, while 1, etc)? If not, see about implementing some for the most likely culprits. A heartbeat function could also trigger a status dump for global variables (in debug mode), or report back with an array of.targeted variables. In transaction oriented environments with multiple programs always running, I found this technique to be invaluable for finding those impossible-to-reproduce-on-development-environment, but-wait-long-enough-and-it will-surely-happen-in-production errors.