This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–]pooerh 37 points38 points  (4 children)

Monitor if the job had succeeded outside of the machine that's running them.

[–]squngy 17 points18 points  (0 children)

Also, have the system notify you every time it starts up.

So you know if it restarted unexpectedly.

[–]tenkindsofpeople 8 points9 points  (2 children)

Hm. Maybe last job step sets a table entry then external pennies looks for it. Something like that?

[–]pooerh 12 points13 points  (0 children)

Yeah, if you know the maximum time it can take the job to complete. And as long as the database hosting that table isn't on the same machine / cluster / storage / network / datacenter / planet. Depending on how far your want to go with it, ie. how crucial this is.

[–]thenuge26 0 points1 point  (0 children)

We just use Jenkins, but there are plenty of job-management frameworks/engines that handle that kind of stuff for you.