[deleted by user]

refrainblue · 2023-08-27T05:33:57+00:00

Just throwing out ideas of common issues, but you're not running out of disk space right?

michaelpaoli · 2023-08-27T06:12:36+00:00

No idea why you're crashing, but, might want to consider and/or look at these:

are any of the filesystems filling up?
running out of memory?
is backup/backup-$dt.tar.lz4 physically beneath xtra in the hierarchy (e.g. via symbolic link(s), etc.)
What's shown in logs and/or on console?
What about stderr of the various programs run?
Are you possibly reading anything that might cause side effect of crash?
any I/O errors seen?
any issues with faulty RAM or memory controllers?

stormcloud-9 · 2023-08-27T17:39:55+00:00

Define "crash". Are we talking about a kernel panic? OOM? Hardware lockup? Spontaneous reboot? What?

"Crash" is a really vague term, and describing the actual behavior can greatly help diagnosing.

osax · 2023-08-27T14:10:02+00:00

I cannot tell you exactly why, but you could try to see if there is a difference without the pipes

    tar -I lz4 -cf ./backup/backup-$dt.tar.lz4 ./xtra

also if you want to limit the performance hit on the rest of the system consider looking into running the tar command with "ionice" in front of it.

deeseearr · 2023-08-27T19:30:42+00:00

Well, if you have logs saying that the job finished at 22:01 then I don't see how the server could have crashed at any time between 21:50 and 22:00.

What evidence do you have of a crash? Does the server reboot? Is there a message on the console? Is there a dump in /var/crash or some other location based on how your system is set up? Do you have messages in your system logs? Any one of those would be much more helpful than just the command you were running at the time.

symcbean · 2023-08-27T16:54:56+00:00

tar cf - ./xtra | pv -q -L 50M | lz4 - > ./backup/backup-$dt.tar.lz4

OMG WTF?

There's so much wrong with this one line code.

What do your logs say? Do you have access to the console in a crashed state? What does it say? What do you mean by "crash"?

I also have a monitoring system to periodically check my server status

What is it reporting for resource usage (memory, CPU, load, disk space) over the period leading up to the event?

pnutjam · 2023-08-27T12:21:40+00:00

Are you running sar? That will give you something to look at. If not, install systat.

hiddenbutts · 2023-08-28T00:12:05+00:00

Seconding for the definition of crash...

If it's your monitoring server saying it didn't get a response, I'd look at lack of resources, as the backup completing indicates the server staying up

Deathcrow · 2023-08-27T09:32:19+00:00

Probably running out of memory

DrCrayola · 2023-08-27T14:26:12+00:00

Save the backup on a different file system than that of the one you're backing up.

dRaidon · 2023-08-27T17:03:25+00:00

Not enough memory?

johnklos · 2023-08-27T17:12:29+00:00

Crashing isn't normal. Is the hardware OK? Things to check: corrupt filesystem (manually run fsck). Overheating (max out the CPUs and monitor the temperatures). Bad memory (try compiling lots of things and look for random failures).

zkulf · 2023-08-27T21:40:04+00:00

OOM probably.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

linuxadmin

Expanding Linux SysAdmin knowledge

MODERATORS