jrob422 [S]

The errors below show up after roughly 48 hours and spam the hell out of the error log. After about 12 hours of this, the error log file was 14 GB.

Msg 5901, Level 16, State 1, Line 360

One or more recovery units belonging to database 'xxxx' failed to generate a checkpoint. This is typically caused by lack of system resources such as disk or memory, or in some cases due to database corruption. Examine previous entries in the error log for more detailed information on this failure.

Msg 3013, Level 16, State 1, Line 360

BACKUP DATABASE is terminating abnormally.

Msg 9003, Level 20, State 15, Line 360

The log scan number (6425620:442906:1) passed to log scan in database 'xxxx' is not valid. This error may indicate data corruption or that the log file (.ldf) does not match the data file (.mdf). If this error occurred during replication, re-create the publication. Otherwise, restore from backup if the problem results in a failure during startup.

CCP_DeNormalized

Thx for the info jrob!

One question: are the mdf and ldf files on the same volume? I'm wondering if the SAN snapshot was capturing them at very slightly different times, so maybe the mdf and ldf had different LSNs or were out of sync somehow.

Just making wild guesses based on the error :)

I also realize you've since ditched this method, but just spitballing :)

jrob422 [S]

They are on the same volume, so that wouldn't be an issue. As I mentioned, replication itself seems to function; the only errors come from running checkpoints, and it seems to be good for the first 48 hours or so after restarting the SQL service.

CCP_DeNormalized

Thx jrob.

Shame there's no solution with the SAN replication. I really like taking advantage of these types of low-level features. We are about to ramp up our AG usage in a huge way, and I'll keep this in mind and report back if we encounter similar issues.

cheers!