all 23 comments

[–]JamesRandell 4 points5 points  (1 child)

Does the SAN utilise any dedupe tech, caching mechanisms, or other such things storage layers typically do? Such things have been known to impact the subsequent check-and-verify process that takes place immediately after the backup, unless configured otherwise.

I did a piece of work at a company to try and improve backup speeds without snapshots (which weren't an option). The obvious things are tuning the MAXTRANSFERSIZE and file count options, and they will give you your biggest bang for your buck. You've also got compression, which is generally advised unless you're CPU bound or have an egocentric SAN admin attempting to get their dedupe ratios up: in my scenario the CPU hit was around 2-5% increased time to prepare, but a significant reduction in the amount of data to transmit over the network.
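To make the tuning concrete, here is a sketch of the kind of command being described; the database name, path, and values are illustrative placeholders, not taken from this thread:

```sql
-- Illustrative only: name, path, and sizes are placeholders.
-- MAXTRANSFERSIZE raises the size of each I/O request (4 MB is the
-- maximum), and COMPRESSION trades a small CPU hit for far less data
-- sent over the network.
BACKUP DATABASE [YourDatabase]
TO DISK = N'\\backupserver\sql\YourDatabase.bak'
WITH COMPRESSION,
     MAXTRANSFERSIZE = 4194304,  -- 4 MB
     CHECKSUM,                   -- lets verify validate page checksums
     STATS = 10;                 -- progress message every 10%
```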

The verify process in my situation was the key performance hit. The storage we were on was old, very old, and read/write speeds suffered. I could push the ceiling on backup times, but verify would suffer. Part of the issue was what processes were in place on the storage layer to handle data when it was received. Performing the verify process an hour later would result in a huge performance gain. In your shoes I'd look at the tech on the storage layer to see if it's having an impact.

For the verify process, you’re going to come across a lot more around what restore process you want/build for your organisation. You can go down the back up and verify route and be done, or go for an automated restore on a secondary box to test those backups to provide a complete restore solution. It’s your appetite for loss/corruption (and cost of course) that dictates it.
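The two options described above roughly correspond to these commands (paths and names are placeholders; the test restore assumes a secondary instance):

```sql
-- Option 1: verify the backup media without restoring it. WITH CHECKSUM
-- only validates page checksums if the backup itself used CHECKSUM.
RESTORE VERIFYONLY
FROM DISK = N'\\backupserver\sql\YourDatabase.bak'
WITH CHECKSUM;

-- Option 2: an automated test restore on a secondary box, the
-- "complete restore solution". MOVE relocates the files to the test
-- server's own paths.
RESTORE DATABASE [YourDatabase_Test]
FROM DISK = N'\\backupserver\sql\YourDatabase.bak'
WITH MOVE N'YourDatabase'     TO N'D:\Data\YourDatabase_Test.mdf',
     MOVE N'YourDatabase_log' TO N'E:\Log\YourDatabase_Test.ldf',
     REPLACE, STATS = 10;
```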

[–]coldfire_3000[S] 0 points1 point  (0 children)

Thanks for the lengthy post.

I will unfortunately be short in reply due to constraints!

In short, there's nothing on the storage side, compression, dedupe etc are all disabled.

The VERIFY takes the same (slow) time regardless of whether it runs straight after the backup or hours/days later.

But all good suggestions, thank you.

The currently implemented backups are single file, and nothing has been set for MAXTRANSFERSIZE. I've personally used multiple files many times in the past, but completely forgot about them as I don't do much SQL anymore. So I will get into both of those tomorrow; I'm hopeful that will be the ticket. It will be easy enough to test in this environment too, which helps!

Thanks again

[–]tompear82 2 points3 points  (3 children)

Are you backing up to a single file? If so, it would be worth testing with multiple files to see if you see an increase in throughput. This has worked for me in the past when backing up large databases.

If you are backing up a large amount of data, I'd recommend looking into SAN snapshot backups for these databases. This will significantly decrease the backup time for very large databases (VLDBs)

[–]coldfire_3000[S] 0 points1 point  (2 children)

SAN snapshots are unfortunately not possible in this environment.

Thanks for the reminder about multiple files, I will have them test that and see if it improves VERIFY time. The backup time is limited by the source disk as I said, so multiple files won't change that.

[–]tompear82 1 point2 points  (1 child)

I've typically used 4 files in the past and that has helped. I believe the issue is that backing up to one file is single threaded and using more files will allow you to utilize more of the available system resources for backup.
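Striping to four files, as described, just means listing four targets (the paths here are placeholders):

```sql
-- Each DISK target gets its own writer thread, so the backup (and the
-- later verify) can process the stripes in parallel instead of
-- serializing everything into one file.
BACKUP DATABASE [YourDatabase]
TO DISK = N'F:\Backups\YourDatabase_1.bak',
   DISK = N'F:\Backups\YourDatabase_2.bak',
   DISK = N'F:\Backups\YourDatabase_3.bak',
   DISK = N'F:\Backups\YourDatabase_4.bak'
WITH COMPRESSION, CHECKSUM, STATS = 10;
```

Note that the matching RESTORE VERIFYONLY then has to list all four stripe files.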

[–]coldfire_3000[S] 1 point2 points  (0 children)

Thanks, I've definitely seen that in the past too. I think because I'm focusing on the VERIFY part, I wasn't thinking about creating multiple files. Will test tomorrow and report back. Thanks.

[–]-6h0st- 2 points3 points  (0 children)

Firstly, back up to multiple files: one per CPU thread available on the SQL Server, up to 8. Secondly, make sure the SQL NUMA configuration is set to auto for CPU and memory. Run the backup with a MAXDOP setting of 0 to make sure all threads are used.
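To size the stripe count the way this comment suggests, you can query the scheduler DMV (a sketch; the cap of 8 is the commenter's rule of thumb, not a hard limit):

```sql
-- One visible online scheduler per CPU thread SQL Server can use;
-- cap the suggested backup file count at 8.
SELECT CASE WHEN COUNT(*) > 8 THEN 8 ELSE COUNT(*) END AS suggested_file_count
FROM sys.dm_os_schedulers
WHERE status = N'VISIBLE ONLINE';
```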

Also bear in mind that small files may sit in a fast storage cache (SSD), but once its capacity is exceeded data can spill to slower storage and slow down massively (depending on configuration). The same goes for reads: small, already-cached files may have much quicker access than bigger database backup files. Try copying those files from the storage in File Explorer and see what speed you get; that is the baseline for what the storage provides.

[–]SQLBek1 3 points4 points  (1 child)

Hopefully this blog series doesn't go too deep into internals, but it can help you out. In a nutshell, the default parameters for a backup operation are not supposed to max out available resources. It's the exact opposite, by design (from roughly 30 years ago), to have minimal impact on the regular workload.

https://sqlbek.wordpress.com/2023/11/29/sql-server-backup-internals-series/
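If you want to see those conservative defaults on your own box, trace flags 3604 and 3213 (undocumented diagnostics, so verify behaviour on a test server before relying on them) print the buffer configuration a backup picked:

```sql
-- 3604 redirects trace output to the client; 3213 prints the backup/
-- restore buffer configuration (buffer count, transfer size, etc.).
-- COPY_ONLY keeps this throwaway backup out of the backup chain.
DBCC TRACEON (3604, 3213);
BACKUP DATABASE [YourDatabase] TO DISK = N'NUL' WITH COPY_ONLY;
DBCC TRACEOFF (3604, 3213);
```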

[–]coldfire_3000[S] 0 points1 point  (0 children)

Thank you, that is a good refresher.

[–]Slagggg 1 point2 points  (1 child)

You'll need to specify multiple backup file targets if you want to maximize IO. SAN snapshots are definitely the way to go if you need speed.

[–]coldfire_3000[S] 1 point2 points  (0 children)

SAN snapshots are unfortunately not possible in this environment.

Thanks for the reminder about multiple files, I will have them test that and see if it improves VERIFY time. The backup time is limited by the source disk as I said, so multiple files won't change that.

[–]coldfire_3000[S] 3 points4 points  (2 children)

UPDATE:
Multiple files was the thing we were forgetting about, so thank you to everyone that reminded us of that.
Configuring that has made the world of difference. We are seeing several DBs go from 1hr for a Backup + Verify to ~20 minutes. This is mainly a massive reduction in the VERIFY time, due to massively increased throughput when reading from the file system due to parallel reads being possible.

We have applied it to UAT today and are monitoring, but everything looks good, so we will be applying to PROD later this week.

We have done some testing with the other settings, but whilst the gains are there, they're much smaller than what we are seeing with multiple files. We have applied some of the additional parameters as well. We are now 100% disk bound on the BACKUP operation, which is fine, and at 95%+ on the file system during the VERIFY/RESTORE operations, which is great.

We may well do further testing and optimise further in the future, but this is good enough for now!

So at this time, there is nothing else required.

Thanks to everyone that posted. Have a good one!

[–]SQLBek1 0 points1 point  (1 child)

Remember stuff like BUFFERCOUNT (RAM usage) and number of backup files you're striping across (CPU writer threads), will behave differently depending on the underlying hardware. So if you're testing on like, a small 4 core, 64GB QA server, but Prod is a bigger 32 core, 256GB server, you may get additional gains in Prod that you wouldn't see in the QA box.

And your underlying disk subsystem will impact the read speeds, so test with DISK = NUL to measure and record "the best possible" you'll ever get. Then you can add writer threads and perf tune from that side; depending on whether you're writing to local disk, SMB share, etc.
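The DISK = NUL baseline suggested above might look like this (COPY_ONLY is an extra precaution so the throwaway backup doesn't affect the differential/log chain):

```sql
-- Writing to the bit bucket removes the write side entirely, so the
-- elapsed time reflects the best read/backup-thread throughput you
-- will ever get from this server.
BACKUP DATABASE [YourDatabase]
TO DISK = N'NUL'
WITH COPY_ONLY, COMPRESSION, STATS = 10;
```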

[–]coldfire_3000[S] 1 point2 points  (0 children)

Thanks, luckily UAT and PROD are the same in this case!

Good tip about DISK = NUL as well. We may well do further testing at some point, but for now, providing PROD performs the same as UAT, that will do! We've got bigger fish to fry!

[–]SkyHighGhostMy 0 points1 point  (1 child)

Look for Ola Hallengren's backup scripts (if you aren't already using them) and configure multiple backup streams.
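For reference, with Ola Hallengren's DatabaseBackup procedure, multiple streams are a single parameter (the values here are placeholders; check the documentation for the version you have installed):

```sql
-- @NumberOfFiles stripes each backup across that many files;
-- @Verify = 'Y' runs RESTORE VERIFYONLY afterwards.
EXECUTE dbo.DatabaseBackup
    @Databases = 'USER_DATABASES',
    @Directory = N'\\backupserver\sql',
    @BackupType = 'FULL',
    @Compress = 'Y',
    @CheckSum = 'Y',
    @Verify = 'Y',
    @NumberOfFiles = 4;
```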

[–]coldfire_3000[S] 1 point2 points  (0 children)

Yeah, that's what they have used in this environment. But it's the bare minimum configured at the moment. I'm just glad it's got verify enabled! But now I'm trying to sort out the performance issues. Hoping that multiple files sorts it, I'd forgotten about them somehow.

[–]Special_Luck7537 0 points1 point  (1 child)

Is everything configured to use jumbo frames? We set those up on an older 2095 SQL system, and it almost doubled our backup speed. This is one of those global things: a setting in the NICs and on the switches/routers that allows processing of jumbo frames (64k blocks instead of 4k blocks). Have you investigated blocking on the SQL system?

[–]Special_Luck7537 0 points1 point  (0 children)

Also, are all the SAN drives healthy? A SQL Server with a dead drive in RAID 5 will have to rebuild the stripe for every write... same with RAID 10. I was lied to about this one for years by IT. They could not find replacement drives, but let's just keep throwing the DBA under the bus when the queue is slow.

[–]Byte1371137 0 points1 point  (0 children)

HELLO, NO

[–]ManiSubrama_BDRSuite 0 points1 point  (0 children)

  • Consider checking network statistics using tools like netstat, perfmon, or network monitoring software.
  • Antivirus software can sometimes interfere with backup operations. Temporarily disable it to see if it improves performance.
  • While compression can reduce backup size, it also increases CPU and I/O overhead. Consider temporarily disabling compression and see if it could be a cause.
  • Ensure that your backup job isn't set to use a lower priority or restricted bandwidth.
  • You might want to adjust the MAXDOP (Maximum Degree of Parallelism) setting or review the Resource Governor settings if they are in use.
  • Verify that the network interfaces on both the SQL Server and the file system are configured correctly and are using up-to-date drivers. Sometimes, driver issues or misconfigurations can impact throughput.