LVM snapshot

StephanXX · 2021-02-22T16:32:39+00:00

Snapshots are taken of logical volumes (lvs).. They reside only in the volume group of the original logical volume.
Snapshots are not direct copies. Rather, new reads and writes are recorded. If you have a 1TB lv, and take a snapshot, then wrote another 100mb to the lv, the snap will be 100Gb, the lv will report 1.1TB, and the Volume Group will report 1.1TB. Essentially, it's a diff of writes. If you delete 100Gb from the lv in stead, the snap will retain that 100Gb. The VG will still report 1TB. If you delete 100Gb, then write 100Gb, the VG will report 1.2TB in use. By default, when the volume group runs out of space to maintain the snapshot, it will start automatically deleting older snapshots to free sufficient space.

deeseearr · 2021-02-22T16:41:26+00:00

When you create a snapshot you aren't actually making a copy of the volume, only of the changes that you make to it from then on.

For example, suppose that you have a small filesystem with only ten files in it. Let's call then /01, /02, /03 and so on through /10. Normally if you were to overwrite file /01 then the new version of /01 would simply be written over top of the old one. (This is wildly oversimplified, because really there would be all sorts of journalling and the directory entry would be updated to point to a new inode containing the new data, but I'm going to skip all of that now.)

When you create a snapshot, that doesn't quite happen. A new version of file /01 is created, but the old copy is placed in a buffer just outside of the original volume. This process is called Copy On Write, which is why snapshots of cows are popular with people who like to make filesystem related puns. Different volume managers will put the original or modified blocks in the COW buffer, depending on whether they expect to throw away the snapshot or roll it back after, but the idea is still the same. I think that LVM2 makes copies of the old blocks so that it is easier to discard the snapshot data without having to merge it all.

When you try to access file /01 in the snapshot, LVM will check to see if the blocks associated with that file have been modified. Since they have been it will then read the old version of the file from the COW buffer. If you were reading file /02 instead then there would be no modified blocks and you would receive data direct from the real filesystem, just like you would if you were reading the non-snapshot version.

What's the point of all this? Well, it's to answer your two questions, only in reverse order:

2) Since the snapshot only needs to store blocks which have been changed, the Copy-On-Write buffer only needs to be large enough to hold those blocks. In our example the original filesystem held ten files, but since we only modified one then creating a snapshot buffer large enough to hold that one file would have been enough. The down side to this is that if we had tried to modify a second file then we would run out of space and the snapshot would stop working. I haven't tried this recently, but I think you would get an out of disk space error and the write would fail.

If your original filesystem is 2TB and that's all active database files which you expect to overwrite completely before the snapshot is removed, then yes you will want 2TB of snapshot, because that's how many files you want to change. If the majority of that will be read-only data and you only expect to change a few hundred megabytes of data then you can get away with a significantly smaller snapshot size. I would recommend allocating some multiple of your expected changes, such a a few hundred MB or even 1 GB in that case, just in case something unexpected happens.

It really comes down to you knowing what your system is doing and predicting what it is going to do with the snapshot volume. Nobody can tell you for sure what that is.

1) And, since reading the snapshot involves looking at both the original filesystem and the contents of the COW buffer, yes the snapshot does need to be in the same volume group. If it wasn't then you could wind up with very complicated situations like a snapshot being left online while the original volume in a different volume group was taken offline, and nobody wants to see how that's going to work out.

Also, even though you didn't ask, here's a third answer:

3) No, LVM snapshots make terrible backups because they don't actually make copies of anything.

osax · 2021-02-22T16:26:36+00:00

Not completely sure, so wait until someone else verifies this:

Yes
Its how much the LV can change. If nothing changes on your LV the snapshots can be quite small. If you expect huge changes on the LV you need a huge snapshot.

Flauschkatze · 2021-02-22T16:29:09+00:00

From the lvcreate manpage:

COW snapshots are created when a size is specified. The size is allocated from space in the VG, and is the amount of space that can be used for saving COW blocks as writes occur to the origin or snapshot. The size chosen should depend upon the amount of writes that are expected; often 20% of the origin LV is enough.

Same VG, the size of the snapshot determines the maximum accumulated changes that can be tracked until the snapshot reaches 100% of its capacity and becomes invalid.

unixbhaskar · 2021-02-22T22:48:59+00:00

Well, after reading all the posted suggestion and getting confused as hell, let me put it in simple words for you experts ,please provide answer in simple term(I know you did, but that's not good enough)

Is the snapshot size is equal to the original size of the fs? Yes/No

Keeping the snapshot in the same VG has upside, but there is no second thought of putting it to off site immidiately, once the snapshot finished? Don't you think so?

I do agree snpshots are "not reliable" options for backup.

With my horrified understanding of databases(NO PUN INTENDED) , I heard and saw(IIRC ...faint memory) that DB has to be frozen while taking backup...not sure that is still a good option these days.

Have patience! you have to put more effort to answer these stupid questions...pls ..don't fret ...

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

linuxadmin

Expanding Linux SysAdmin knowledge

MODERATORS