The Difference Between VM Snapshots vs Backups
Backups and snapshots are two popular methods that enterprises use to mitigate and avoid data loss. In fact, more than 91% of organizations rely on backups to protect their data and mitigate the consequences of cyber threats and disasters.
Snapshots and backups may seem to serve the same purpose at first glance since they both allow you to recover data. However, snapshots are not an alternative to backups. They have varying use cases and cannot be used interchangeably.
This post explains both methods and lists their pros and cons. Read on to discover the differences between snapshots and backups.
What Is a VM Snapshot?
A snapshot is an exact record of a machine or system at a specific point in time. It preserves the state and the data of the VM including its power state (on, off or suspended) and all its files such as disks, memory and network interface cards. Snapshots are generally used to restore a VM after a system failure, bad update or error.
How Snapshots Work
When you take a snapshot, the state of the base virtual disk is recorded and maintained at that moment. All changes are captured and written to newly-created delta disk files that are stored in the same folder as the base disk. The system creates child disks for each source disk of every snapshot.
The first snapshot is an exact image of your machine and the subsequent snapshots collect the changed data blocks only. This allows for quicker versioning and enhances protection against data loss incidents.
Note though that the child disks are directly dependent on the corresponding parent disk. If the latter is deleted or corrupted, then you can no longer recreate the VM. The recovery process reads all the files in a sequential order starting with the parent disk file followed by the related delta files.
VMware snapshots
In the VMware vSphere client, you can right-click the VM name, select Snapshots and click Take Snapshot to create a snapshot.
The VMware snapshot operation creates several files with different extensions:
- .vmdk – The flat.vmdk file contains the raw data in the base disk.
- -delta.vmdk – The delta disk is represented in the format of .00000x.vmdk. It contains the difference between the current state of the virtual disk and the state that existed at the time that the previous snapshot was taken. Data in the child VMDK disk is saved in the SPARSE format using the copy-on-write mechanism.
- .vmsd – This file is the database file for the snapshot itself. It contains the snapshot metadata and is the primary source of information for the snapshot manager. The entries contained in this file are the snapshots and relationships between each snapshot and its child disks.
- .vmsn – The .vmsn file includes the active state of the VM and captures the memory state at the point of the snapshot. This allows you to revert to a running state of the machine. If you create a snapshot without including the memory file, you recover the VM in a turned off state.
Hyper-V checkpoints
In Microsoft Hyper-V, VM snapshots are known as checkpoints. They operate in a similar way by saving or copying the state of a VM at a specific point in time.
To create a Hyper-V checkpoint, right-click a VM and choose Checkpoint.
In Windows Server, a snapshots folder is created containing the binary file format:
- .vmcx – This file includes the configuration information of the virtual machine.
- .vmrs – This is the VM runtime state information file.
Also, a differencing disk with the .avhdx format is created. This records the delta changes that are made post checkpoint creation.
Snapshot Use Cases
In a virtual environment, snapshots act as a failsafe that can be used to roll back to a certain point in time. Snapshots are usually done before performing actions that might potentially damage the system or corrupt files such as installing updates, uninstalling components or editing software.
In addition, snapshots are useful in development environments since you can conduct “rinse and repeat” testing to validate code changes and recover recently changed data if necessary.
Pros and cons of snapshots
Snapshot pros:
- Quick creation: Easily take a VM snapshot with minimal impact on the production server.
- Automated process: Schedule snapshots before specific actions.
- Instant rollback: Restore a machine to a previous point in time almost instantly.
- Reduced TCO: Minimize the total cost of ownership (TOC) by eliminating the need for native Windows backup tools.
Snapshot cons:
- Large storage consumption: Snapshots can take up large storage space if kept for too long. Based on the volume of data you generate, it is recommended to keep a snapshot for one to three days.
- Short-term retention: In addition to large storage consumption, snapshots impact network performance if stored for more than a few days. This prevents long-term data retention.
- Single point of failure: If anything happens to your production server, you risk losing all your data since source disk and delta disk files are stored in the same location.
- Dependency on source data: The delta files created after you take a snapshot are directly dependent on the source machine. If the original file is deleted then you can no longer recreate the VM.
What Is a Backup?
Backups are copies of entire systems (including OS, configuration files and app data) or of individual files. When you perform any type of backup, you create copies of data that are completely independent from the source machine. This means that you can store backups for as long as you need in a location other than the original: offsite, in a public cloud, on tape or hardware storage devices like NAS or SAN.
Modern backup solutions offer high flexibility when it comes to recovery, retention and storage optimization. You can restore entire machines or individual files and folders, instantly boot machines from backups or perform cross-platform recovery. Backup solutions also enable you to rotate recovery points and use compression and deduplication along with other features for better performance and storage consumption.
Backup Use Cases
Backups are the core of any data protection strategy since they are great for storing a copy of your data for a long period of time. By sending backups to multiple storage locations, you eliminate a single point of failure and guarantee business continuity.
Pros and cons of backups
Backup pros:
- Multiple storage destinations: Keep backups onsite, send them to an offsite storage or store them in the cloud.
- Easy access: Manage and recover your backups for anywhere and at any time.
- Flexible recovery: Choose among different recovery options depending on your needs.
- Long-term retention: Store backups for as long as you need using efficient rotation schemes and data reduction techniques to save on storage costs.
Backup cons:
- Slow process: Performing full backups on a large volume of data can take a lot of time to complete compared to snapshots.
- Network overload: The backup creation process can put an excess load on your network, especially during business hours. Luckily, you can use network acceleration features to solve this issue.
Snapshot vs Backup: Key Differences Summarized
Using snapshots and backups interchangeably is not recommended and can lead to data loss. It is crucial to understand the main differences between these two methods to create a robust data protection strategy.
The table below offers a clear comparison:
Snapshot | Backup | |
Purpose | Preserves the state of a machine at a specific point in time | Creates a copy of entire machines or individual files |
Dependency | Dependent on source data | Completely independent from source data |
Process Duration | Generally quick for small VMs | Can vary from slow to quick depending on the backup type and VM size |
Storage location | Same location as source VM | Same location as source VM or different location: offsite, cloud or hardware storage |
Storage Consumption | High and can cause performance issues if stored for more than a few days | Can vary from low to high depending on data volume, backup type, compression and deduplication features, etc. |
Use cases | Development and testing | Data protection |
Retention period | Short, generally up to 72 hours | Long |
Recovery | Relatively quick | Can vary from slow to quick depending on the recovery option |
Modern Backup Solutions
Modern data protection solutions designed for virtual environments are agentless, meaning that they don’t require an agent to be installed on each VM to protect it. They leverage VM snapshot technologies in VMware vSphere and Microsoft Hyper-V to create backups at the virtualization level. Once the backup is created, the VM snapshot is deleted. These are known as image-based VM backups, that is, point-in-time copies of VMs that include guest OS disks, configuration files and applications.
Backup from storage snapshots
As hardware vendors develop new solutions, some backup vendors are starting to integrate these emerging technologies in their solutions. One such technology is storage snapshots that are leveraged by modern backup vendors to reduce the load on production resources that can come from regular VM snapshots, especially for very large VMs.
Third-party software like NAKIVO Backup & Replication include the backup from storage snapshot feature. When used with certain storage devices, the solution triggers a temporary VM snapshot first then a storage snapshot. The VM snapshot is deleted as soon as the storage snapshot is taken. Storage snapshots can be produced relatively quickly and contain all the delta and CBT data needed to perform a VM backup.