Backup Deduplication

Backup deduplication is a method of reducing the backup size by excluding duplicate data blocks from the backup. In any organization, VMs contain duplicates of data, such as VMs deployed from the same template, VMs with the same OS, and VMs that have some (semi)identical files, such as database entries. Block-level data deduplication enables to reduce backup size by saving only unique data blocks to the Backup Repository while replacing duplicated blocks with references to the existing ones.

backup deduplication

NAKIVO Backup & Replication automatically deduplicates all backups across an entire Backup Repository. This means that all data blocks are taken into account by backup deduplication, even if you back up VMware VMs, Hyper-V VMs, and AWS EC2 instances to the same backup repository. While backup deduplication is turned on by default, you can disable it if, for example, you want to use a hardware-based data deduplication device such as an EMC Data Domain.

VM backup deduplication can provide 10X to 30X reduction in storage capacity requirements. For example, you have 10 VMs running Windows 2016 Server, which occupies 10 GB each. While the total amount of data is 100 GB, only one copy of OS data (10 GB) will be written to a backup repository with data deduplication, which provides 10 to 1 storage space savings.

More efficient disk space utilization allows storing more recovery points per VM backup. In addition, lower storage space requirements save money on direct storage costs (as fewer disks are needed to store the same amount of information) and on related costs (such as cooling, electricity, and maintenance).