"Eliminating redundant data in virtual machine backup processes is key due to the high degree of duplicate data across virtual machines. Source-side deduplication is ideally suited to these environments provided the deduplication processing doesn’t burden the host’s shared resources. Arkeia’s Progressive Deduplication approach addresses this conundrum with a ‘no trade-off’ approach. Its low-impact fingerprinting method makes deduplication fast and efficient at the source, introducing greater network and storage capacity optimization."
As the growth of enterprise data accelerates, organizations struggle to find more efficient ways to manage this data. One emerging approach to controlling data growth is data deduplication. Data deduplication technologies are especially relevant to backups because—despite careful use of differential and incremental backup strategies—a large fraction of backups consists of duplicate data.
Data deduplication is a data compression technology that shrinks data volumes by identifying and eliminating redundant file blocks. Deduplication gained an early foothold in organizations by making possible "better" network-attached storage (NAS). The primary benefit of "better" NAS was reduced storage volumes—though cost savings were sometimes meager.
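As a rough illustration of "identifying and eliminating redundant file blocks", the sketch below splits data into fixed-size blocks, fingerprints each block, and stores every unique block only once. The 4 KiB block size, the SHA-256 fingerprint, and the names `dedupe` and `restore` are assumptions chosen for illustration; they do not describe any particular product.

```python
import hashlib

BLOCK_SIZE = 4096  # hypothetical block size, chosen only for illustration


def dedupe(data: bytes, store: dict) -> list:
    """Split data into fixed-size blocks and keep each unique block once.

    Returns the ordered list of fingerprints (a "recipe") needed to
    rebuild the original data from the block store.
    """
    recipe = []
    for offset in range(0, len(data), BLOCK_SIZE):
        block = data[offset:offset + BLOCK_SIZE]
        fingerprint = hashlib.sha256(block).digest()
        store.setdefault(fingerprint, block)  # store only if not seen before
        recipe.append(fingerprint)
    return recipe


def restore(recipe: list, store: dict) -> bytes:
    """Rebuild the original data by looking up each fingerprint in order."""
    return b"".join(store[fingerprint] for fingerprint in recipe)


# Four logical blocks, but only two distinct ones are actually stored.
store = {}
data = b"A" * BLOCK_SIZE * 3 + b"B" * BLOCK_SIZE
recipe = dedupe(data, store)
```

Here the recipe holds four fingerprints while the store holds two blocks, which is where the reduction in storage volume comes from.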
Deduplication offers still greater benefits when it is integrated into backup agents: it makes backups faster and eliminates the cost and complexity of deploying stand-alone deduplication technology.
Arkeia Software acquired Kadena Systems in November 2009 to gain access to Kadena's data deduplication technology. Kadena’s Progressive Deduplication™ technology is an improvement over established “fixed-block” and “variable-block” deduplication technologies. Progressive Deduplication is:
- Source-side, target-side or mixed (rather than exclusively target-side)
- Block-grain (rather than file-grain)
- Content-aware (because the blocks are sized based on file type)
- In-line (rather than post-processed)
Arkeia’s Progressive Deduplication uses a sliding-window algorithm combined with “progressive-matching”, a technology protected by a combination of granted patents and trade secrets. The first Kadena patent was filed in 2004 and the technology has proven fast and reliable for more than 6,000 end-user customers.
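To give a sense of what a sliding-window algorithm does in general, the sketch below finds known blocks at arbitrary byte offsets using a rolling weak checksum confirmed by a strong hash, in the style popularized by rsync. It is a generic illustration only: the proprietary "progressive-matching" technique is not described in this document and is not reproduced here, and the names `weak_sum`, `roll`, `index_blocks` and `find_matches`, the 64-byte block size, and the choice of checksums are all assumptions.

```python
import hashlib

M = 1 << 16  # modulus for the weak rolling checksum (illustrative choice)


def weak_sum(block: bytes):
    """Compute an rsync-style weak checksum (a, b) for one block."""
    a = sum(block) % M
    b = sum((len(block) - i) * x for i, x in enumerate(block)) % M
    return a, b


def roll(a: int, b: int, out_byte: int, in_byte: int, size: int):
    """Slide the window one byte: drop out_byte, take in in_byte."""
    a = (a - out_byte + in_byte) % M
    b = (b - size * out_byte + a) % M
    return a, b


def index_blocks(data: bytes, size: int) -> dict:
    """Index previously stored data: weak checksum -> set of strong digests."""
    known = {}
    for i in range(0, len(data) - size + 1, size):
        block = data[i:i + size]
        known.setdefault(weak_sum(block), set()).add(
            hashlib.sha256(block).digest())
    return known


def find_matches(data: bytes, known: dict, size: int) -> list:
    """Return ("match", digest) and ("literal", bytes) operations for data."""
    ops, literal = [], bytearray()
    i, n = 0, len(data)
    checksum = None
    while i + size <= n:
        if checksum is None:
            checksum = weak_sum(data[i:i + size])
        digest = None
        if checksum in known:
            # Weak checksums can collide, so confirm with the strong hash.
            candidate = hashlib.sha256(data[i:i + size]).digest()
            if candidate in known[checksum]:
                digest = candidate
        if digest is not None:
            if literal:
                ops.append(("literal", bytes(literal)))
                literal = bytearray()
            ops.append(("match", digest))
            i += size          # jump past the matched block
            checksum = None    # recompute at the new position
        else:
            literal.append(data[i])
            if i + size < n:
                checksum = roll(*checksum, data[i], data[i + size], size)
            i += 1             # slide the window by one byte
    literal.extend(data[i:])   # trailing bytes shorter than one block
    if literal:
        ops.append(("literal", bytes(literal)))
    return ops


# Inserting three bytes at the front shifts every block boundary, yet the
# sliding window still recognizes all 16 previously stored blocks.
old = bytes(range(256)) * 4
new = b"XYZ" + old
ops = find_matches(new, index_blocks(old, 64), 64)
```

A fixed-block scheme that only compares blocks at aligned offsets would miss every block in this example, because the three inserted bytes shift all later boundaries.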
Source-side deduplication is important in backup applications because it accelerates the backup process by reducing the amount of data that is transferred over the network and stored on disk. Faster backups permit shorter backup windows, minimizing resource contention with other applications. Source-side data deduplication is especially attractive in environments with large amounts of duplicate data, such as virtual environments.
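The network saving can be sketched as a simple exchange: the backup agent fingerprints its blocks locally, asks the backup server which fingerprints it lacks, and sends only those blocks. The `BackupServer` class, the `backup` function, the 4 KiB block size and the in-memory "network" are hypothetical stand-ins for illustration, not a description of Arkeia's protocol.

```python
import hashlib

BLOCK_SIZE = 4096  # hypothetical block size, chosen only for illustration


class BackupServer:
    """Toy stand-in for a backup target that stores blocks by fingerprint."""

    def __init__(self):
        self.blocks = {}

    def missing(self, fingerprints) -> set:
        """Report which fingerprints the server has not stored yet."""
        return {fp for fp in fingerprints if fp not in self.blocks}

    def put(self, fingerprint: bytes, block: bytes) -> None:
        self.blocks[fingerprint] = block


def backup(data: bytes, server: BackupServer) -> int:
    """Back up data and return the number of block bytes actually sent."""
    local = {}
    for offset in range(0, len(data), BLOCK_SIZE):
        block = data[offset:offset + BLOCK_SIZE]
        local[hashlib.sha256(block).digest()] = block
    sent = 0
    for fingerprint in server.missing(local):
        server.put(fingerprint, local[fingerprint])
        sent += len(local[fingerprint])
    return sent


# Two virtual machine images that share the same eight "operating system"
# blocks and differ in a single block of their own.
os_image = b"".join(bytes([i]) * BLOCK_SIZE for i in range(8))
server = BackupServer()
sent_vm1 = backup(os_image + b"a" * BLOCK_SIZE, server)
sent_vm2 = backup(os_image + b"b" * BLOCK_SIZE, server)
```

In this toy setup the first image sends nine blocks and the second sends only one, since the server already holds the shared blocks. This is the effect that makes source-side deduplication attractive for virtual environments.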