Source-side deduplication happens at the backup client before data is transferred over the network. Using a cached record of blocks on the backup server, the backup agent determines if the backup server already has a copy of the specific block of data found on the client. If so, the agent does not transfer the duplicate block across the network. With target-side deduplication, all blocks are transferred to the target before duplicate blocks are discarded.
By significantly reducing the amount of data being transferred over the network, source-side deduplication enables faster network backups without requiring any additional network investments.
Source-side Deduplication for Virtualized Environments
Virtualized environments typically have vast amounts of redundant data because multiple virtual machines maintain distinct copies of the same operating systems and applications. Because virtual environments are not constrained by the direct cost of hardware, administrators often experience virtual server sprawl, deploying a greater number of (virtual) machines than would normally be present in physical environments.
Slow backups, network and disk congestion, and hypervisor load are common in highly virtualized environments. As a result of these issues, traditional backup methods often fail to meet the desired backup window. Source-side deduplication in virtual environments reduces network traffic, minimizes storage requirements, and reduces the load on the hypervisor.
Arkeia’s source-side deduplication can be performed within the guest OS or be integrated into hypervisor-specific backup agents, such as those based on VMware’s vStorage APIs for vSphere and Microsoft’s VSS APIs for Hyper-V.
Source-side Deduplication for Distributed Environments
Environments with geographically dispersed sites and only a slow WAN connection between sites pose a difficult backup challenge. Backups in these environments often require shipment of tapes offsite because sending nightly backups over the WAN is impossible. Deploying stand-alone solutions at each site makes it difficult to protect and manage the backup process.
Source-side deduplication makes backup in distributed environments feasible by drastically reducing the amount of data that is sent over the network. By combining deduplication with replication, enterprises have the ability to efficiently move data between sites for disaster recovery or redundant off-site storage.
Source-side Deduplication for the Data Center LAN
Data continues to grow at an exponential rate and although disk is becoming more affordable, network bandwidth—even over a LAN—still poses a problem: backup windows that can't be met. The cost of upgrading network infrastructure is significant. Source-side deduplication allows enterprises to implement a solution that expands storage capacity and network bandwidth capacity, without the costs of upgrading storage or network infrastructure.