Data Domain system : EMC Data Domain storage systems are traditionally used for disk backup, archiving, and disaster recovery. An EMC Data Domain system can also be used for online storage with additional features and benefits. A Data Domain system can connect to your network via Ethernet or Fibre Channel connections.
Data Domain systems use low-cost Serial Advanced Technology Attachment (SATA) disk drives and implement a redundant array of independent disks (RAID) 6 in the software. RAID 6 is block-level striping with double distributed parity. Most Data Domain systems have a controller and multiple storage units.
A Data Domain system is:
- A storage system used for backup and archiving workloads that:
o Performs high-speed deduplication to maximize storage efficiency
o Ensures recoverability of data through integrated data integrity intelligence
o Can replicate data automatically for disaster recovery
o Easily integrates via Ethernet and Fiber Channel into existing backup infrastructures
- Safe and reliable
o Provides Continuous recovery verification, fault detection, and healing for end-to-end data integrity
DD Boost : EMC Data Domain Boost extends the optimization capabilities of Data Domain systems for other EMC environments, such as Avamar and NetWorker, Greenplum, Quest vRanger, Oracle RMAN etc. DD Boost is a private protocol that is more efficient for backup than CIFS/NFS. DD Boost shares the work of deduplication by distributing some of the processing with the application host. This feature is called distributed segment processing (DSP). The DD Boost protocol enables backup servers to communicate with storage systems without the need for Data Domain systems to emulate tape. The application host is aware of, and manages replication of backups created with DD Boost. This is called Managed File Replication.
There are three basic features to DD Boost:
- A private protocol that is more efficient than CIFS or NFS. DD Boost has a private, efficient data transfer protocol with options to increase efficiencies.
- Distributed segment processing (DSP). An optional feature to DD Boost shares portions of the deduplication process with the application host, improving data throughput. DSP distributes parts of the deduplication process to the NetWorker storage node using the embedded DD Boost Library (or, for other backup applications, using the DD BOOST plug-in), moving some of the processing normally handled by the Data Domain system to the application host. The application host performs a comparison of the data to be backed up with the library and looks for any unique segments. Thus it sends only unique segments to the Data Domain system.
- DD Boost provides systems with centralized replication awareness and management. Using this feature, known as Managed File Replication, backups written to one Data Domain system can be replicated to a second Data Domain system under the management of the application host. The application host catalogs and tracks the replica, making it immediately accessible for recovery operations. Administrators can use their backup application to recover duplicate copies directly from a replica Data Domain system.
Data Domain and Oracle Oracle RMAN is a built-in tool that allows the database administrator (DBA) to easily back up and recover data in an Oracle database. RMAN handles the coordination required to ensure that transaction integrity is preserved, and sufficient information is maintained to recover the database to any appropriate point. RMAN can create backup sets that comprise as much or as little recovery information as the DBA requires but usually include information from the database datafiles, control files, and redo and archived log files. RMAN supports performing backups to a local tape drive 1 a local disk, or a NAS device, as well as integration with traditional enterprise backup applications, as shown in the below figure
Benefits of using a Data Domain system as a target for Oracle RMAN By eliminating redundant data segments inline, Data Domain systems allow many more backups to be retained than would be possible using traditional storage. In particular, Data Domain systems use a variable-length segmentation process that is extremely efficient at finding identical segments within backups of monolithic files, such as Oracle datafiles.
The ability of the Data Domain system to store several weeks or months of full Oracle backups enables the DBA to implement a backup and recovery scheme with great flexibility and protection while consuming a minimal amount of physical storage. The integration of the Data Domain system into an Oracle/RMAN environment is seamless
Since the Data Domain system presents itself either as an NFS or CIFS shared storage server. Oracle/RMAN already supports and documents this type of installation for effective RMAN storage.
If an enterprise backup software solution such as Oracle Secure Backup or EMC NetWorker is already in use, the Data Domain system can be seamlessly integrated into this environment as well. In this case, the Data Domain system can appear as an SBT_Tape device or a disk device to the enterprise backup software solution.
For critical Oracle environments, it is a best practice to replicate the production Oracle data to a secondary recovery location. The DBA has many options to choose from, including technologies from Oracle such as Oracle Data Guard, solutions offered by primary storage providers, and third-party solutions. Data Domain Replicator software offers extremely bandwidth-efficient replication that is also easy to deploy, enabling DBAs to leverage RMAN to provide disaster recovery capabilities for Oracle databases.
The primary benefit of Data Domain Replicator is the fact that only deduplicated and compressed data is transferred across the network. Because deduplication is happening inline, replication takes place while the RMAN backup process is still active. As the RMAN backup process proceeds, the unique segments and metadata representing each file in the backup set are queued for replication to the remote site, -to- many cases, replication is completed within a short period of time after the initial backup completes.
References For additional information, see the following:
EMC Data Domain Family products and deduplication technology http://www.emc.com/products/family/data-domain-family.htm
EMC Data Domain solutions for Oracle http://www.datadomain.com/solutions/oracle.html
EMC Backup, Recovery, Archive solutions for Oracle http://www.emc.com/solutions/samples/oracle/backup-recovery-archive-oracle.htm EMC
Solutions for Oracle http://www.emc.com/solutions/application-environment/oracle/index.htm
EMC Data Domain Global Deduplication Array http://www.emc.com/products/detail/hardware/data-domain-global-deduplication-array.htm
EMC Data Domain Boost software http://www.emc.com/products/detail/software/data-domain-boost.htm