Académique Documents
Professionnel Documents
Culture Documents
RECOVERY4X CHEAPER, 8X
FASTER, AND 10X BETTER
SYMMETRIX VMAX, DATA DOMAIN,
AND NETWORKER
DATA DOMAIN TRANSFORMS EMC ITS ORACLE
BACKUP AND RECOVERY INFRASTRUCTURE
ABSTRACT
Migrating from a legacy availability infrastructure for Backup and Recovery
creates challenges in terms of what are the best practices for a new Backup and
Recovery deployment with EMCs Oracle databases for Global Data Warehouse
and mission-critical Oracle applications. This white paper will illustrate the
transformation of EMC IT Oracle Backup and Recovery Infrastructure and
highlight how the Data Domain appliance transforms EMC IT Oracle Backup
infrastructure.
August 2012
WHITE PAPER
TABLE OF CONTENTS
EXECUTIVE SUMMARY ........................................................................................................................... 4
Audience ............................................................................................................................................ 4
EMC ITS BACKUP AND RECOVERY JOURNEY ......................................................................................... 5
EMC IT Overview ................................................................................................................................ 5
EMC ITs Backup and Recovery business drivers ...................................................................................... 5
EMC ITs Backup and Recovery Legacy Architecture................................................................................. 5
EMC Backup Profile Legacy EMC IT VTL infrastructure...................................................................... 5
Legacy Backup and Recovery Components........................................................................................ 5
Legacy Backup and Recovery Pain Points ............................................................................................... 8
EMC ITs New Backup and RecoveryData Domain Infrastructure ............................................................. 8
Components ................................................................................................................................. 8
EMC Data Domain Deployment Models ............................................................................................. 9
GDW/CRM BACKUP AND RECOVERY COMPONENTS.............................................................................. 10
Backup and Recovery enablers ............................................................................................................ 10
EMC ITS ORACLE BACKUP AND RECOVERY METHODS ......................................................................... 11
EMC ITs Offloaded Backup and Recovery Process for Oracle Databases .................................................... 11
Step 1- TimeFinder Clone .............................................................................................................. 11
Step 2- Clone Mounted on Proxy/Backup Host .................................................................................. 11
Step 3- RMAN/NetWorker Script to Backup up a Mounted Clone to Data Domain .................................. 11
EMC ITs Regular Backup Process ......................................................................................................... 12
EMC ITS OFFLOADED BACKUP EXAMPLES ........................................................................................... 13
Global Data Warehouse (GDW) Backup Size .......................................................................................... 13
GDW Pain Points ................................................................................................................................ 13
Oracle CRM Backup and Recovery Problem Statement ............................................................................ 13
Advantages of Proxy Host Solution ....................................................................................................... 13
EMC TimeFinder Clone .................................................................................................................. 13
EMC TimeFinder Snapshot ............................................................................................................. 14
ADVANTAGES OF ORACLE BACKUP AND RECOVERY DATA DOMAIN DEPLOYMENT ............................... 15
Reuse of the legacy EMC IT Backup and Recovery process ...................................................................... 15
EMC Data Domain Deduplication technology ....................................................................................... 15
Deduplication Benefits................................................................................................................... 15
Full Backups versus Incremental .................................................................................................... 15
ROI/TCO ........................................................................................................................................... 15
4X Cheaper.................................................................................................................................. 15
10X Better ................................................................................................................................... 15
EMC IT LESSON LEARNED .................................................................................................................... 16
Bottlenecks ....................................................................................................................................... 16
Disks .......................................................................................................................................... 16
Proxy Server: 8X Faster ............................................................................................................... 16
EMC NetWorker Server .................................................................................................................. 16
Utilization versus Vulnerability ....................................................................................................... 16
CONCLUSION ....................................................................................................................................... 17
References ........................................................................................................................................ 18
Acknowledgments .............................................................................................................................. 18
EXECUTIVE SUMMARY
EMC IT has seen explosive growth over the last five years accelerating the need to move from its legacy, Virtual Tape Library
(VTL) Backup infrastructure to a new EMC Data Domain Backup infrastructure.
This creates challenges in terms of what are the best practices for a new Backup and Recovery deployment with EMCs Oracle
databases for the Global Data Warehouse and mission-critical Oracle applications.
EMC IT implemented a phased approach because of the size of the Oracle environments:
Production
Test/Development
QA
Performance
Patch
Moving to the new EMC IT Data Domain infrastructure, for EMCs mission-critical Oracle Global Data Warehouse (GDW) and
Oracle CRM production environments has delivered the following advantages:
4X Cheaper - The Data Domain appliances are a quarter the cost of the legacy EDL/VTL
8X Faster Move from an incumbent EDL/VTL speed of 500 MB/hour to Data Domain speed of 4 TB/hour
Reliability - More reliable than tapes guaranteed that backups can be restored.
Density - Stores more backups for a longer period of time, even old backups can be quickly restored.
Protection Production Data Domain appliances are replicated to a remote Data Domain appliance off-site.
Speed Both backups and restores are significantly faster. Backups are now completed within the Service Level
Agreement (SLA) with fewer support resources.
Complexity Standard full backups are much easier to execute and restore than incremental backups.
Cost Data Domain units are one-fourth the cost with up 10 times the capacity, after deduplication.
This white paper will illustrate the journey and benefits of EMC ITs Oracle Backup and Recovery Infrastructure and highlight how
the Data Domain appliance transformed EMC IT Oracle Backup architecture. It will also include lessons learned from EMC IT on
this migration from a legacy VTL infrastructure to a Data Domain infrastructure.
Audience
This white paper is intended for CIOs, Oracle architects, Backup and Recovery architects, storage architects, Oracle Database
Administrators (DBAs), and server and network administrators.
EMC IT Overview
EMC is a company with over 53,000 users of IT services. It supports over 400,000 customers and partners in 5 Data Centers
with over 16 PB of storage. EMC IT has a portfolio with over 500 business applications and tools and over 8000 OS images with
more than 90 percent of all servers virtualized in 80 countries and 20 languages.
Inability to meet EMCs backup or restore SLAs with the legacy EMC VTL infrastructure
o
Daily hand-holding/fire fighting of the backup process by EMC IT Backup and Oracle SMEs
Limitations on the number of backup images stored on the legacy EMC VTL infrastructure
Mission-Critical Dev and Test environments backed up every two days FULL
Non Mission Critical and dev/test environments backed up FULL 2X per week
Archive log backups are run daily and when triggered by space alerts
have significant advantages over their tape counterparts as they dramatically improve throughput speeds. This is even more
significant when it comes time for a restore. Since the backups already exist on disk, restores do not suffer from the mechanical
limitations of either the tape drive streaming speeds or the robotic arms or even the fast forward and rewind actions associated
with finding the backup set.
The following is a high-level illustration of the legacy Backup and Recovery infrastructure:
NetWorker
Database Server
Proxy Host
A Proxy Host is a server used to mount a cloned copy of a database for backup purposes. It is primarily used to offload the
backup overhead from the production database. It can be used by many databases and has the advantage that the VTL can be
directly mounted to a single Proxy Host, giving significantly more throughput with SAN protocols, vs. backups over NFS. This is
much more important when performing partial restores, as the database can be rapidly restored to the Proxy Host and a partial,
or surgical restore can be performed very rapidly. Of course, if the data required is from the most recent backup, the database
does not even need to be restored from the VTL, as it still exists on the Proxy Host.
NetWorker
Proxy Host
Database Server
The configuration of the proxy server does not require significant CPU resources. The focus of the server is strictly for IO
throughput. For this reason, EMC IT selects a server with a large number of expansion slots. As an example, for EMC ITs CRM
environment the proxy server configuration is shown in Table 1.
Quantity
Hardware
lpe1200E dual-port 8GB HBA 4 ports zoned for VMAX, 4 ports zoned to VTL device
SOFTWARE
Oracle RDBMS single instance
RedHat 5.6
ASM
TimeFinder Clone
EMC TimeFinder is an EMC replication technology that is used by EMC Symmetrix VMAX storage arrays to instantly create an
exact copy of a set of LUNS. For a database, these LUNS would represent the database files for building a standby or backup
copy and optionally include the redo files, for creating a reporting copy. This capability allows the backup or clone of a database
without putting the database in a HOT backup mode.
Components
The new Backup and Recovery infrastructure consists of EMC ITs current Backup and Recovery tools:
Oracle RMAN
Proxy Hosts
The major difference, between the new infrastructure and the legacy infrastructure, is the destination of the backups. In the new
infrastructure, EMC IT writes to the Data Domain appliances. The Data Domain appliances, though the deduplication feature,
allows EMC IT to store many more backups in a fraction of the space previously required. This capability has eliminated the
need for the EDL/VTL and more importantly the need for any fragile tapes.
For the offsite requirements, EMC IT uses Data Domain replication to replicate the backups to a secondary datacenter 600 miles
away, completely eliminating the need for any tapes.
Data Domain
EMC Data Domain deduplication storage systems are designed and optimized, specifically for backup and archive of data.
Support for any conventional backup or archive application through generalized support for network-attached storage (NAS)
interfaces over Ethernet; a virtual tape library (VTL) interface option over Fibre Channel; and product-specific interfaces such
as NetBackup OpenStorage and EMC Data Domain Boost
High-speed, inline deduplication using small, variable-sized sequences to identify and eliminate redundant data segments
before storing to disk
Integrated data protection technologies such as RAID 6, post-backup data verification, and periodic validation checks of
existing data sets
Automated replication of backup data for disaster recovery (DR) using cost-effective, low-bandwidth WAN links, which
enables faster time-to-DR readiness
location for managing and cataloging backups. EMC NetWorker, along with many of the other solutions, also includes
technology that can significantly improve backup throughputs.
Data Domain
Database Server
VTL deployment
When the Data Domain system is deployed as a VTL, RMAN must be integrated with an enterprise backup application like EMC
NetWorker. This is because, like all VTLs, Data Domain systems operate with VTL or Open Storage protocols and RMAN is not
capable of interfacing with them directly.
With a VTL deployment, Data Domain systems can be connected though a 1 GB or 10 GB network typically dedicated to backup
traffic. They can also be connected directly to database server HBA ports achieving much higher rates of throughput.
NetWorker
Data Domain
Database Server
As the diagram illustrates, EMC IT deploys a shared Backup and Recovery infrastructure using the following components and
best practices:
RedHat AS 5.6
10
2.
Include the redo logs so that the database can be verified prior to the backup. This is not necessary, but adds an extra
level of confidence that the database backup is good. It also allows the database to be opened, read-only for
investigation, reporting or extract purposes.
3.
4.
Invoke the NetWorker/RMAN scrip to backup the TimeFinder Clone to Data Domain infrastructure
Once the Clone is originally created, subsequent recreate operations only apply the
incremental changes made to the source since the last recreate of the Clone. This reduces the first step of the backup
process. EMC ITs daily clone typically takes 40 minutes for a 20TB database.
After 40 minutes, the database has a viable Backup and Recovery solution. This backup, is now available for surgical or
complete restores.
Allows full backup to run on the Proxy Host producing no performance impact to the production database and production
user community
Enables surgical restores from the Proxy Host from the most recent backup or from a restored backup from any point in
time. Having a point-in-time restore on a separate host allows individual tables or tablespaces to be restored using
export/import or transportable tablespace or even database links for very specific requirements.
Enables ad-hoc querying for end-users. Queries can run from the clone for ad-hoc reporting allowing full autonomy for
reports running from the clone. This can stop less efficient or resource intensive reports from interfering with the source
database (Production).
Reverse Clone For databases that require a quick and efficient RTO (Recovery Time Objective) the Reverse Clone allows
super quick restores back to the source.
Offloaded Stats Generation before the next backup run, the database can be opened and stats generated on the cloned
version of the database. This allows for complete level of generation, with no impact to the production system.
Virtually no change to existing backup scripts, RMAN catalogs or pre-determined scheduling the only consideration or
potential change being a change to the filesperset parameter.
11
The deduplication benefits of Data Domain allow many more images of full backups to be stored with minimal storage
overhead. This is especially significant for databases that carry a lot of unchanged history as the bulk of the data, which is
the case in all data warehouses and most OLTP databases.
Data Domain enables EMC IT to eliminate the need for a traditional Oracle incremental backup strategy. This simplifies the
restore steps and shortens the recovery time window.
12
support requirement is the ability to backup and provide realistic and flexible recovery options to the business groups that rely
on this data.
GDW runs on a six-node Linux-based Oracle Real Application (RAC) Cluster that went live in November of 2009, as a 10TB
database. Since 2009, GDW has significantly grown and as of May of 2012 GDW has doubled to 20 TB with archive log
generation totaling from 500 to over 1500 GB daily.
13
14
Deduplication Benefits
Data Domains deduplication technology has enabled EMC IT is to retain many more backup images with minimal storage
overhead. EMC IT can now restore backup images from several months pastwithout having to go through the difficult process
of locating off-site tapes and hoping that they are still valid.
ROI/TCO
4X Cheaper
Typically, it is difficult to calculate actual dollars saved when utilizing shared infrastructure and the costs to administer an
environment that is not meeting SLAs. In EMC ITs case, the actual cost of the Data Domain units EMC IT replacing its EDL
with are one fourth the cost and between double and quadruple the capacitydepending on the actual de-dupe rates.
In addition, to these savings, Data Domain technology is also increasing EMC ITs ability to retain backups longer, and
completely eliminate the need for tapes. When factoring in all of these additional cost reductions, it is clear that overall costs
greatly exceed the 4x cheaper claim.
10X Better
The following is a mix of quantitative/qualitative measurements of Better that include people, process, and technology:
1) Faster8X faster with Proxy server deployment with 8GB FC deployment
2) ReliabilityMore reliable than tapes guaranteed backup can be restored
3) CostCosts less for hardware and environments (power and cooling)
4) DensityStores more backups creating a greater retention period
5) ProtectionReplicated remote Data Domain appliance can be deployed off-site
15
Bottlenecks
Disks
When configuring for high speed backups, there are many potential locations for bottlenecks. The most obvious of these is the
disk layout, in the storage array. When it comes to I/O, the capability of the disks is most often the bottleneck.
To give some idea, a 15K fibre channel drive is capable of driving between 47 and 53 GB/hour. This means that to support an
8TB/hour backup, you would need 155 of these disk drives.
For the Data Domain Backup and Recovery environments, EMC IT does not need 8 TB/hour. This means EMC IT configured its
disk pool with only 76 disks and is achieving 4 TB/hour.
16
CONCLUSION
EMC IT implemented a Backup and Recovery solution to deal with the growing size of its Oracle environments:
Migrating from a legacy EMC EDL/VTL solution to Data Domain brought the following advantages to EMC ITs Backup and
Recovery deployment:
Moving away from limited Backup and Recovery backup retention to enabling of simple full backup to reduce the complexity
of incremental restore process
Accommodates months of full Backup and Recovery backups enabled by the Data Domain appliance
Deployment of a Proxy host architect in the Backup and Recovery architect delivers the following:
o
Off load backup process off of production enabling EMC business users experience no disruption or performance issues
on the CRM production server
Implementation of a shared infrastructure for EMC ITs mission critical CRM and GDW environments
EMC TimeFinder technology, which was able to create database clones in minutes (18 TB in 40 minutes) off the production
server and mount on the proxy server and backup in less than four hours
EMC IT also learned valuable lessons from the EMC EDL/VTL to Data Domain Backup and Recovery journey:
DisksTo meet the throughput in a Data Domain the source database Array needs the correct number of drives. To drive to
8 TB/hour the array would need 155 drives
NetWorker ServerIdentify a near capacity NetWorker server and plan for enough nodes to meet your total Backup and
Recovery community needs
Utilization versus VulnerabilityMonitor the backup and restore times in a shared deployment to increase the Backup
and Recovery infrastructure utilization while diminishing vulnerability of long running backups , failed backed or long running
restores or failed restores
Moving to the new EMC IT Data Domain infrastructure for EMCs mission-critical Oracle Global Data Warehouse (GDW) and
Oracle CRM production environments has enabled the following:
8X FasterMove from a legacy EDL speed of 500 MB/hour to an EMC Data Domain speed of 4 TB/hour
DensityStores more backups for a longer period of time; even restores for old backups can restored quickly
ProtectionProduction Data Domain appliances are replicated to a remote Data Domain appliance offsite
SpeedBoth backups and restores are significantly faster; backups are completed within the SLA with fewer support
resources
17
ComplexityStandard full backups are much easier to execute and restore than incrementals
CostEMC Data Domain appliances are one-fourth of the cost with up 10 times the capacity, after deduplication
References
EMC Symmetrix VMAX
http://www.emc.com/storage/symmetrix-vmax/symmetrix-vmax.htm
EMC TimeFinder
http://www.emc.com/storage/symmetrix-vmax/timefinder.htm
EMC Data Domain
http://www.emc.com/backup-and-recovery/data-domain/data-domain.htm
EMC NetWorker
http://www.emc.com/backup-and-recovery/networker/networker.htm
IDC Study Worldwide Purpose Built Backup Appliances
http://www.emc.com/collateral/analyst-reports/11530-idc-ww-pbba-2011-2015-forecast.pdf
EMC IT
http://www.emc.com/microsites/emc-it-proven/index.htm
Acknowledgments
The author would like to thank the EMC IT teams for assistance in the creation of this white paper.
CONTACT US
To learn more about how
EMC products, services, and
solutions can help solve your
business and IT challenges,
contact your local
representative or authorized
reselleror visit us at
www.EMC.com.
EMC2, EMC, the EMC logo are registered trademarks or trademarks of EMC Corporation in the
United States and other countries. VMware [add additional per above, if required] are registered
trademarks or trademarks of VMware, Inc., in the United States and other jurisdictions.
Copyright 2012 EMC Corporation. All rights reserved. Published in the USA. 08/12
EMC White Paper H10986
EMC believes the information in this document is accurate as of its publication date.
The information is subject to change without notice.
EMC Corporation
Hopkinton, Massachusetts 01748-9103
1-508-435-1000 In North America 1-866-464-7381
www.EMC.com
18