Data Domain: Leadership and Innovation
description
Transcript of Data Domain: Leadership and Innovation
Extending VMware Consolidation Benefits to Data Protection with Data Domain
Achieving Efficient Data Protection for Virtualized Environments with Data Deduplication
•Deduplication Storage Systems > 6,500 systems installed > 2,500 customers > 745 petabytes under Data Domain protection worldwide
•A History of Industry Firsts
Data Domain: Leadership and Innovation
First Dedupe NAS
First Dedupe Volume Replication
First Dedupe Gateway Largest Dedupe Array
First DedupeDirectory Replication
First Dedupe VTL
2003 2004 2005 2006 2007
First Dedupe Nearline Storage
2008
Data Domain: #1 in Deduplication Storage
Deduplication Defining a New Tier of Storage
Storage1.0 PRIMARY TAPE
Storage2.0 PRIMARY
SATA & RAID TAPE
Storage3.0
PR
IMA
RY
SATA & RAID & DE-DUPE
TAP
E
Clients Server Primarystorage
Backup/mediaserver
OnsiteRetentionStorage
Offsite Disaster
Recovery Storage
Retention/Restore Replication DRBackup
Archive to tapeAs required
WAN
Data Domain: Dedupe Simplified
• High-speed, inline deduplication storage; disk target for nearline applications Any leading backup software, archive apps, or custom nearline use All data types: structured and file Any fabric: NFS / CIFS / NDMP via Ethernet, or VTL via Fibre Channel Disk storage: Internal, or gateway to SAN array One dedupe infrastructure: remote office, datacenter with inline replication
OnsiteRetentionStorage
Offsite Disaster
Recovery Storage
Data Domain
Archive
Archive Application
Server
‘Drag&Drop’ Archiving
6
The VMware and Data Domain Joint Solution
Massive data retention Works especially well in VMware environments VMs protected quickly with efficient replication Data invulnerability Seamless integration
Data Domain Deduplication Storage
VMware Backup to Data Domain Deduplication
Storage
Data Domain Basics
2U or 3U(15) 500 GB SATA drives
RAID-6NVRAMN+1 Fan
2 - 6 Ports5.4 to 35 TB with Shelves
File System
(Gateway to: EMC, HDS, Nexsan, Pillar, NetApp, 3PAR)
Replication
CIFS, NFS, NDMP, OST
Ethernet
FC = VTL
Easy Integration with Existing Environment
Backup and Nearline Applications Data Domain DD500 or DD600 Appliance
Series
Second Friday Full Backup
B C D E F L G H
Data Deduplication: Under the Hood
A B C D E F G H I J
Friday Full Backup
A B C D A E F G
Mon Incr A B H
Tues Incr C B I
Thurs Incr A C K
Weds Incr E G J
BACKUP DATA LOGICAL ESTIMATED PHYSICALREDUCTION
Monday Incr 100 GB 7-10x 10 GB
Tuesday Incr 100 GB 7-10x 10 GB
K L
Wednesday Incr 100 GB 7-10x 10 GB
Thursday Incr 100 GB 7-10x 10 GB
2nd FRIDAY FULL 1 TB 50-60x 18 GB
TOTAL 2.4 TB 7.8x 308 GB
FRIDAY FULL 1 TB 2- 4x 250 GB
Store more backups in a smaller footprint.
Retain: Store More for Longer with Less
Week 1
BACKUP DATA LOGICAL ESTIMATED PHYSICALREDUCTION
April 14 3.8 TB 10x 366 GB
April 21 5.2 TB 12x 424 GB
April 28 6.6 TB 14x 482 GB
May 31 12.2 TB 17x 714 GB
June 30 17.8 TB 19x 946 GB
TOTAL 23.4 TB 20x 1178 GB
April 7 2.4 TB 8x 308 GB
Over 1 year of retention in 3µ of Data Domain protection storage.
Week 2
Week 3
Month 1
Month 2
Month 3
Month 4 July 31 23.4 TB 20x 1178 GB
10
Challenge: Managing and Protecting Server Environments
Storage Infrastructure and Bandwidth Costs Backup data > original data Replication of active system volumes for high availability
is complex and expensive Replication of backup data is inefficient
Data Center DR Facility
Replicate1TB
Primary Data
10TB
Backup Data
Expensive
HA Only
ReplicateImpractical
Challenge: Achieving the Data Protection Benefits of Virtualization
• Virtualization enables simplified data protection anddisaster recovery through the encapsulation ofsystems as files.
• However, there is often a significant storagerequirement in order to capitalize on these dataprotection benefits (i.e. image-level .vmdk backups)
Interestingly: 37% of organizations said the amount of data they need
to protect increased after implementing server virtualization*
*Source: ESG Research Report, The Impact of Server Virtualization on Storage, December 2007
Solution: Using Deduplication to Protect vmdk Files
VirtualMachine
VirtualMachine
VirtualMachine
VMwareESX Server System
Inline deduplication
vmdkvmdk
normal backup process
WAN replication
vmdk
vmdk File Structure At least 1 per virtual machine Large, 2GB+ File Unique structure includes many empty
blocks
Data Domain Deduplication Benefits Dramatic reduction in protected vmdk file
size – average 40x-60x reduction Disk-speed protection and reliability Data Domain Replicator Software option
provides network-efficient disaster recovery over existing networks
Variable-Length Segments Are Critical• Data Domain variable-length segment deduplication can identify
change within each vmdk file
• Fixed segment lengths cannot deduplicate beyond the first point of data change:
Call me Izzy. Some years ago - never mind how long precisely - having little …
Call me Ish. Some years ago - never mind how long precisely - having little …
• Variable-length segments maximize redundancy!
Call me Izzy. Some years ago - never mind how long precisely - having little …
Anchor Points
Call me Ishmael. Some years ago - never mind how long precisely - having …
Redundant Segment
Confidential14
Maximize Disaster Recovery Benefits
Eliminate Tape Dependency for Disaster Recovery
99% reduction in bandwidth required to replicate vmdk files
Rapid rebuild of virtual machines • Recover the entire virtual machine – system state,
configuration, file system and application data• Enable system rebuild quickly and easily into
dissimilar server equipment• Mix of physical and virtual systems all recoverable
from Data Domain deduplication storage
VirtualMachine
VirtualMachine
VirtualMachine
VirtualMachine
VirtualMachine
VirtualMachine
vmdk files
file and application data
Problems with Tape for VMware DR
Time required to write and read large vmdk files to tape
Burden and expense of managing tape rotation
Delay in locating tapes Risk of lost tapes, failed tapes and
lost data Environmental impact of transporting
tape media between sites
WAN
Inline Deduplication for Optimized Time-to-DR
• Post-process DR restore point is usually obsolete
Replicate During Backup
DR-ReadyData DomainInline Dedupe/
Replication
Backup to Cache Dedupe & Replicate DR Ready
Post-ProcessDedupe
VTL/Tape/Truck Backup to VTL Copy to Tape Truck to DR Site
DR-Ready
Backup Window Additional 2-3x backup time to get to DR Ready
Network-Efficient Replication for True DR
WAN
home
home
DIR A
Source: Remote Sites
Destination: Data Center Hub Supports hundreds of remote sites
95- 99% Cross-site Bandwidth Reduction
1- 5%
1- 5%
1- 5%
True DR; lowers WAN costs; improves SLAs.
Archive Data
Backup Data
DD120
DDX with DD690s
DD580
DD120
‘Green’ IT Initiatives Demand Efficient Storage
By 2012, de-duplication will be applied to 75% of backups.*-- Gartner
IT managers looking to boost storage efficiency next year willembrace online storage services, push de-duplication in the datacenter and adopt solid-state disk drives to help fuel hardwareconsolidation strategies and green initiatives.**
-- IDC
One-third of organizations with over 25TB of data cite that physicalfootprint and energy efficiency has become a more a importantpurchasing consideration of secondary storage systems***
-- ESG
***Source: ESG Research Report, Data Protection Market Trends, January 2008
*Source: Gartner, Predicts 2008: Emerging Technologies Make Storage Hardware and Software More Effective, December 21, 2007
**Source: IDC, Top 10 Storage Predictions for 2008, December 24, 2007
Enable Simple and Efficient Data Protection for VMware Environments
Requirement
Greener IT Efficiencies
Reduce floor space and energy consumption of servers by improving utilization
Reduce floor space and energy consumption of storage with massive data reduction; average 40x-60x reduction of vmdk Files
Comprehensive Data Protection
and DR
vmdk for easy server rebuild VCB for best protection on
VMware Consolidate DR server
equipment
Disk-based performance and reliability Data Invulnerability Architecture Consolidated data protection for mixed
physical and virtual systems 99% less bandwidth required for efficient
network-based DR Dramatically improved recovery point
and recovery time service levels
Operational Simplicity
Automated system recovery Application and system high
availability through maintenance
Eliminate tape dependence Reduce the cost of long-term data
preservation to less than $0.35/GB – less than the cost of tape
Simplify DR walk-through proceduresReduced Costs Proven 80-90% reduced
operational Costs on Servers Proven 20x average reduced operational
costs on storage and bandwidth
Data Deduplication for VMware Environments
Please visit:http://www.datadomain.com/solutions/vmware.html
THANK YOU!