Comdex Brasil August 2004 Storage Presentation

COMDEX Brasil São Paulo Storage Conference 19 August 2004 Afternoon 45 minutes Choosing the Best Solutions for Business Continuity Tom Flesher E-Net Corporation

Description: presentation made at Comdex Brasil 2004

Transcript of Comdex Brasil August 2004 Storage Presentation

Page 1: Comdex Brasil August 2004 Storage Presentation

COMDEX Brasil São Paulo Storage Conference

19 August 2004 Afternoon 45 minutes

Choosing the Best Solutions for Business Continuity

Tom Flesher
E-Net Corporation

Page 2: Comdex Brasil August 2004 Storage Presentation

Session Abstract & Outline

Abstract: Choosing the best solution for business continuity and log replication

– Data replication and high availability technologies are essential for business continuity. In this presentation, the speaker compares and contrasts options for data contingency and business continuity, including case studies.

Outline

– Topic 1: Remote replication benefits
  Business continuity
  Availability
  Load balancing
  Special applications for replication

– Topic 2: Remote replication alternatives
  Hardware and software solutions
  Database shadowing – uni-directional replication
  Peer-to-peer replication
  Disk mirroring
  Remote vaulting & journaling

– Topic 3: Cost comparison of the different solutions

– Topic 4: Case studies
  US Financial Services Industry
  US Securities Industry

– Topic 5: Discuss: Next step?

Page 3: Comdex Brasil August 2004 Storage Presentation

Why replicate critical databases over extended distance?

Simple answer: “…to have a hot standby copy of your organization’s most critical data at a remote location in case you need it.”

Your needs may include one or more of the following:

Business continuity
– Your primary copy of the data is knocked out or is unavailable for an unacceptable length of time – in other words, an “unscheduled outage”

Continuous availability
– Your primary copy will be unavailable due to a planned and deliberate event, such as maintenance, migrations or reorganizations

Workload balancing, data mining and replication
– You can run data mining applications against the replica copy and avoid the overhead on the primary copy
– Replica copy can, with some solutions, be “paused” at a point of logical consistency at any desired point in time

Page 4: Comdex Brasil August 2004 Storage Presentation

Demands on Your Data

[Diagram: Production Database, with labels Call Center, EDI, OLTP, Internet, Analysis, Data Mining]

Page 5: Comdex Brasil August 2004 Storage Presentation

Benefits of Extended Distance Database Replication

[Diagram: Production Database and Replica Database]

Business Continuity
– Not compromised by regional outage

Availability
– Switch to backup site during scheduled outages

Application-driven
– Ad hoc queries
– Warehousing
– Data mining

Page 6: Comdex Brasil August 2004 Storage Presentation

What’s the difference between mirroring and replication?

Mirroring is hardware-based replication…
– All changed data is sent to the remote location at the physical track level
– Always homogeneous (identical) and uni-directional
– Can be synchronous over relatively short distances
– Must be asynchronous over extended distance

…but there’s also software-based replication
– Change data may be sent in the form of log/journal records, MQ messages, time-triggered “sweeps” of source
– May be heterogeneous (different) and bi-directional, star clustered or peer-to-peer

Mirroring is, by definition, a hardware-based solution

Replication may be accomplished using hardware and/or software facilities

Page 7: Comdex Brasil August 2004 Storage Presentation

Using mirrored disks for hardware-based replication

Mirroring assumes a “system restart” at takeover time
– All database files, logs, catalogs must be mirrored faithfully
– Notion of “time-consistent restart state” – simulation of the local/production site environment at a precise point in time
– Enables database management systems to perform warm restart process – backout of inflight transactions (ACID) and cleanup of dirty pages (e.g. ARIES/DB2) – to restore database to a consistent state
– After warm restart, data is ready to be used for application takeover

Point-in-time copies can be spun off when needed
– For recovery testing
– To support read-only applications
– To freeze the entire environment while a new full copy is being transmitted

Page 8: Comdex Brasil August 2004 Storage Presentation

Refresh vs. Change Propagation

A Refresh involves a full copy of all data
– Time-consuming
– Needs lots of bandwidth
– Required for all replication solutions
  Some permit tape / CTAM refresh…
– Potential impact on production application

Change propagation involves capture and transmission of changes
– Physical (track) vs. logical (record)
  Bandwidth considerations
– Synchronous and asynchronous components
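The physical (track) versus logical (record) distinction can be illustrated with a rough calculation. This is only a sketch: the track size, record size and per-record log overhead below are assumed figures chosen for illustration, not values from the presentation.

```python
# Rough comparison of bytes shipped for physical (track) vs. logical (record)
# change propagation. All sizes are illustrative assumptions.

TRACK_BYTES = 56_664        # assumed size of one disk track
RECORD_BYTES = 200          # assumed average size of a changed record
LOG_OVERHEAD_BYTES = 60     # assumed per-record log header overhead


def physical_bytes(tracks_touched: int) -> int:
    """Bytes sent when every touched track is shipped whole."""
    return tracks_touched * TRACK_BYTES


def logical_bytes(records_changed: int) -> int:
    """Bytes sent when only log records for the changed rows are shipped."""
    return records_changed * (RECORD_BYTES + LOG_OVERHEAD_BYTES)


# Worst case for track-level shipping: 1,000 updates, each on a different track.
updates = 1_000
print("physical:", physical_bytes(updates), "bytes")
print("logical: ", logical_bytes(updates), "bytes")
```

When many updates land on the same track, `tracks_touched` is smaller than the number of updates and the gap narrows, so the comparison depends heavily on how scattered the workload is.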

Page 9: Comdex Brasil August 2004 Storage Presentation

Track/Page vs Record/Row

Page 10: Comdex Brasil August 2004 Storage Presentation

Challenges over Extended Distance

Propagation Delay
– A fully asynchronous solution should have adequate buffering and recovery capabilities to run over any distance
– Satellite links have huge propagation delays, yet typically very high bandwidth!

Bandwidth
– How many bits must move across the network over time?
– Include refresh in addition to change propagation requirements

Time to re-sync in event of extended network outage
– Just changed data or all data?

Cost
– Pure hardware cost, count extra copies!
– Recurring network costs – still high over extended distance
– Administration and maintenance costs
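The bandwidth questions above reduce to simple arithmetic. A minimal sketch follows, assuming hypothetical figures for database size, daily change volume, link speed and link efficiency; none of these numbers come from the presentation.

```python
def transfer_hours(data_bytes: float, link_bits_per_sec: float,
                   efficiency: float = 0.7) -> float:
    """Hours needed to move data_bytes over a link.

    efficiency is an assumed fraction of the nominal bandwidth that is
    actually usable after protocol overhead and competing traffic.
    """
    usable_bits_per_sec = link_bits_per_sec * efficiency
    return (data_bytes * 8) / usable_bits_per_sec / 3600


GB = 10**9
LINK_BPS = 45_000_000   # assumed 45 Mbit/s link, for illustration only

full_refresh = transfer_hours(500 * GB, LINK_BPS)   # assumed 500 GB database
daily_changes = transfer_hours(5 * GB, LINK_BPS)    # assumed 5 GB changed per day

print(f"full refresh:             {full_refresh:.1f} h")
print(f"daily change propagation: {daily_changes:.2f} h")
```

The same function answers the re-sync question: after an extended outage, compare the time to ship only the accumulated changes against the time for a full refresh.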

Page 11: Comdex Brasil August 2004 Storage Presentation

Solutions – Major Categories

Disk mirroring synchronously – short distances

Disk mirroring asynchronously
– with host data mover component
– without host data mover component

Software-based replication and change propagation using logs – “shadowing”

Other software-based replication solutions
– Peer-to-peer replication

Vaulting and journaling

Page 12: Comdex Brasil August 2004 Storage Presentation

Synchronous mirroring: Disk subsystem-to-subsystem

e.g. IBM PPRC, EMC SRDF/S

Distance limited (approx. 100 km)
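One reason synchronous mirroring is distance limited is that each write waits for the remote acknowledgement. The sketch below estimates only the propagation component of that wait. It assumes a signal speed in optical fiber of roughly 200 km per millisecond (about two thirds of the speed of light) and one round trip per write; real protocols may need more round trips, and equipment delays are ignored.

```python
FIBER_KM_PER_MS = 200.0   # approximate signal speed in optical fiber


def added_write_latency_ms(distance_km: float, round_trips: int = 1) -> float:
    """Extra latency per synchronous write from propagation delay alone."""
    one_way_ms = distance_km / FIBER_KM_PER_MS
    return 2 * one_way_ms * round_trips


for km in (10, 100, 1000, 4000):
    print(f"{km:>5} km: +{added_write_latency_ms(km):.1f} ms per write")
```

Under these assumptions the penalty grows linearly with distance, which is why the asynchronous options are the ones described as having virtually unlimited distance.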

Page 13: Comdex Brasil August 2004 Storage Presentation

Asynchronous mirroring: Using host data mover (e.g. IBM XRC)

Distance virtually unlimited

Page 14: Comdex Brasil August 2004 Storage Presentation

Asynchronous mirroring: Disk subsystem-to-subsystem

e.g. IBM PPRC-XD, Hitachi HARC, EMC SRDF/A

Distance virtually unlimited

Page 15: Comdex Brasil August 2004 Storage Presentation

Database log

Log has after-images of changed records
Sometimes only changed fields or columns
Used for forward/backward recovery processes
Concise basis for change propagation
Can be used for recovery or replication
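How after-images in a log can drive either recovery or replication can be sketched in a few lines. This is a simplified, hypothetical model: the `LogRecord` layout, the log sequence number (`lsn`) field and the `apply_log` function are illustrative names, not the format or interface of any particular database product.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class LogRecord:
    lsn: int                      # log sequence number (defines ordering)
    key: str                      # identifies the changed record
    after_image: Optional[dict]   # new contents; None models a delete


def apply_log(replica: Dict[str, dict], log: List[LogRecord],
              last_applied_lsn: int = 0) -> int:
    """Roll the replica forward by applying after-images in LSN order.

    Returns the highest LSN applied so propagation can resume from there.
    """
    for rec in sorted(log, key=lambda r: r.lsn):
        if rec.lsn <= last_applied_lsn:
            continue              # already applied, so re-sent records are harmless
        if rec.after_image is None:
            replica.pop(rec.key, None)
        else:
            replica[rec.key] = dict(rec.after_image)
        last_applied_lsn = rec.lsn
    return last_applied_lsn


replica = {"acct-1": {"balance": 100}}
log = [
    LogRecord(1, "acct-1", {"balance": 80}),
    LogRecord(2, "acct-2", {"balance": 20}),
    LogRecord(3, "acct-1", None),
]
print(apply_log(replica, log), replica)
```

The same loop serves forward recovery (apply the log to a restored backup) and replication (apply it to a remote copy); only the target differs.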

Page 16: Comdex Brasil August 2004 Storage Presentation

Software-based replication: Using logs for change propagation

Distance virtually unlimited

Workload balancing, data mining and replication

Replica copy can be “paused” at a point of logical consistency at any desired point in time
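One way to picture pausing a replica at a point of logical consistency is to apply only the changes of transactions whose commit record has already arrived. The sketch below is a hypothetical illustration: the `Entry` layout and the `replica_at` function are assumed names, and applying updates in log order is a simplification of what a real product would do.

```python
from typing import Dict, List, NamedTuple, Optional


class Entry(NamedTuple):
    lsn: int                    # position in the log
    txid: str                   # transaction that wrote the entry
    kind: str                   # "update" or "commit"
    key: Optional[str] = None
    value: Optional[int] = None


def replica_at(log: List[Entry], pause_lsn: int) -> Dict[str, int]:
    """Build the replica state as of pause_lsn, including only transactions
    whose commit record appears at or before pause_lsn."""
    visible = [e for e in log if e.lsn <= pause_lsn]
    committed = {e.txid for e in visible if e.kind == "commit"}
    state: Dict[str, int] = {}
    for e in sorted(visible, key=lambda entry: entry.lsn):
        if e.kind == "update" and e.txid in committed:
            state[e.key] = e.value
    return state


log = [
    Entry(1, "T1", "update", "A", 10),
    Entry(2, "T2", "update", "B", 5),
    Entry(3, "T1", "commit"),
    Entry(4, "T2", "update", "A", 99),
    Entry(5, "T2", "commit"),
]
print(replica_at(log, pause_lsn=4))   # T2 is still in flight, so its updates are held back
print(replica_at(log, pause_lsn=5))   # both transactions are committed
```

A replica paused this way never exposes half of a transaction, which is what makes it usable for data mining or reporting.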

Page 17: Comdex Brasil August 2004 Storage Presentation

Emerging trend: Peer-to-peer or “multi-master”

[Diagram: New York Node, West Coast Node, Central Node]

Multiple copies of the same database – all or some in update mode

Applications designed to tolerate ACID “fuzziness”

Page 18: Comdex Brasil August 2004 Storage Presentation

Remote Vaulting

[Diagram: Production Server and Vaulting Site]

Recovery uses full backups
Backups created (“vaulted”) at vaulting site
Tape and/or disk backups maintained at vaulting site
Channel extension technology
– Usually does not require host server at vaulting site

Page 19: Comdex Brasil August 2004 Storage Presentation

Remote Journaling

Database logs/journals are sent to journaling site
– One log file at a time
– Or, continuous real-time remote logging using software

Journaling site has small server

Recovery uses full backups plus database logs & journals
– Recovery possible to log truncation point

Full backups sent off-site via courier or using vaulting solution

[Diagram: Production Server and Journaling Site]

Page 20: Comdex Brasil August 2004 Storage Presentation

RPO/RTO and Daily Backups…

[Chart: Data Loss (RPO) axis marked 0, 1 Second, 1 Hour, 1 Day; Recovery Time (RTO) axis marked 0, 15 Minutes, 1-2 Hours, 6-24 Hours. Chart labels: Traditional, Daily Backups, CTAM / Vaulting]

RPO – Recovery Point Objective: How much transactional data is lost when an offsite recovery is required?

RTO – Recovery Time Objective: How much time is required to perform offsite recovery?

Page 21: Comdex Brasil August 2004 Storage Presentation

RPO/RTO for Advanced Solutions

[Chart: Data Loss (RPO) axis marked 0, 1 Second, 1 Hour, 1 Day; Recovery Time (RTO) axis marked 0, 10 Minutes, 1-2 Hours, 6-24 Hours. Chart labels: Traditional, Daily Backups, CTAM / Vaulting, Journaling (and/or Vaulting) Solutions, Asynchronous Database Replication, Asynchronous Disk Mirroring, Synchronous Disk Mirroring]

RPO – Recovery Point Objective: How much transactional data is lost when an offsite recovery is required?

RTO – Recovery Time Objective: How much time is required to perform offsite recovery?
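The RPO and RTO definitions can be made concrete by measuring both for a given incident. The sketch below uses invented timestamps and targets purely for illustration; the function names are hypothetical and the figures are not taken from the charts.

```python
from datetime import datetime, timedelta


def achieved_rpo(failure: datetime, last_replicated: datetime) -> timedelta:
    """Data-loss window: work done after the last change that reached the remote site."""
    return failure - last_replicated


def achieved_rto(failure: datetime, service_restored: datetime) -> timedelta:
    """Outage length: time from the failure until service resumes offsite."""
    return service_restored - failure


def meets_objectives(rpo: timedelta, rto: timedelta,
                     rpo_target: timedelta, rto_target: timedelta) -> bool:
    return rpo <= rpo_target and rto <= rto_target


failure = datetime(2004, 8, 19, 14, 0, 0)
targets = (timedelta(minutes=1), timedelta(minutes=15))   # assumed objectives

# Hypothetical incident with asynchronous replication running 2 seconds behind
repl_rpo = achieved_rpo(failure, datetime(2004, 8, 19, 13, 59, 58))
repl_rto = achieved_rto(failure, datetime(2004, 8, 19, 14, 12, 0))
print(repl_rpo, repl_rto, meets_objectives(repl_rpo, repl_rto, *targets))

# Hypothetical incident with daily backups: last backup at 02:00, restored 12 hours later
bkp_rpo = achieved_rpo(failure, datetime(2004, 8, 19, 2, 0, 0))
bkp_rto = achieved_rto(failure, datetime(2004, 8, 20, 2, 0, 0))
print(bkp_rpo, bkp_rto, meets_objectives(bkp_rpo, bkp_rto, *targets))
```

Starting from the objectives the business actually needs, rather than from a product, is one way to narrow the categories of solution worth costing.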

Page 22: Comdex Brasil August 2004 Storage Presentation

Relative Bandwidth Requirements

[Chart: relative bandwidth for series S1 and S2 over a time period of one week]

S1 – software-based replication
S2 – hardware-based replication

Page 23: Comdex Brasil August 2004 Storage Presentation

Relative Cost Levels for Various Widely-Deployed Solutions

[Bar chart: Relative Costs on a scale of 0 to 120 for six solutions: Tape with courier; Remote tape vaulting; Remote journaling; Remote database shadowing; Disk mirroring over short distance; Disk mirroring over long distance. Cost components shown: CPU, Memory, Disk, Tape, Comm, People]

Page 24: Comdex Brasil August 2004 Storage Presentation

Case study 1: US Financial Services Industry

[Diagram: New York Node, West Coast Node and Central Node, with data sets labeled A through F]

Page 25: Comdex Brasil August 2004 Storage Presentation

Case study 2: US Securities Industry

[Diagram: Production Site, Backup Site #1, Backup Site #2]

Mirroring only – short distance

Vaulting, Journaling & Data replication over Extended Distance

Page 26: Comdex Brasil August 2004 Storage Presentation

Next steps….

Analyze your organization’s requirements

Engage multiple subject areas
– Disaster recovery
– Operational management
– Application design and deployment

Estimate costs for two or maybe three solutions

Present best choices to management