How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President,...

35

Transcript of How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President,...

Page 1: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:
Page 2: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

How Oracle Database 10g Revolutionizes Availability and Enables the Grid

Juan LoaizaVice President, Systems Technologies

Oracle Corporation

Session id: 40164

Page 3: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

3

From High Quality Parts to High Quality Systems Traditionally Low Cost = Low Quality High quality systems were built by combining

high quality, high cost parts – Mainframe model Oracle enables a new model Oracle combines high volume inexpensive

processors and storage to produce a high quality system

Unbreakable Inexpensive Systems

Page 4: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

4

Low Cost Fault Tolerance

Grid ClustersGrid ClustersLow Cost Fault ToleranceLow Cost Fault Tolerance

Computer Computer FailuresFailures

Data Data FailuresFailures

System System ChangesChanges

Data Data ChangesChanges

UnplannedUnplannedDowntimeDowntime

PlannedPlannedDowntimeDowntime

Page 5: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

5

Commercial Grids and Availability

Grid pools standard low cost nodes and modular disk arrays

Perfect for RAC HA Failover can happen to

any node on the grid Grid load balancing will

redistribute load over time

Designed to Tolerate Failures

Page 6: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

7

New Economics forData Protection & Recovery

Disk Based RecoveryDisk Based Recovery

Computer Computer FailuresFailures

Data Data FailuresFailures

System System ChangesChanges

Data Data ChangesChanges

UnplannedUnplannedDowntimeDowntime

PlannedPlannedDowntimeDowntime

Trade cheap disk spacefor expensive downtime

Page 7: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

8

New World: Disk Based Data Recovery

Disk economics are close to tape

Disk is better than tape– Random access to any data

We rearchitected our recovery strategy to take advantage of these economics

– Random access allows us to backup and recover just the changes to the database

Backup and Recovery goes from hours to minutes

2000’s - 200 GB

1980’s - 200 MB

1000x increase

Page 8: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

9

Resiliency using Low Cost Storage

Computer Computer FailuresFailures

Data Data FailuresFailures

System System ChangesChanges

Data Data ChangesChanges

UnplannedUnplannedDowntimeDowntime

PlannedPlannedDowntimeDowntime

Storage FailureStorage Failure

Human ErrorHuman Error

CorruptionCorruption

Site FailureSite Failure

Four Failure Types

Page 9: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

10

Data Mirroring with ASM ASM mirrors data across

inexpensive modular storage arrays

No additional logging or expensive NVRAM to recover mirrors

– Database logging recovers mirrors

Automatically remirrors when disk or array fails

Designed to tolerate failures

Failure Resiliency using Low Cost Storage

Page 10: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

11

Collapsing the Cost of Human Error

Computer Computer FailuresFailures

Data Data FailuresFailures

System System ChangesChanges

Data Data ChangesChanges

UnplannedUnplannedDowntimeDowntime

PlannedPlannedDowntimeDowntime

Storage FailureStorage Failure

Human ErrorHuman Error

CorruptionCorruption

Site FailureSite Failure

Page 11: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

12

Human Error

Goal is to quickly analyze and repair – For Localized damage

Need surgical analysis and repair Example – deleted wrong order

– For Widespread damage Need complete back-out to avoid long downtime Example – batch job deletes this month’s orders

Human Errors

Other Downtime

Single Biggest Cause of Downtime

Page 12: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

13

Flashback Time Navigation Flashback Query

– Query all data at point in time

Flashback Versions Query– See all versions of a row between

two times– See transactions that changed the

row

Flashback Transaction Query– See all changes made by a

transactionTx 1

Tx 2

Tx 3

Select * from Emp AS OF ‘2:00 P.M.’ where …

Select * from Emp VERSIONS BETWEEN ‘2:00 PM’ and ‘3:00 PM’ where …

Select * from DBA_TRANSACTION_QUERY where xid = ‘000200030000002D’;

Page 13: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

14

Flashback Database

A new strategy for point in time recovery Flashback Log captures old versions of

changed blocks– Think of it as a continuous backup– Replay log to restore DB to time– Restores just changed blocks

It’s fast - recover in minutes, not hours It’s easy - single command restore

Flashback Database to ‘2:05 PM’

“Rewind” button for the DatabaseData Files Flashback

Log

New BlockVersion

Disk Write

Old BlockVersion

Page 14: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

15

Flashback Error Correction Recovery at all levels Database Level

– Flashback Database restores the whole database to time

Uses Flashback Logs Table Level

– Flashback Table restores rows in a set of tables to time

Uses UNDO in database– Flashback Drop restores a

dropped table or a index Recycle bin for DROPs

Row Level– Restore individual rows

Uses Flashback Query

Order

Database

Customer

Page 15: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

16

Flashback for All UsersEND USER• Flashback Query• Flashback Versions Query

DEVELOPER• Flashback Versions Query• Flashback Transaction Query• Flashback Table

DATABASE ADMIN• Flashback Database• Flashback Drop

SYSTEM ADMIN• Data Guard

Page 16: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

17

Revolution in Recovery

Flashback Revolutionizes Recovery– Operates on just the changed data– Time to correct error equals time to make error

Minutes instead of hours

Flashback is Easy– Single command instead of complex procedure

Correction Time = Error Time + f(DB_SIZE)

Page 17: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

18

Prevention & Recovery of Corruptions

Computer Computer FailuresFailures

Data Data FailuresFailures

System System ChangesChanges

Data Data ChangesChanges

UnplannedUnplannedDowntimeDowntime

PlannedPlannedDowntimeDowntime

Storage FailureStorage Failure

Human ErrorHuman Error

CorruptionCorruption

Site FailureSite Failure

Page 18: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

20

Flash Recovery Area

Fully automatic disk based backup and recovery

– Set and Forget

Nightly incremental backup rolls forward recovery area backup

– Changed blocks are tracked in production DB

Full scan is never needed– Dramatically faster (20x)– Blocks validated to prevent

corruption of backup copy

Use low cost ATA disk array for recovery areaTwo Independent Disk Systems

Flash RecoveryArea

Nightly ApplyValidatedIncremental

WeeklyArchiveTo Tape

DatabaseArea

Page 19: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

21

Low Cost No Compromise Disaster Recovery

Computer Computer FailuresFailures

Data Data FailuresFailures

System System ChangesChanges

Data Data ChangesChanges

UnplannedUnplannedDowntimeDowntime

PlannedPlannedDowntimeDowntime

Storage FailureStorage Failure

Human ErrorHuman Error

CorruptionCorruption

Site FailureSite Failure

Page 20: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

22

Existing Site Recovery Tradeoffs

User can delay log apply to protect from user errors but:– Failover takes hours– Reports run on hours old data

After failing over to standby, production DB must be rebuilt– Production has updates that did not get to standby

ProductionDatabase

StandbyDatabase

TransactionShipping

4 HourDelayApply

ReportingOn Delayed

Data

Page 21: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

23

Low Cost No Compromise Disaster Recovery

Flashback DB removes need to delay apply of logs to correct errors Flashback DB removes the need to reinstantiate primary on failover Real-time log apply enables real-time reporting on standby Data Guard works transparently across GRID clusters

– Standby can use fewer CPU resources than primary

Flashback Log

ProductionDatabase

StandbyDatabase

TransactionShipping

(Real Time Apply)

NoDelay

ReportingOn Real Time

Data

Flashback Log

Some Nodes Used for

Other Computing

Page 22: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

24

Highest Data Protection Lowest Cost

Combine the Features to

Achieve Any Level of Data Protection

Dramatic Advances in Ease of Use

ASM Mirroring Storage Failure

Protection

Flashback Human Error

Protection

Data Guard Site Failure ProtectionFlash

Recovery Area

Corruption Protection

Page 23: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

27

Allow any change to the system with no downtime

Online ReconfigurationOnline Reconfiguration

Goal

No Cost System Changes

Rolling UpgradesRolling Upgrades

Computer Computer FailuresFailures

Data Data FailuresFailures

System System ChangesChanges

Data Data ChangesChanges

UnplannedUnplannedDowntimeDowntime

PlannedPlannedDowntimeDowntime

Page 24: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

28

No Cost System Changes – Capacity on Demand

CPU– Add/remove CPUs on SMP online

Cluster Nodes– Add/remove cluster nodes online– No data movement needed

Memory– Grow and shrink shared memory

and buffer cache online– Auto tuning of memory online

Disk– Add/remove disks online– Automatically rebalance– Move datafiles

Page 25: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

29

Rolling Patch Upgrade using RAC

Initial RAC Configuration Clients on A, Patch B

OraclePatch

Upgrades

OperatingSystem

Upgrades

Upgrade Complete

HardwareUpgrades

Clients Clients

Clients on B, Patch A

Patch

1 2

34

A B A B

A BA B

B

APatch

Page 26: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

30

Rolling Release Upgrade using Data Guard

Initial SQL Apply Config

Run mixed to testSwitch to B, upgrade A

Upgrade node B to X+1

Major Release

Upgrades

Patch SetUpgrades

Cluster Software & Hardware Upgrades

Clients

LogsShip

Version X Version X

1

Clients

2

Clients

LogsShip

3

Clients

LogsShip

4

UpgradeLogsQueue

X X+1

X+1 XX+1X+1

Page 27: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

31

Online RedefinitionOnline RedefinitionEvolution without InterruptionEvolution without Interruption

Competitive pressures demand continual change

Need to change data with no interruption to the application

– location, format, indexing, or even definition

Goal

No Cost Data Changes

Computer Computer FailuresFailures

Data Data FailuresFailures

System System ChangesChanges

Data Data ChangesChanges

UnplannedUnplannedDowntimeDowntime

PlannedPlannedDowntimeDowntime

Page 28: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

34

Maximum Availability Architecture (MAA)

Operational Practices are key– Technology alone is not enough

MAA is a blueprint for achieving HA & DR

– Tested, validated, and documented best practices Database, Storage, Cluster,

Network 10 person year effort

– otn.oracle.com/deploy/availability

M.A.A.How to Prevent, Tolerate, & Recover From Outages

Maximum Availability = Unbreakable Architecture + Best Practices

Page 29: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

35

Highest Availability at Lowest Cost

Highest Availability– Fault Tolerant Clusters– Flashback Error Correction– Automated Disk Backup– No Compromise Disaster Recovery– Rolling Upgrades– Online Redefinition

At Lowest Cost– Low Cost Grid servers– Low Cost Modular Storage Arrays– Automated & Simple to Use

Oracle10g is Unbreakable & Inexpensive

Page 30: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

36

Next StepsHigh Availability Sessions from Oracle

11:00 AM

How Oracle Database 10g Revolutionizes Availability and

Enables the Grid

3:30 PM

Oracle Recovery Manager (RMAN) 10g: Reloaded

5:00 PM

Proven Techniques for Maximizing Availability

8:30 AM

Oracle Database 10g - RMAN and ATA Storage in Action

 

11:00 AM

Oracle Data Guard: Maximum Data Protection at Minimum Cost

 

1:00 PM

Oracle Database 10g Time Navigation: Human-Error Correction

 

4:30 PM

Data Guard SQL Apply: Back to the Future

Wednesday in Moscone Room 304Tuesday in Moscone Room 304

For More Info On Oracle HA Go To http://otn.oracle.com/deploy/availability/

Page 31: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

37

Next StepsHigh Availability Sessions from Oracle

8:30 AM in Moscone Room 304

Oracle Database 10g Data Warehouse Backup and Recovery:

Automatic, Simple, Reliable

8:30 AM in Moscone Room 104

Building RAC Clusters over InfiniBand

Thursday

For More Info On Oracle HA Go To http://otn.oracle.com/deploy/availability/

Real Application Clusters

Data Guard

Database Backup & Recovery

Flashback Recovery

LogMiner, Online Redefinition, and Cross Platform Transportable

Tablespaces

Database HA Demos All Four DaysIn The Oracle Demo Campground

Page 32: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

AQ&Q U E S T I O N SQ U E S T I O N SA N S W E R SA N S W E R S

Page 33: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

39

New Oracle Database 10g HA Features

Clusters Portable Clusterware Cluster file system for

Linux & Windows Automated Patching

Data Guard SQL Apply Support for Longs Support for multi-byte

CLOBs and NCLOBs Support for Index

Organized Tables Simplified zero data

loss failover Real time apply allows

real time reporting Zero downtime

instantiation

Rolling Upgrades Rolling Upgrades Using

Data Guard SQL Apply

Online Redefinition Support of Unique

Indexes One Step Cloning of

Dependent Objects Columns can be

Populated Using Sequences & Sysdate

Signature Based Dependency Tracking Using Synonyms

Online Segment Shrink

Data Guard Generic Data Guard Broker

support for RAC Named Data Guard

Configurations Real Time Apply Flashback Standby

Database Flashback

Reinstantiation Improved Recovery

Parallelism

Page 34: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id:

40

New Oracle Database 10g HA Features

Flash Backup & Recovery

Automated Management of B&R Disk Space

Simplified Backup Using Image Copy

Change Aware Incremental Backups

Incrementally Updated Backups

Compressed archive logs

Tuning Improved Recovery

Parallelism Faster Instance Startup

& Cache Warm

Flashback Flashback Drop Flashback Row History Flashback Table Flashback Transaction

History Flashback Database Better map of time to

SCN for flashback query

LogMiner Automated Specification

of Logs to Mine Support for Shared

Server Configurations Fine Grained

Supplemental Logging

Backup & Recovery Simplified Recovery

Through Resetlogs Restore Tolerates Missing

Backups Proxy Backup of Archives Automated TSPITR

Instantiation Full DB Begin Backup Automated Backup

Channel Failover Simplified RMAN

cataloging of backup files Automated File Creation

during Recovery Drop Database Rename Tablespace

Page 35: How Oracle Database 10g Revolutionizes Availability and Enables the Grid Juan Loaiza Vice President, Systems Technologies Oracle Corporation Session id: