C14-High availability disaster recovery and performance.dmm

54
1 1

Transcript of C14-High availability disaster recovery and performance.dmm

Page 1: C14-High availability disaster recovery and performance.dmm

11

Page 2: C14-High availability disaster recovery and performance.dmm

2

Page 3: C14-High availability disaster recovery and performance.dmm

3

Page 4: C14-High availability disaster recovery and performance.dmm

4

Page 5: C14-High availability disaster recovery and performance.dmm

5

See the example later in the presentation

Page 6: C14-High availability disaster recovery and performance.dmm

6

Additional info on the HADR simulator, including where to get it, in a later slide.

Page 7: C14-High availability disaster recovery and performance.dmm

7

When buffer is large, you may notice that the throughput reported by the primary is higher than that from the standby. This is because primary considers send done when send() call returns (data is copied to TCP send buffer, but may not have reached standby). While standby counts data as received only when it actually gets the data. So the standby side number is more accurate. "Large" here is relative to total amount of data shipped. In the below 1MB buffer example, 289 MB of data was sent in 60 seconds (at 4.8 MB/sec), so 1MB is relatively very small and should not cause much discrepancy in primary and standby numbers. When buffer is large, you can use longer test runs to increase total amount of data shipped to make the buffer relatively small.

Page 8: C14-High availability disaster recovery and performance.dmm

8

db2logscan usage: db2logscan [options] <logFile1> <logFile2> ... <logFileN>

-interval <n> Sample logging rate every <n> seconds. 0 for sampling once perlog file. Default 60.

-checksum Check page checkSum of input files. Detault no check.-infoLevel <s> Logging information display level. Incremental levels are:

rate: Show logging rate. Default level.file: Show meta data of each file.flush: Show each flush.page: Show each page.record: Show each record.

-transLevel <s> Transaction information display level. Incremental levels are:summary: Just a summary at end of run. Default level.open: Show open tranactions at end of run.all: Show start and end of each transaction during the run.

-f <file> Read file names from <file>, one per line.-retrieve <cmd> Run "<cmd> fileName" before opening a file to retrieve the file

from archive, etc.

Page 9: C14-High availability disaster recovery and performance.dmm

9

Page 10: C14-High availability disaster recovery and performance.dmm

10

Page 11: C14-High availability disaster recovery and performance.dmm

11

Page 12: C14-High availability disaster recovery and performance.dmm

12

The operations on the critical path are

Sending log records

Receiving log records

Writing log records to disk

Replaying log records

Page 13: C14-High availability disaster recovery and performance.dmm

13

Page 14: C14-High availability disaster recovery and performance.dmm

14

Page 15: C14-High availability disaster recovery and performance.dmm

15

Page 16: C14-High availability disaster recovery and performance.dmm

16

Page 17: C14-High availability disaster recovery and performance.dmm

17

Page 18: C14-High availability disaster recovery and performance.dmm

18

Page 19: C14-High availability disaster recovery and performance.dmm

19

Page 20: C14-High availability disaster recovery and performance.dmm

20

Page 21: C14-High availability disaster recovery and performance.dmm

21

Page 22: C14-High availability disaster recovery and performance.dmm

22

Page 23: C14-High availability disaster recovery and performance.dmm

23

See a later slide for more info on the log scanner and the hadr calculator.

Page 24: C14-High availability disaster recovery and performance.dmm

24

Page 25: C14-High availability disaster recovery and performance.dmm

25

Page 26: C14-High availability disaster recovery and performance.dmm

26

Page 27: C14-High availability disaster recovery and performance.dmm

27

Superasync vs Log Spooling:

Superasync mode

Eliminates back pressure on the primary

Useful network is slow

Log spooling

Allows receipt of log records regardless of log replay on the standby

Useful with standby is slow

Can be used with any sync mode

Both features can be combined if required

Slow network and a slow standby

Page 28: C14-High availability disaster recovery and performance.dmm

28

Page 29: C14-High availability disaster recovery and performance.dmm

29

Page 30: C14-High availability disaster recovery and performance.dmm

30

Page 31: C14-High availability disaster recovery and performance.dmm

31

Use the database monitor to determine whether the LOGBUFSZ parameter is optimal or not by looking at the following:

Log pages read = 0

Log pages written = 12644

In general, an ideal value would be zero log pages read while seeing a good number of log pages written. When there are too many log pages read, it means a bigger LOGBUFSZ is needed.

Page 32: C14-High availability disaster recovery and performance.dmm

32

Info on the log scanner in a later slide.

Page 33: C14-High availability disaster recovery and performance.dmm

33

Page 34: C14-High availability disaster recovery and performance.dmm

34

See http://www.ibm.com/developerworks/data/library/techarticle/dm-0708naqvi/index.html?ca=dat- for a review of parsing the STMM log.

Page 35: C14-High availability disaster recovery and performance.dmm

35

Page 36: C14-High availability disaster recovery and performance.dmm

36

Page 37: C14-High availability disaster recovery and performance.dmm

37

Page 38: C14-High availability disaster recovery and performance.dmm

38

Page 39: C14-High availability disaster recovery and performance.dmm

39

Changes to monitoring HDAR in v10.5

New behavior to existing fieldsWhen HADR_SPOOL_LIMIT is AUTOMATIC (new value in Kepler), the STANDBY_SPOOL_LIMIT monitor field will return the computed spool size in unit of pages, rather than some special value that represents AUTOMATICIf a member has never connected to the remote db, remote info such as MEMBER, HOST, INST is unknown. Before some of the info might have been returned as blanks or 0. Now we return NULL.

New monitoring fieldsHEARTBEAT_MISSED (available in fp1)

Number of heartbeat messages not received on time on this log stream, accumulated since database startup on the local member Used with HEARTBEAT_EXPECTED to determine network health

HEARTBEAT_EXPECTED (available in fp1)Number of heartbeat messages expected on this log stream, accumulated since database startup on the local memberUsed with HEARTBEAT_MISSED to determine network health

STANDBY_SPOOL_PERCENTPercentage of spool space used, relative to configured spool limit

STANDBY_ERROR_TIMEThe most recent time when standby encountered a major error

HADR_FLAGSASSISTED_REMOTE_CATCHUPASSISTED_MEMBER_ACTIVE STANDBY_LOG_RETRIEVAL STANDBY_RECV_BLOCKED STANDBY_LOG_DEVICE_FULL STANDBY_REPLAY_NOT_ON_PREFERRED

Page 40: C14-High availability disaster recovery and performance.dmm

Positive number = member ID

-1 = current member

-2 = all members

40

Page 41: C14-High availability disaster recovery and performance.dmm

41

Page 42: C14-High availability disaster recovery and performance.dmm

42

Page 43: C14-High availability disaster recovery and performance.dmm

43

To configure data studio alerts:

In the web console, select Open > Alerts and click Health Alerts Configuration.

Select a database for which to view and edit the configurable alert parameters.

Enable or disable database health monitoring for the database. To display alert information for the database in the Health Summary and Alert List, database health monitoring must be enabled. Enable health monitoring by selecting the Monitor database health check box.Note: When you add a database connection, health monitoring is turned on by default. Disable health monitoring by clearing the Monitor database health check box.

Set the refresh rate for the database. The refresh rate controls how often the database is checked for conditions that might trigger alerts.

Configure alerts for the database.

Select an alert type and click Edit.

If prompted, sign in with a database user ID that has permission to manage alerts on the database.

Enable or disable the alert for the database, and configure the critical and warning thresholds for the alert type.

Page 44: C14-High availability disaster recovery and performance.dmm

44

Page 45: C14-High availability disaster recovery and performance.dmm

45

Page 46: C14-High availability disaster recovery and performance.dmm

46

Page 47: C14-High availability disaster recovery and performance.dmm

47

Page 48: C14-High availability disaster recovery and performance.dmm

48

Page 49: C14-High availability disaster recovery and performance.dmm

49

Page 50: C14-High availability disaster recovery and performance.dmm

50

Page 51: C14-High availability disaster recovery and performance.dmm

51

Page 52: C14-High availability disaster recovery and performance.dmm

52

Page 53: C14-High availability disaster recovery and performance.dmm

53

Page 54: C14-High availability disaster recovery and performance.dmm

5454