- DOAG Deutsche ORACLE ...... Oracle Cluster File System, Cloud Edition. ASM / ACFS Enhancements ......

Post on 04-Mar-2020

7 views 0 download

Transcript of - DOAG Deutsche ORACLE ...... Oracle Cluster File System, Cloud Edition. ASM / ACFS Enhancements ......

<Insert Picture Here>

Oracle RAC und das 11.2.0.2 Patch Set

- mehr als ein nur Patch Set !?Markus Michalewicz

Principal Product Manager Oracle RAC

The following is intended to outline our general

product direction. It is intended for information

purposes only, and may not be incorporated into any

contract. It is not a commitment to deliver any

material, code, or functionality, and should not be

relied upon in making purchasing decisions.

The development, release, and timing of any

features or functionality described for Oracle’s

products remain at the sole discretion of Oracle.

<Insert Picture Here>

Agenda

• Introducing Oracle Database 11.2.0.2

• New Features in 11.2.0.2

• Quality of Service (QoS) Management

• Oracle RAC One Node Integration

• Oracle Cluster File System, Cloud Edition

• Redundant Interconnect Usage

• Virtualized Flexibility (policy change)

• Cluster Health Management with 11.2.0.2• Cluster Health Monitor

• Hang Manager

Oracle Database 11g Release 2, Patch Set 1� 11.2.0.2 – the new Patch Set generation

• Upgrade + New Features

• Upgrade + Full Installation

• And always “Out-of-place” (*)

(*) Patch uses new Oracle Home

<Insert Picture Here>

Agenda

• Introducing Oracle Database 11.2.0.2

• New Features in 11.2.0.2

• Quality of Service (QoS) Management

• Oracle RAC One Node Integration

• Oracle Cluster File System, Cloud Edition

• Redundant Interconnect Usage

• Virtualized Flexibility (policy change)

• Cluster Health Management with 11.2.0.2• Cluster Health Monitor

• Hang Manager

• Assure predicable performance Assure predicable performance Assure predicable performance Assure predicable performance for consolidated applicationsfor consolidated applicationsfor consolidated applicationsfor consolidated applications

• Dynamically allocate resources Dynamically allocate resources Dynamically allocate resources Dynamically allocate resources to meet to meet to meet to meet SLAsSLAsSLAsSLAs

• Reduce IT costs by optimizingReduce IT costs by optimizingReduce IT costs by optimizingReduce IT costs by optimizingresource useresource useresource useresource use

Quality of Service ManagementQuality of Service ManagementQuality of Service ManagementQuality of Service Management

Improving the Corporate Bottom-LineProactive SLA Management

PolicyPolicyPolicyPolicy----DrivenDrivenDrivenDrivenArchitectureArchitectureArchitectureArchitecture

Implement and Control

Define and Enable

Evaluate and Report

Analyze and Recommend

Classify and Measure

Quality of Service Management (QoS)The Oracle approach in action

Quality of Service Management (QoS)Integrated task-based mananagment

<Insert Picture Here>

Oracle RAC One

Node Integration

Oracle RAC One NodeThe “Always On” Single Instance Database

• Always On Single Instance Database– Online replacement of servers and storage

– Online patching and upgrades of operating

system and infrastructure software

– Online database patching

– Automated cluster failover

• Better consolidation – Extreme consolidation of servers & storage

– Load balancing to protect service levels

• Enhanced virtualization– Extends and improves database availability and

flexibility when running in a virtual server

Oracle RAC One NodeWhat’s New in 11.2.0.2

• DBCA support– In 11.2.0.1 an Oracle RAC One Node Database was created as a standard Oracle

RAC database and converted to an Oracle RAC One Database later

– 11.2.0.2 DBCA enables allows to create an Oracle RAC One Node Database so that

the post-installation step are unnecessary.

• SRVCTL support

– 11.2.0.2 SRVCTL provides complete Oracle RAC One Node support

• Enterprise Manager support

– 11.2.0.2 Enterprise Manager DB Control provides Oracle RAC One Node support

• Omotion is now “Online Database Relocation”– 11.2.0.2 SRVCTL supports “Online Database Relocation”

• Oracle RAC One Node will be supported

on all OS certified for Oracle RAC

DBCA Support for Oracle RAC One Node

<Insert Picture Here>

Oracle Cluster File

System, Cloud Edition

ASM / ACFS EnhancementsExtending ASM to manage ALL data

• ACFS is now: Oracle Cluster File System, Cloud Edition

• ACFS/ADVM ports for Solaris and AIX

• ACFS snapshot for Solaris and AIX

• New Features for Linux and Windows

– ACFS Tagging

– ACFS Replication

– ACFS Security

– ACFS Encryption

ASM / ACFS EnhancementsWhat's new in 11.2.0.2 regarding Oracle ASM?

• ACFS Tagging assigns a common naming attribute

to a group of files

• File operations can be performed based on tagging

attributes

• Tagging is configured and managed by ‘acfsutil tag’

command line

Oracle Cluster File System, Cloud Edition(ACFS) Tagging

• Enables replication of ACFS file systems across the

network to a remote site providing DR functionality

• Replication logs are used

to replicate from primary to standby

• A replication configuration is established through

‘acfsutil repl’ command line

Oracle Cluster File System, Cloud Edition(ACFS) Replication

• Enables setting security policies and groups for

accessing file system objects

• Fine grained control in addition to OS ACLs

• Enforces security policies through

realms, rules, rule sets and command rules

• Integrated with Oracle system management tools

– ASMCA support for basic configurations

– Extensions to ACFS command line tools for easy use via acfsutil

Oracle Cluster File System, Cloud Edition(ACFS) Security

• Provides protection for data-at-rest

– User data is encrypted before being written to secondary storage

– Data stored on disk is useless without encryption keys

• Encrypt the entire file system or just files or directories

• Transparent to users

– Users authorized to read a file get decrypted data

• Transparent to applications

– Applications work unchanged with encrypted files

• Encrypted and unencrypted files can co-exist in a file system

Oracle Cluster File System, Cloud Edition(ACFS) Encryption

<Insert Picture Here>

Redundant Interconnect Usage

Redundant Interconnect UsageIntegrated interconnect HA for the RAC database

• Redundant Interconnect Usage can be used as a bonding alternative

– It works for “private networks” only; the nodeVIPs use a different approach

– It enables HA and Load Balancing for up to 4 NICs per server (on Linux / Unix)‏

– It can be used by Oracle Databases 11.2.0.2 and Oracle Clusterware 11.2.0.2

– It uses so called HAIPs that are assigned to the private networks on the server

HAIP1

HAIP2

HAIP3

HAIP4

Redundant Interconnect UsageHow it works – in general

• A multiple listening endpoint approach is used

– The HAIPs are taken from the “link-local” (Linux / Unix) IP range (169.254.0.0)‏

– To find the communication partners, multicasting on the interconnect is required

– Details in My Oracle Support Note with Doc ID 1212703.1:

11.2.0.2 Grid Infrastructure Install or Upgrade may fail due to Multicasting

HAIP1

HAIP3

HAIP4

HAIP1

HAIP2

<Insert Picture Here>

Virtualized Flexibility

VMware for Oracle RAC

• Oracle VM is Oracle's standing recommendation

• Oracle VM is and will be more integrated with Oracle's solutions

• Oracle RAC can now be used in VmWare

– This is not a certification

– This does not imply full support

– Restriction has been lifted to

support Oracle RAC in VmWare to the degree

a Single Instance Oracle Database is supported.

• For details see: My Oracle Support Note Doc ID 249212.1

Support Position for Oracle Products Running on VMWare

Virtualized Environments

Oracle RAC was not support in VmWareRestriction has been lifted to Oracle Database standard

<Insert Picture Here>

Agenda

• Introducing Oracle Database 11.2.0.2

• New Features in 11.2.0.2

• Quality of Service (QoS) Management

• Oracle RAC One Node Integration

• Oracle Cluster File System, Cloud Edition

• Redundant Interconnect Usage

• Virtualized Flexibility (policy change)

• Cluster Health Management with 11.2.0.2• Cluster Health Monitor

• Hang Manager

What is the Cluster Health Monitor (CHM)?Introduction

• The Cluster Health Monitor (CHM)

(formerly a.k.a. Instantaneous Problem Detector for Clusters or IPD/OS)

is designed to

– detect and to analyze operating system (OS)

– and cluster resource related degradation and failures

– in order to bring more explanatory power to many issues that occur in clusters,

in which Oracle Clusterware and / or Oracle RAC are used, e.g. node evictions.

Why should you use CHM?Because there is a Monday morning for example

• Assume the following scenario:

– Leaving the office Friday night

– Getting an email that one node in the cluster rebooted on Sunday morning

– Getting a question from your manager why that node rebooted on Monday

• Typical way of addressing this question:

– Gather and analyze Oracle Clusterware and operating system logs

(e.g. following MOS doc 330358.1 - CRS 10gR2/ 11gR1/ 11gR2 Diagnostic Collection Guide)

– Open a Service Request with Oracle Support

• Possible outcomes:

– Oracle Support finds the answer in one of the logs

– Oracle Support needs more node specific information to answer the question

• For the latter: This why you need Cluster Health Monitor (CHM) for example

Why should you use CHM?Because you want to prevent another incident

• Based on the previous scenario:

– It is determined that the reboot was caused by

an abnormally high CPU load in conjunction with extreme IO waits.

– Your manager asks you:

What caused the high CPU load? What can we do to prevent this in future?

• For the latter: CHM provides a historical view on collected data for analyzes

– >crfgui -d "00:05:00" -m 192.168.2.8

– Cluster Health Analyzer V1.10 Look for Loggerd via node 192.168.2.8

...reading 300 sec from the past

Connected to Loggerd on rac1

Note: Node rac1 is now up

Cluster 'MyCluster', 2 nodes. Ext time=2010-08-18 23:22:30

Future Development of CHMWhat you will find in Oracle Grid Infrastructure 11.2.0.2

• Cluster Health Monitor has been integrated with

Oracle Grid Infrastructure 11.2.0.2 as follows:

– The data gathering part of the tool will be part of the standard installation

– CHM will therefore be installed into the Oracle Grid Infrastructure home

– Some parts of the architecture changed (no oproxyd), principles remain

– The GUI remains as a separately downloadable item

� Note that the 11.2.0.2 version is currently pending

• Going forward, all OS supported for Oracle Grid Infrastructure

will be supported for Cluster Health Monitor.

– More Operating Systems are planned to be supported for CHM

as 11.2.0.2 becomes available on these Operating Systems

<Insert Picture Here>

Frequently Asked Questions (FAQ) ‏‏ ‏‏

FAQ #1Is CHM a CVU (Cluster Verification Utility) replacement?

• NO

• CVU is a separate tool with

a completely different purpose.

• CVU does not gather nor

provide the same data that

CHM provides.

• CVU has also been integrated

with Oracle Grid Infrastructure 11.2.0.2 (CVU resource) ‏‏ ‏‏

• For more information on CVU

got to http://www.oracle.com/goto/rac

– On this page, follow this link: Cluster Verification Utility - Download

FAQ #2Can CHM be used as an OS Watcher replacement?

• YES

• OS Watcher (OSW) is a collection of UNIX shell scripts(*) intended to

– collect and archive operating system and network metrics

– to aid support in diagnosing performance issues.

• OSW is provided by and made available through Oracle Support.

– For more information see MOS doc ID 301137.1 - OS Watcher User Guide

• Note: OS Watcher may have some specific environments, in which it

provides additional information (e.g. Version 3.0 OS of Watcher adds

additional collections for Exadata, as per the MOS note mentioned.)

(*) on Unix – there is also a Windows version of OS Watcher

<Insert Picture Here>

Hang Manager

Hang Manager IntroductionWhy is a Hang Manager required?

• Customers experience database hangs for a variety of reasons

• High system load

• Workload contention

• Network congestion or errors

• Bugs & Internal performance issues...

• Before Oracle Grid Infrastructure 11.2.0.2

• Oracle required information to troubleshoot a hang

– e.g.: System state dumps

– For RAC: global system state dumps

– Customer usually had to reproduce with additional parameters

• With Oracle Grid Infrastructure 11.2.0.2

• Mechanism to automatically detect and resolve hangs

Ultimate

waiter

Hang

Deadlock Self-deadlock

Hang Manager IntroductionTerms and Definitions

• Deadlock

• Closed Chain or Cycle

• Hang Manager does not resolve deadlocks

• Hang

• Open Chain with a

• Root:

– Final blocker of hang or open chain

• Immediate Waiter:

– Session being blocked by root of hang

• Ultimate Waiter:

– Blocked session in the end of the chain or hang

root

Immediate waiter

Hang Manager ConceptsHow it works – general considerations

• The starting point is generally a wait event

• The RDBMS has specific mechanisms to indicate a session is blocked

• A session can be blocked waiting on internal or external resources

• The main problem: When does a wait represent a hang?

• Sessions can be blocked for a very long time

• Once we have detected a hang – do we always want to remove it?

• A hang might be a local hang or a global hang (RAC only) ‏

• A hang might affect a few or a huge number of sessions

• A hang might be caused by a background process

• A hang might affect a foreground as well as background processes

Hang Manager ConceptsHow it works – it's a process

• Detection: 3 snapshots over 97 to 128 seconds

• Heuristics (internal implementation) – consideration points:

• Consider local and global hangs (more severe) ‏

• Some hangs can get ignored, sometimes because they resolve themselves later. Temporary hangs will always be ignored at first.

Detect

Heuristics

Analyze and Verify

Decide to resolve

Hang Manager ConceptsHow it works – it's a process

• Analyze and verify: the verification interval defaults to 46 seconds

• A hang resolution today will take around 3 minutes (+/- a few secs.)

• Hang resolution:

• The scope can be configured using “off”, process (default), instance

• Before resolving a hang ORA-32701 incident is triggered, written to alert.logs of all instances. Additional information in DIAG / trace.

Detect

Heuristics

Analyze and Verify

Decide to resolve

<Insert Picture Here>

More Information

• My Oracle Support Notes:

– ID 1212703.1 - 11.2.0.2 Grid Infrastructure Install or Upgrade

may fail due to Multicasting Requirement

– ID 249212.1 - Support Position for Oracle Products Running

on VMWare Virtualized Environments

• http://www.oracle.com/goto/rac

– Oracle Real Application Clusters 11g Release 2 Technical Overview

– Oracle Real Application Clusters One Node 11g Release 2 Technical

Overview

• http://www.oracle.com/goto/clusterware

– Oracle Clusterware 11g Release 2 Technical Overview

• http://www.oracle.com/goto/asm

More Information

<Insert Picture Here>

QuestionsandAnswers