VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE...

88
LAST UPDATED: 06/23/2020 VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

Transcript of VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE...

Page 1: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

     

LAST UPDATED: 06/23/2020

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

Page 2: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

 2

Table of Contents1 Introduction __________________________________________________________5

1.1 Revision ....................................................................................................................................... 5

1.2 Target Audience .......................................................................................................................... 5

1.3 Before You Begin ......................................................................................................................... 5

1.4 Terminology ................................................................................................................................ 5

1.5 Prerequisites ............................................................................................................................... 5

1.6 Supported vSphere versions  ..................................................................................................... 6

2 Solution Overview _____________________________________________________7

2.1 Introduction to InfiniBox Active-Active replication................................................................... 7

2.2 Introduction to VMware vSphere Metro Storage....................................................................... 7

2.3 Uniform / Non-uniform Host Access types ................................................................................ 7

2.4 ESXi hosts and Active-Active Volumes Relationships................................................................ 7

2.5 Stretched Storage Architecture.................................................................................................. 8

3 Preparing the vSphere and InfiniBox Environment __________________________10

3.1 Using INFINIDAT Host PowerTools for VMware ....................................................................... 10

3.2 Setting InfiniBox Best Practices for vSphere ........................................................................... 10

3.3 Active-Active link between the InfiniBox systems................................................................... 10

4 Configuring vMSC and Active-Active Datastores ____________________________11

5 vMSC with Uniform Host Access _________________________________________12

5.1 Setting up the ESXi Hosts on InfiniBox..................................................................................... 12

5.2 Setting the optimized InfiniBox system for each host ............................................................ 15

6 vMSC with Non-uniform Host Access _____________________________________18

6.1 Configuring the ESXi hosts for Non-uniform access................................................................ 18

6.2 Setting up the ESXi Hosts on InfiniBox..................................................................................... 19

7 Provisioning Active-Active Datastores ____________________________________22

7.1 Mapping the other peer(s) ........................................................................................................ 24

7.2 Ensuring proper volumes access.............................................................................................. 24

7.3 Creating VMFS ........................................................................................................................... 26

8 vSphere Cluster Configuration __________________________________________28

Page 3: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

 3

8.1 vSphere Availability  ................................................................................................................. 28

8.2 VM/Host Groups and Rules ....................................................................................................... 28

8.3 Virtual Machines and Datastore Placement ............................................................................ 32

8.4 vMSC with Active-Active Datastores Configuration Diagram.................................................. 33

9 Failure Scenarios and Expected Response _________________________________35

9.1 ESXi Host Failures ..................................................................................................................... 35

9.2 Storage Array Failures............................................................................................................... 35

9.3 Complete Storage Failures on Non-uniform Configuration ................................................... 35

9.4 Complete Loss of a Access to a Storage Array on Uniform Configuration  ............................ 39

10 How InfiniBox Handles Failures__________________________________________45

10.1 InfiniBox Witness....................................................................................................................... 45

10.2 Preferred system  ...................................................................................................................... 45

10.3 Storage Failover  ....................................................................................................................... 45

10.4 Storage Replication Resynchronization and Recovery ........................................................... 45

11 InfiniBox Components Failures   _________________________________________46

12 Introduction _________________________________________________________47

12.1 Revision ..................................................................................................................................... 47

12.2 Target Audience ........................................................................................................................ 47

12.3 Before You Begin ....................................................................................................................... 47

12.4 Terminology .............................................................................................................................. 47

12.5 Prerequisites ............................................................................................................................. 47

12.6 Supported vSphere versions  ................................................................................................... 48

13 Solution Overview ____________________________________________________49

13.1 Introduction to InfiniBox Active-Active replication................................................................. 49

13.2 Introduction to VMware vSphere Metro Storage..................................................................... 49

13.3 Uniform / Non-uniform Host Access types .............................................................................. 49

13.4 ESXi hosts and Active-Active Volumes Relationships.............................................................. 49

13.5 Stretched Storage Architecture................................................................................................ 50

14 Preparing the vSphere and InfiniBox Environment __________________________52

14.1 Using INFINIDAT Host PowerTools for VMware ....................................................................... 52

14.2 Setting InfiniBox Best Practices for vSphere ........................................................................... 52

Page 4: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

 4

14.3 Active-Active link between the InfiniBox systems................................................................... 52

15 Configuring vMSC and Active-Active Datastores ____________________________53

16 vMSC with Uniform Host Access _________________________________________54

16.1 Setting up the ESXi Hosts on InfiniBox..................................................................................... 54

16.2 Setting the optimized InfiniBox system for each host ............................................................ 57

17 vMSC with Non-uniform Host Access _____________________________________60

17.1 Configuring the ESXi hosts for Non-uniform access................................................................ 60

17.2 Setting up the ESXi Hosts on InfiniBox..................................................................................... 61

18 Provisioning Active-Active Datastores ____________________________________64

18.1 Mapping the other peer(s) ........................................................................................................ 66

18.2 Ensuring proper volumes access.............................................................................................. 66

18.3 Creating VMFS ........................................................................................................................... 68

19 vSphere Cluster Configuration __________________________________________70

19.1 vSphere Availability  ................................................................................................................. 70

19.2 VM/Host Groups and Rules ....................................................................................................... 70

19.3 Virtual Machines and Datastore Placement ............................................................................ 74

19.4 vMSC with Active-Active Datastores Configuration Diagram.................................................. 75

20 Failure Scenarios and Expected Response _________________________________77

20.1 ESXi Host Failures ..................................................................................................................... 77

20.2 Storage Array Failures............................................................................................................... 77

20.3 Complete Storage Failures on Non-uniform Configuration ................................................... 77

20.4 Complete Loss of a Access to a Storage Array on Uniform Configuration  ............................ 81

21 How InfiniBox Handles Failures__________________________________________87

21.1 InfiniBox Witness....................................................................................................................... 87

21.2 Preferred system  ...................................................................................................................... 87

21.3 Storage Failover  ....................................................................................................................... 87

21.4 Storage Replication Resynchronization and Recovery ........................................................... 87

22 InfiniBox Components Failures   _________________________________________88

Page 5: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

INTRODUCTION  –  5

••

••

••

••••

1 IntroductionThe following guide explores the best practices and configurations for using INFINIDAT InfiniBox Active-Active Replication with VMware vSphere Metro Storage Cluster (vMSC). InfiniBox Active-Active Replication was introduced with the 5.0 release, and together with VMware vSphere provides a highly resilient solution for protecting applications data, availability and allows an extremely fast recovery even in the event of a full site failure. 

The most up-to-date version of this document can be found on the Infinidat Support site: https://support.infinidat.com/hc/en-us/articles/360002174197

1.1 RevisionLast updated on: March 1, 2020

1.2 Target AudienceThis document is intended for storage, system and VMware administrators that plan to deploy or manage InfiniBox Active-Active replication with VMware vSphere Metro Storage Cluster configuration.

The authors of this document assume that the reader is familiar with the following:

InfiniBox storage resources and Active-Active replication.VMware vSphere, vCenter Server and High Availability (vSphere HA) solutions.

For more information and assistance with INFINIDAT InfiniBox please visit support.infinidat.com

1.3 Before You BeginPrior to setting up InfiniBox Active-Active replication with VMware vSphere Metro Storage Cluster (vMSC) it is advised to read the INFINIDAT InfiniBox documentation for Active-Active replication and VMware vSphere® Metro Storage Cluster Recommended Practices.

1.4 TerminologyActive-Active volume: a volume that is undergoing Active-Active (A-A) replication.Peers: a pair of volumes that are undergoing Active-Active replication relationship, are also referred as "peers".Active-Active datastore: a datastore that resides on an Active-Active volume.ALUA: asymmetric logical unit access. 

1.5 Prerequisites Two InfiniBox 5.0 or later systems.FC connectivity. (I/O)Ethernet connectivity between the InfiniBox systems. (replication)Maximum of 5ms RTT latency between the InfiniBox systems.

Page 6: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

INTRODUCTION  –  6

• For additional vSphere Metro Storage related requirements please refer to VMware documentation.

1.6 Supported vSphere versions See an up-to-date list on the INFINIDAT Interoperability Matrix website.

Page 7: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

SOLUTION OVERVIEW  –  7

••

2 Solution OverviewDeploying VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox Active-Active replication provides a highly available and resilient solution for protecting applications availability and data, with a minimal performance impact. 

2.1 Introduction to InfiniBox Active-Active replicationInfiniBox Active-Active replication provides zero-RPO and zero-RTO, enabling mission critical business-services to keep operating even through a complete site failure:

Symmetric synchronous replication solution, applications can be geographically clustered.Fully integrated into InfiniBox, allows simple management of application spread across data centers.

2.2 Introduction to VMware vSphere Metro StorageA VMware vSphere Metro Storage Cluster configuration is a specific storage configuration that combines replication with array-based clustering. These solutions are typically deployed in environments where the distance between data centers is limited, often metropolitan or campus environments.

The primary benefit of a stretched cluster model is that it enables fully active and workload-balanced data centers to be used to their full potential and it allows for an extremely fast recovery in the event of a host or even full site failure.vSphere ESXi servers are in a single vCenter cluster and can be spreaded across sites (separate datacenters or geographic areas) 

2.3 Uniform / Non-uniform Host Access typesvMSC solutions are classified into two distinct types. These categories are based on a fundamental difference in how the vSphere hosts access the storage systems, which influences design considerations.

Uniform host access - vSphere hosts on both sites are all connected to the storage systems across both sites. LUN paths presented to vSphere hosts are stretched across the sites.Non-uniform host access - vSphere hosts at each site are connected only to the local storage system at the same site. LUN paths presented to vSphere hosts from storage nodes are limited to the local site.

2.4 ESXi hosts and Active-Active Volumes RelationshipsESXi hosts identify both peers as the same storage device. 

InfiniBox Active-Active replication is supported with both uniform and non-uniform host access types. 

Page 8: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

SOLUTION OVERVIEW  –  8

2.5 Stretched Storage ArchitectureWith a Stretched Storage architecture there are two InfiniBox storage array linked in an Active-Active relationship, one is on site A and another on site B. The ESXi hosts on both sites can be connected to both or one of the InfiniBox systems (uniform / non-uniform). When an Active-Active datastore is provisioned to the cluster, the ESXi hosts identify both peers of the Active-Active volume as the same datastore and device. It is possible to read and write simultaneously using both of the peers, while all writes are synchronously replicating between the InfiniBox systems.

Page 9: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

SOLUTION OVERVIEW  –  9

Page 10: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

PREPARING THE VSPHERE AND INFINIBOX ENVIRONMENT  –  10

•••

••

3 Preparing the vSphere and InfiniBox EnvironmentESXi hosts should be installed and configured with a vSphere HA cluster.

3.1 Using INFINIDAT Host PowerTools for VMwareIt is highly recommended to install Host PowerTools for VMware with every vSphere vCenter Server environment. Host PowerTools for VMware provides:

Ease-of-use.Storage and host automation.Best practices validation.

Fore more information on how to install and use Host PowerTools for VMware see: Host PowerTools for VMware

3.2 Setting InfiniBox Best Practices for vSphereValidate that the vSphere cluster is configured according to the InfiniBox best practices for vSphere.

The simplest method to apply the InfiniBox best practices is by using Host PowerTools for VMware. For instruction see the following guide: ESXi hosts and clusters readinessIn order to manually configure the ESXi hosts according to InfiniBox best practices, please refer to the following guide: List of items being checked by Host PowerTools for VMware

3.3 Active-Active link between the InfiniBox systemsThe InfiniBox systems should be configured with a replication link.

The link must support Active-Active replication.For more information on how to configure an Active-Active link, see: InfiniBox Best Practices Guide for Setting Up a Replication Service

InfiniBox Active-Active volumes are supported by ESXi native multipath software.

Page 11: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

CONFIGURING VMSC AND ACTIVE-ACTIVE DATASTORES  –  11

••

4 Configuring vMSC and Active-Active Datastores

Follow the instruction on the following chapters based on the desired host access configuration. (uniform / non-uniform)

The vMSC configuration depends on the host access type.If uniform access is desired, FC connectivity between the sites is required (used for I/O) in addition to the Ethernet connectivity used for replication.The guide will explore the configuration process mostly using the InfiniBox Web-GUI.

Prior to configuring vMSC and InfiniBox Active-Active Datastores, ensure that the environment is configured according to requirements on the "Preparing the vSphere and InfiniBox Environment" section above.

Page 12: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VMSC WITH UNIFORM HOST ACCESS  –  12

5 vMSC with Uniform Host AccessWhen configuring uniform host access, the ESXi hosts can access the same datastore through both InfiniBox systems - the system that exists on the same site (local) and the remote system on the other site. 

Typically, the datastore paths to the remote system will be less optimized than the paths to the datastore on the local system, due to the extra travel between the sites which carries added latency.

The InfiniBox system can intelligently hint the ESXi hosts about which are the optimal paths to serve I/O. This is further discussed later on. 

FC connectivity between the site is required.

5.1 Setting up the ESXi Hosts on InfiniBoxConnect all the ESXi hosts in the vSphere cluster to both InfiniBox systems. 

Page 13: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VMSC WITH UNIFORM HOST ACCESS  –  13

1.

2.•

3.

For each host it is recommended to have one path to each InfiniBox node (the nodes of both systems) from two initiators. (12 paths in total)The hosts can access both InfiniBox systems and will be able to see paths to the datastore from both systems.

5.1.1 Creating InfiniBox Hosts objectsOnce all hosts are connected (zoned), a corresponding InfiniBox host object should be created for each of the ESXi hosts, on both systems. 

In order to create the InfiniBox hosts objects:

Login to one of the InfiniBox systems using the management console.

Create a new host object. Add the corresponding host's ports (initiators).

Repeat steps 2 until all the ESXi host are created. 

The steps below describe how to create (register) the vSphere cluster on InfiniBox

The simplest method to register a uniform vSphere cluster is by using Host PowerTools for VMware. For instruction see: Registering ESXi hosts and clusters

If Host PowerTools for VMware is used to register the vSphere cluster, sections "Creating InfiniBox Hosts objects" and "Creating InfiniBox Cluster object" can be skipped.Make sure that the vSphere cluster is registered on both systems.

Page 14: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VMSC WITH UNIFORM HOST ACCESS  –  14

4.

1.

2.

3.

4.

Once all hosts are created, repeat the steps above on the other system.

Upon completion, all the vSphere cluster's hosts should exists as an InfiniBox host objects on both system.

5.1.2 Creating InfiniBox Cluster objectOnce all host are created, add them to an InfiniBox cluster object.

The InfiniBox cluster object aggregates host objects and enables to map volumes to all of the cluster's hosts at once. 

In order to create the InfiniBox cluster object:

Select the "Host & Cluster" icon on the left bar.

Click the "Create" button and select "Cluster".

Add all the previously created hosts to the cluster. 

Repeat steps on the other system.

Upon completion, all the vSphere hosts objects should be added to an InfiniBox cluster object on both system.

Page 15: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VMSC WITH UNIFORM HOST ACCESS  –  15

••

•••

•••

5.2 Setting the optimized InfiniBox system for each hostThe InfiniBox system can intelligently hint using ALUA the ESXi hosts about which are the optimal paths to serve I/O to each volume.

ALUA is a standard used for identifying paths prioritization between the storage and hosts, and enables the initiators to query the target about path attributes, such as the paths ALUA state.

This settings is controlled by an InfiniBox Host object option, which sets the host's "Optimized / Non-Optimized" setting.

By default host objects are created as "Optimized".The InfiniBox system hints the ESXi hosts by setting the hosts's mapped volumes paths ALUA state to "Optimized / Non-Optimized".

5.2.1 InfiniBox Host objects Optimize/Non-optimized configuration Configure the InfiniBox Host objects as follow:

On InfiniBox - Site A:Ensure that the hosts which are located on Site A are set to "Optimized".The hosts that are located on Site B should be set to "Non-Optimized".

On InfiniBox - Site B:Ensure that the hosts which are located on Site B are set to "Optimized".The hosts that are located on Site A should be set to "Non-Optimized".

Settings this properly is crucial when configuring vMSC with a Uniform host access, as the ESXi hosts are presented with datastore paths also from the remote InfiniBox system, which are typically less optimal. 

Not required for a Non-uniform configuration.

Page 16: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VMSC WITH UNIFORM HOST ACCESS  –  16

1.

2.

3.

4.

5.

Settings the InfiniBox hosts to Optimized / Non-Optimized 

In order to set a host to "Optimized / Non-Optimized":

Login to the InfiniBox system using the management console.

Select the "Host & Cluster" icon on the left bar.

Right click on a Host object.

Select "Modify Host".

Set the "Path ALUA state" to the proper option. 

Ensure proper configuration for all ESXi hosts on both InfiniBox system. 

The simplest method to set a vSphere host to Optimized/Non-optimize is by using Host PowerTools for VMware.

Page 17: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VMSC WITH UNIFORM HOST ACCESS  –  17

Page 18: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VMSC WITH NON-UNIFORM HOST ACCESS  –  18

6 vMSC with Non-uniform Host AccessWhen configuring non-uniform host access, the ESXi hosts on each site can access the storage only through the local InfiniBox system - the system that exists on the same site (local).

6.1 Configuring the ESXi hosts for Non-uniform accessWhen configuring Active-Active volumes using Non-uniform connectivity it is required to set the ESXi hosts objects on InfiniBox to “ESXi” type.

"ESXi" type - Makes InfiniBox issue a PDL SCSI sense response to the host in case a mapped Active-Active peer no longer synchronized. (cannot serve R/W IO, becomes "offline")If the host type is not set to "ESXi", vSphere HA cannot properly detect that a mapped Active-Active peer is no longer synchronized ("offline").

Therefore, in a Non-uniform environment, vSphere HA will not try to recover affected VMs that are running on hosts which can access only the "offline" peer.

Setting the ESXi hosts objects on InfiniBox to the “ESXi” type is further discussed below on the "Setting up the ESXi Hosts on InfiniBox" section.

Page 19: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VMSC WITH NON-UNIFORM HOST ACCESS  –  19

1.

2.

3.•

••

6.2 Setting up the ESXi Hosts on InfiniBoxConnect the ESXi hosts in each site only to the local InfiniBox systems. 

For each host it is recommended to have one path to each InfiniBox node from two initiators. (6 paths in total)The hosts on each site can access only the local InfiniBox system and therefore will be able to see paths to the datastore only from the local system.

6.2.1 Creating InfiniBox Hosts objectsAfter the hosts are connected, in order to be able to provision storage to the vSphere cluster, a corresponding InfiniBox host object must be created for each of the ESXi hosts.

Each host object contains the host’s initiators ports, which then can be mapped to a volume.

In order to create the InfiniBox hosts objects:

Login to the InfiniBox system on Site A using the management console.

Create a new host object. 

Set the created host object type to "ESXi".This is settable only using InfiniShell. 

Use the following command: "host.set_host_type host=<esxi-host-name> host_type=ESXi"When creating the hosts using InfiniShell it is possible to set the host type on creation, using the "host_type=ESXi" argument. 

Page 20: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VMSC WITH NON-UNIFORM HOST ACCESS  –  20

4.

5.

1.

2.

3.

4.

5.

Repeat steps 2-3 until all the ESXi hosts that reside on the same site are created. 

Repeat the steps on the InfiniBox system in Site B.

Upon completion, all the host in Site A should exist on the system in Site A and all the hosts in Site B should exist on the system in Site B.

Site A:

Site B:

6.2.2 Creating InfiniBox Cluster objectOnce all host are created, add them to an InfiniBox cluster object. The InfiniBox cluster object aggregates host objects and enables to map volumes to all of the cluster's hosts at once. 

In order to create the InfiniBox cluster object:

Login to the InfiniBox system on Site A using the management console.

Select the "Host & Cluster" icon on the left bar.

Click the "Create" button and select "Cluster".

Add all the previously created hosts to the cluster. 

Repeat steps on the system in Site B.

Page 21: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VMSC WITH NON-UNIFORM HOST ACCESS  –  21

Upon completion, all the vSphere hosts objects should be added to an InfiniBox cluster object on both system.

Each InfiniBox cluster represent the hosts which reside on the same site. 

Site A: 

Site B:

Page 22: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

PROVISIONING ACTIVE-ACTIVE DATASTORES  –  22

1.

2.

3.

••

4.•

7 Provisioning Active-Active DatastoresIn case virtual machines are designed to simultaneously run in both sites, it is advised to provision at least two Active-Active datastores.

Login to one of the InfiniBox systems using the management console.

Create two new volume.

Configure Active-Active replication to the remote system on one of the previously created

volumes.Keep the Preferred system option as Local.Upon success, an Active-Active replication is set and a volume peer is created on the remote system.

Configure Active-Active replication to the remote system also on the other volume. This time set the Preferred system option to Remote.

Virtual machines in each site should reside on an Active-Active datastore which his preferred peer is set on the local InfiniBox system. (the system on the same site)

Page 23: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

PROVISIONING ACTIVE-ACTIVE DATASTORES  –  23

5.•

6.

Upon success two Active-Active replications are created. For one "System A" is set as preferred and for the other "System B" .

Map the two Active-Active volume to the previously created InfiniBox cluster object.

Page 24: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

PROVISIONING ACTIVE-ACTIVE DATASTORES  –  24

1.

2.•

3.

4.

••

7.1 Mapping the other peer(s)Login to the other InfiniBox system using the management console.

Map the Active-Active volume peers to the previously created InfiniBox cluster object.It is recommended to map each peer using the same LUN ID in both systems. 

Login to the vSphere Web Client.

Perform a storage rescan on the vSphere cluster.

7.2 Ensuring proper volumes accessEnsure that the hosts properly see the paths to the mapped Active-Active volume.

The paths state and the number of paths to the Active-Active volume depends on the host access type. Optimized paths are presented as "Active (I/O)", which are the paths to the local InfiniBox system. 

As long as there are "Active (I/O)" paths available, All R/W IO will go through these paths which provide optimal performance.

Non-optimized are presented as "Active", which are the paths to the InfiniBox system in the other site. 

7.2.1 Non-uniform Host Access It is recommended that each host will have one path to each of its local InfiniBox nodes from two initiators. (six paths in total)

In the screenshot below, there is an InfiniBox Active-Active volume (storage device) which is mapped to an ESXi host. The Active-Active volume is presented to the host only from the local system, therefore all the paths are in "Active (I/O)" status which indicates they are optimized. 

The Active-Active volumes are now presented to each ESXi host (in both sites), from both InfiniBox systems. (uniform access)  

Page 25: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

PROVISIONING ACTIVE-ACTIVE DATASTORES  –  25

7.2.2 Uniform Host Access It is recommended that each host will have one path to each InfiniBox node (the nodes of both systems) from two initiators. (Twelve paths in total, six to each system)

In the screenshot below, there is an InfiniBox Active-Active volume (storage device) which is mapped to an ESXi host from both systems.

The six "Active (I/O)" paths are the optimized path to the local InfiniBox system where the ESXi host is set as optimized.The other "Active" path are non-optimized to the remote InfiniBox system and will be used only in case all the optimized paths are down.

Page 26: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

PROVISIONING ACTIVE-ACTIVE DATASTORES  –  26

7.3 Creating VMFSCreate a VMFS Datastore over the Active-Active volume (storage device). 

Located the vSphere cluster "New Datastore..." Wizard:

Select the Active-Active volume LUN and walk-through the wizard. 

Page 27: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

PROVISIONING ACTIVE-ACTIVE DATASTORES  –  27

Upon success the created datastore should be mounted on all hosts. 

The same Active-Active datastore can perform reads and writes through both systems simultaneously

Page 28: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VSPHERE CLUSTER CONFIGURATION  –  28

••

••

•••

8 vSphere Cluster ConfigurationThe following settings configuration is recommended for the vMSC vSphere Cluster:

8.1 vSphere Availability Validate that the vSphere HA is turned-on. ("TURN ON vSphere HA" is checked)

Allows automatic virtual machines fail-over in the case of a failure. It is advised to follow VMware best practices for vSphere HA configuration. For more information please refer to VMware documentation.

8.1.1 Failures and ResponsesEnsure that "Enable Host Monitoring" is enabled. 

Host Failure Response: Restart VMs.Datasore with PDL: Power off and restart VMs.

For more information about PDL, refer to the following VMware documentation: https://kb.vmware.com/s/article/2004684

Datasore with APD: Power off and restart VMs - conservative or aggressive restart policy. The recommendation is to choose one of the two options.For more information about APD, refer to the following VMware documentation: https://kb.vmware.com/s/article/2004684

8.2 VM/Host Groups and RulesvSphere HA enables automatic virtual machines fail-over by restarting the virtual machines of a failed host on another host that can access the datastore.

Page 29: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VSPHERE CLUSTER CONFIGURATION  –  29

••

Thus, a virtual machine can reside on a host in Site A, and in case of failure to be recovered (restarted) on a host in Site B.

Therefore, It is recommended and beneficial to configure vSphere HA to choose the preferred hosts on which to restart recovered VMs - typically hosts on the same site.

This is done using a VM/Host rules. 

8.2.1 VM/Host GroupsFirst, create groups for the Hosts and VMs.

Typically two Host groups, each group contains the hosts that reside on the same site.VM groups can be created as desired. 

In the example we will create also two VM groups. One for VMs that should reside on Site A and another one for Site B. 

vSphere HA fail-over and recovery process has nothing special or different when used with InfiniBox Active-Active replication.

Page 30: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VSPHERE CLUSTER CONFIGURATION  –  30

8.2.2 VM/Host RulesOnce the Hosts and VMs groups are created, they can be associated with VM/Host rules.

In the following example, two rules are created to set VMs in group "VM Group Site A" to reside on hosts in group "Hosts Group Site A".

It is recommended to use the “should” rule, so in the case there are no hosts available in the associated group, the rule can be broken.

Page 31: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VSPHERE CLUSTER CONFIGURATION  –  31

Then another rule is created to set VMs in group "VM Group Site B" to reside on hosts in group "Hosts Group Site B".

Page 32: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VSPHERE CLUSTER CONFIGURATION  –  32

8.3 Virtual Machines and Datastore PlacementAs previously discussed, the virtual machines in each site should reside on an Active-Active datastore which his preferred peer is set on the local InfiniBox system. (the system on the same site)

The virtual machines on group A (VMs that are running on hosts in Site A) should reside on the Active-Active datastore that his peer on system A  is set as preferred. (the peer on InfiniBox in Site A)The virtual machines on group B (VMs that are running on hosts in Site B) should reside on the Active-Active datastore that his peer on system B  is set as preferred. (the peer on InfiniBox in Site B)

Page 33: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VSPHERE CLUSTER CONFIGURATION  –  33

8.4 vMSC with Active-Active Datastores Configuration Diagram

8.4.1 Uniform Host Access

Page 34: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VSPHERE CLUSTER CONFIGURATION  –  34

8.4.2 Non-uniform Host Access

Page 35: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

FAILURE SCENARIOS AND EXPECTED RESPONSE  –  35

••

••

9 Failure Scenarios and Expected ResponseThe following chapter underlines how InfiniBox Active-Active replication and the vSphere environment behave in case of a failure. Failures can be caused by two main factors:

Failure of the storage array, the SAN fabric connectivity or the replication link. Failure of ESXi host(s). 

9.1 ESXi Host FailuresWhen an ESXi host failure is detected by vSphere HA , its virtual machines will be recovered and restarted on other ESXi hosts in the vSphere cluster. This is the typical vSphere HA response, regardless of the specific vMSC uniform or non-uniform host access configuration type.

9.1.1 Multiple Hosts Failure In case all the ESXi hosts in a specific site fail, vSphere HA can quickly recover and restart the failed virtual machine on the ESXi hosts in the remote site.

This high level of resiliency is achieved thanks to the fact that the datastores are undergoing InfiniBox Active-Active replication (stretched) and presented to the ESXi hosts on both sites.

9.2 Storage Array FailuresIn case of a complete storage array or a SAN fabric failure scenario, the vSphere HA response depends on the vMSC host access configuration. (uniform / non-uniform).

9.3 Complete Storage Failures on Non-uniform Configuration On non-uniform configuration, the InfiniBox system in each site presents each Active-Active datastore peer only to the local ESXi hosts that reside in the same site.

Each host can only see paths to the local Active-Active peer. (the peer that resides on the InfiniBox system in same site)

Therefore, if from any reason an InfiniBox system becomes inaccessible for the ESXi hosts in a specific site, vSphere would need to recover the failed virtual machines on the hosts in the remote site. The following failure scenarios in a specific site would lead to that result: 

Loss of all SAN fabric connectivity.Brutally unmaping an Active-Active peer on InfiniBox while virtual machines that reside on that peer are powered-on. Failure of the InfiniBox system.

9.3.1 Failure Scenario ExampleThe following example will explore the scenario of access loss to a peer while virtual machines that reside on that peer are powered-on. Environment configuration:

Page 36: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

FAILURE SCENARIOS AND EXPECTED RESPONSE  –  36

•••

••

••

••

••

•••

•••

••

••

2 InfiniBox systems:System-Site-A: in Site A.System-Site-B: in Site B.

2 Active-Active datastores. "Active-Active_Datastore1": preferred set on System in Site-A.

Reside on Active-Active volume named: "Active-Active_Datastore1""Active-Active_Datastore2": preferred set on System in Site-B.

Reside on Active-Active volume named: "Active-Active_Datastore2"4 Hosts, in 2 sites (Site A/B), divided to two Hosts groups:

Host Group Site A: 2 Hosts Host Group Site B: 2 Hosts

Non-uniform connection:Hosts in site A can only see the peers on the InfiniBox system in Site A.Hosts in site B can only see the peers on the InfiniBox system in Site B.

8 VMs, divided to two VM groups:VM Group Site A: 4 VMs VM Group Site B: 4 VMs

2 VM/Host rules:VMs that are members of the "VM Group Site A" should run on hosts that are members of the "Host Group Site A".VMs that are members of the "VM Group Site B" should run on hosts that are members of the "Host Group Site B".

VMs in group "VM Group Site A" reside on "Active-Active_Datastore1". VMs in group "VM Group Site B" reside on "Active-Active_Datastore2". 

All virtual machines are powered-on. 

VMs that are housed on hosts in Site A and reside on datastore "Active-Active_datastore1"

VMs that are housed on hosts in Site B and reside on datastore "Active-Active_datastore2"

Page 37: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

FAILURE SCENARIOS AND EXPECTED RESPONSE  –  37

Active-Active replication is set.

Failure

System-Site-B experienced a failure which caused a loss of access to the local Active-Active peers. (the peers on Site B of both Active-Active volumes).

Results

Initially:

The virtual machines that are housed on hosts in Site B (reside on "Active-Active_Datastore2"), become inaccessible. 

The virtual machines on Site B are expected to be inaccessible until the APD timeout has been reached (If APD timeout is enabled).

While Virtual machines that are housed on hosts in Site A (reside on "Active-Active_Datastore1"), continue to run without any interruption.

Page 38: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

FAILURE SCENARIOS AND EXPECTED RESPONSE  –  38

••

••

••

Looking at the datastore view from Site B hosts would show that both datastores are inaccessible. Due to the loss of access to the local system) 

While looking at the datastore view from Site A hosts would show that both datastores are still available.Due to the fact that the Active-Active datastores' peers on Site A are still accessible for the hosts in Site A through the local InfiniBox system in Site A.

When the APD timeout is reached (If APD timeout is enabled), vSphere HA will shut down the inaccessible VMs and recover (restart) them on hosts in Site A, which can still access the datastore through the local peer.

The recovered VMs will be back online and powered-on on Site A hosts.

Page 39: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

FAILURE SCENARIOS AND EXPECTED RESPONSE  –  39

Once the InfiniBox system on Site B failure is resolved, the peer will be re-synchronized automatically and the datastores be back again accessible to the hosts in Site B.

9.4 Complete Loss of a Access to a Storage Array on Uniform Configuration 

With a uniform configuration, the InfiniBox system in each site presents its Active-Active datastore peer to the ESXi hosts in both sites.

All the ESXi hosts can access Active-Active datastores through the local and remote peers. (and can see paths to both systems) 

Accordingly, if from any reason one of InfiniBox system becomes inaccessible for the ESXi hosts in a specific site, while the other InfiniBox system is still accessible, the virtual machines will keep and run on their hosts non-disruptively.

Thus, in this case, the downtime until the failed VMs are restarted on the remote hosts by vSphere HA is spared. 

It is recommended to ensure that the VMs are migrated back to hosts in Site B.

Therefore, uniform configuration provides an even greater level of availability for virtual machines, as it can sustained a complete loss of access to an InfiniBox system. 

Page 40: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

FAILURE SCENARIOS AND EXPECTED RESPONSE  –  40

•••

••

••

••

••

••

••

••

••

••

••

In case both InfiniBox systems become inaccessible for all the ESXi hosts in a specific site, the vSphere HA behavior will be similar to the one explored in the non-uniform section.

9.4.1 Failure Scenario ExampleEnvironment configuration:

2 InfiniBox systems:System-Site-A: in Site A.System-Site-B: in Site B.

2 Active-Active datastores. "Active-Active_Datastore1": preferred set on System in Site-A.

Reside on Active-Active volume named: "Active-Active_Datastore1""Active-Active_Datastore2": preferred set on System in Site-B.

Reside on Active-Active volume named: "Active-Active_Datastore2"4 Hosts, in 2 sites (Site A/B), divided to two Hosts groups:

Host Group Site A: 2 Hosts Host Group Site B: 2 Hosts

Uniform connection:Hosts in site A can access both Active-Active peers on the local and remote InfiniBox systems.

The hosts are set as "optimized" on the system in Site A and as "non-optimized" on the system in Site B.

Hosts in site B can access both Active-Active peers on the local and remote InfiniBox systems.The hosts are set as "optimized" on the system in Site B and as "non-optimized" on the system in Site A.

Paths are set to optimized for the local peer and non-optimized for the remote peer respectively. 8 VMs, divided to two VM groups:

VM Group Site A: 4 VMs VM Group Site B: 4 VMs

2 VM/Host rules:VMs that are members of the "VM Group Site A" should run on hosts that are members of the "Host Group Site A".VMs that are members of the "VM Group Site B" should run on hosts that are members of the "Host Group Site B".

VMs in group "VM Group Site A" reside on "Active-Active_Datastore1". VMs in group "VM Group Site B" reside on "Active-Active_Datastore2". 

All VMs that are housed on hosts in Site A and reside on datastore "Active-Active_datastore1" are powered-on. 

Page 41: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

FAILURE SCENARIOS AND EXPECTED RESPONSE  –  41

All VMs that are housed on hosts in Site B and reside on datastore "Active-Active_datastore2" are powered-on. 

Active-Active replication is set.

Page 42: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

FAILURE SCENARIOS AND EXPECTED RESPONSE  –  42

•••

As each host can access both peers, it can see paths to both systems. 

The paths to the local peers are set to optimized, while the paths to the remote peers are set to non-optimized. Looking at the "Connectivity and multipathing" view would show the state of each path:

In this example there are 6 paths to each InfiniBox system, 12 paths in total.Paths with the "Active (I/O)" are the optimized paths to the local peer.

As long as there are "Active (I/O)" paths available, All R/W IO will go through these paths which provide optimal performance. The "Active" (non-optimized) paths, which are the paths to the remote peer, will be used only in case all optimized paths are gone. 

The following example will explore the same a scenario of a complete access loss to a InfiniBox system in a specific site site, while all virtual machines are powered-on.

Page 43: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

FAILURE SCENARIOS AND EXPECTED RESPONSE  –  43

Failure

System-Site-B experienced a failure which caused a loss of access to the local Active-Active peers. (the peers on Site B of both Active-Active volumes).

Results

All the virtual machines on both sites stay online.

Looking at the datastore view from the hosts in Site B would show that both datastores are still accessible. 

Due to the fact they can still access the peer volume from the remote system.

Looking at the "Connectivity and multipathing" view would show that the paths to the peer on Site B are marked as "Dead" - cannot perform reads or writes I/O. 

Page 44: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

FAILURE SCENARIOS AND EXPECTED RESPONSE  –  44

••

Until the failure on the InfiniBox system in Site B is resolved, the VMs that are running on hosts in Site B are using the "non-optimized" paths which perform I/O directly to the remote InfiniBox system.

This would cause relativity larger latency.In cases where the failure on Site B is expected to persist a long period of time, consider to migrate the VMs that are running on hosts in Site B to the hosts in Site A, in order to spare the I/O travel addition. 

Once the failure on the InfiniBox system in Site B is resolved, the peer will be re-synchronized and accessible again to the hosts on both sites. vSphere then would automatically use only the optimized paths to each of the datastore.  

Page 45: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

HOW INFINIBOX HANDLES FAILURES  –  45

10 How InfiniBox Handles FailuresInfiniBox has two mechanism to handle failures for Active-Active replication: Witness and preferred system.

If an InfiniBox system becomes unavailable, e.g. power outage of the entire site, the peer system will provide access to all the volumes. If the replication link between the systems fails, then datastores will continue to serve I/O on one of the systems. Each datastore has a property in InfiniBox that defines its preferred-system, which will remain online. 

10.1 InfiniBox WitnessThe witness is an arbitrator entity residing in a 3rd site (separate from the two InfiniBox systems involved in Active-Active replication), that acts as quorum in case of replication link failures. The witness is a lightweight stateless software deployed as a VM. 

If the witness is down or inaccessible, replication link failure will result in InfiniBox systems keeping volumes online according to their preferred-system settings.

10.2 Preferred system Each volume that is undergoing Active-Active replication has a definition for preferred system, which the witness uses to make correct decisions. 

If the witness is not available to the systems, the decision on which side stays active will be done per replica based on the preferred system.

10.3 Storage Failover InfiniBox Active-Active replication failover is fully automatic and does not require any storage administrator intervention.

10.4 Storage Replication Resynchronization and RecoveryInfiniBox Active-Active recovery is completely automatic; no storage administrator intervention is necessary to trigger a re-sync and recover replication.

If the InfiniBox systems got disconnected, the replication will internally fallback to async mode. Once the connectivity between the systems recovers, synchronization jobs will start replicating the missing data to the lagging system. During this time, from disconnection and through the re-sync progress the Active-Active volumes on the synchronized system serve I/O operations, while the remote side will be in lagging state until all data is synchronized between the volumes. 

Once the volumes are nearly in sync, they will smoothly transition to Sync replication mode, with no I/O disruption. The host paths to the lagging side will be automatically restored, allowing the hosts to perform I/O operations through both systems again.

Page 46: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

INFINIBOX COMPONENTS FAILURES    –  46

11 InfiniBox Components Failures  The following table describes the InfiniBox storage accessibility in a different failure scenarios when using Active-Active replication:

Scenario InfiniBox System-A

InfiniBox System-B

Replication Link

Witness Active-Active Volumes Access

Optimal UP UP UP UP Volumes are available through both systems

Witness is down UP UP UP Down Volumes are available through both systems

Replication Link is down

UP UP Down UP Volumes are available through the preferred system

System-A is down

Down UP UP UP Volumes are available through System-B

Both systems are down

Down Down N/A N/A Volumes are not available

*Assuming that the Active-Active replicas were in a "Synchronized" state at the moment of failure and the systems link is in witness resiliency.

Page 47: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

INTRODUCTION  –  47

••

••

••

•••••

12 IntroductionThe following guide explores the best practices and configurations for using INFINIDAT InfiniBox Active-Active Replication with VMware vSphere Metro Storage Cluster (vMSC). InfiniBox Active-Active Replication was introduced with the 5.0 release, and together with VMware vSphere provides a highly resilient solution for protecting applications data, availability and allows an extremely fast recovery even in the event of a full site failure. 

12.1 RevisionLast updated on: March 1, 2020

12.2 Target AudienceThis document is intended for storage, system and VMware administrators that plan to deploy or manage InfiniBox Active-Active replication with VMware vSphere Metro Storage Cluster configuration.

The authors of this document assume that the reader is familiar with the following:

InfiniBox storage resources and Active-Active replication.VMware vSphere, vCenter Server and High Availability (vSphere HA) solutions.

For more information and assistance with INFINIDAT InfiniBox please visit support.infinidat.com

12.3 Before You BeginPrior to setting up InfiniBox Active-Active replication with VMware vSphere Metro Storage Cluster (vMSC) it is advised to read the INFINIDAT InfiniBox documentation for Active-Active replication and VMware vSphere® Metro Storage Cluster Recommended Practices.

12.4 TerminologyActive-Active volume: a volume that is undergoing Active-Active (A-A) replication.Peers: a pair of volumes that are undergoing Active-Active replication relationship, are also referred as "peers".Active-Active datastore: a datastore that resides on an Active-Active volume.ALUA: asymmetric logical unit access. 

12.5 Prerequisites Two InfiniBox 5.0 or later systems.FC connectivity. (I/O)Ethernet connectivity between the InfiniBox systems. (replication)Maximum of 5ms RTT latency between the InfiniBox systems.For additional vSphere Metro Storage related requirements please refer to VMware documentation.

Page 48: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

INTRODUCTION  –  48

12.6 Supported vSphere versions See an up-to-date list on the INFINIDAT Interoperability Matrix website.

Page 49: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

SOLUTION OVERVIEW  –  49

••

13 Solution OverviewDeploying VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox Active-Active replication provides a highly available and resilient solution for protecting applications availability and data, with a minimal performance impact. 

13.1 Introduction to InfiniBox Active-Active replicationInfiniBox Active-Active replication provides zero-RPO and zero-RTO, enabling mission critical business-services to keep operating even through a complete site failure:

Symmetric synchronous replication solution, applications can be geographically clustered.Fully integrated into InfiniBox, allows simple management of application spread across data centers.

13.2 Introduction to VMware vSphere Metro StorageA VMware vSphere Metro Storage Cluster configuration is a specific storage configuration that combines replication with array-based clustering. These solutions are typically deployed in environments where the distance between data centers is limited, often metropolitan or campus environments.

The primary benefit of a stretched cluster model is that it enables fully active and workload-balanced data centers to be used to their full potential and it allows for an extremely fast recovery in the event of a host or even full site failure.vSphere ESXi servers are in a single vCenter cluster and can be spreaded across sites (separate datacenters or geographic areas) 

13.3 Uniform / Non-uniform Host Access typesvMSC solutions are classified into two distinct types. These categories are based on a fundamental difference in how the vSphere hosts access the storage systems, which influences design considerations.

Uniform host access - vSphere hosts on both sites are all connected to the storage systems across both sites. LUN paths presented to vSphere hosts are stretched across the sites.Non-uniform host access - vSphere hosts at each site are connected only to the local storage system at the same site. LUN paths presented to vSphere hosts from storage nodes are limited to the local site.

13.4 ESXi hosts and Active-Active Volumes RelationshipsESXi hosts identify both peers as the same storage device. 

InfiniBox Active-Active replication is supported with both uniform and non-uniform host access types. 

Page 50: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

SOLUTION OVERVIEW  –  50

13.5 Stretched Storage ArchitectureWith a Stretched Storage architecture there are two InfiniBox storage array linked in an Active-Active relationship, one is on site A and another on site B. The ESXi hosts on both sites can be connected to both or one of the InfiniBox systems (uniform / non-uniform). When an Active-Active datastore is provisioned to the cluster, the ESXi hosts identify both peers of the Active-Active volume as the same datastore and device. It is possible to read and write simultaneously using both of the peers, while all writes are synchronously replicating between the InfiniBox systems.

Page 51: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

SOLUTION OVERVIEW  –  51

Page 52: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

PREPARING THE VSPHERE AND INFINIBOX ENVIRONMENT  –  52

•••

••

14 Preparing the vSphere and InfiniBox EnvironmentESXi hosts should be installed and configured with a vSphere HA cluster.

14.1 Using INFINIDAT Host PowerTools for VMwareIt is highly recommended to install Host PowerTools for VMware with every vSphere vCenter Server environment. Host PowerTools for VMware provides:

Ease-of-use.Storage and host automation.Best practices validation.

Fore more information on how to install and use Host PowerTools for VMware see: Host PowerTools for VMware

14.2 Setting InfiniBox Best Practices for vSphereValidate that the vSphere cluster is configured according to the InfiniBox best practices for vSphere.

The simplest method to apply the InfiniBox best practices is by using Host PowerTools for VMware. For instruction see the following guide: ESXi hosts and clusters readinessIn order to manually configure the ESXi hosts according to InfiniBox best practices, please refer to the following guide: List of items being checked by Host PowerTools for VMware

14.3 Active-Active link between the InfiniBox systemsThe InfiniBox systems should be configured with a replication link.

The link must support Active-Active replication.For more information on how to configure an Active-Active link, see: InfiniBox Best Practices Guide for Setting Up a Replication Service

InfiniBox Active-Active volumes are supported by ESXi native multipath software.

Page 53: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

CONFIGURING VMSC AND ACTIVE-ACTIVE DATASTORES  –  53

••

15 Configuring vMSC and Active-Active Datastores

Follow the instruction on the following chapters based on the desired host access configuration. (uniform / non-uniform)

The vMSC configuration depends on the host access type.If uniform access is desired, FC connectivity between the sites is required (used for I/O) in addition to the Ethernet connectivity used for replication.The guide will explore the configuration process mostly using the InfiniBox Web-GUI.

Prior to configuring vMSC and InfiniBox Active-Active Datastores, ensure that the environment is configured according to requirements on the "Preparing the vSphere and InfiniBox Environment" section above.

Page 54: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VMSC WITH UNIFORM HOST ACCESS  –  54

16 vMSC with Uniform Host AccessWhen configuring uniform host access, the ESXi hosts can access the same datastore through both InfiniBox systems - the system that exists on the same site (local) and the remote system on the other site. 

Typically, the datastore paths to the remote system will be less optimized than the paths to the datastore on the local system, due to the extra travel between the sites which carries added latency.

The InfiniBox system can intelligently hint the ESXi hosts about which are the optimal paths to serve I/O. This is further discussed later on. 

FC connectivity between the site is required.

16.1 Setting up the ESXi Hosts on InfiniBoxConnect all the ESXi hosts in the vSphere cluster to both InfiniBox systems. 

Page 55: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VMSC WITH UNIFORM HOST ACCESS  –  55

1.

2.•

3.

For each host it is recommended to have one path to each InfiniBox node (the nodes of both systems) from two initiators. (12 paths in total)The hosts can access both InfiniBox systems and will be able to see paths to the datastore from both systems.

16.1.1 Creating InfiniBox Hosts objectsOnce all hosts are connected (zoned), a corresponding InfiniBox host object should be created for each of the ESXi hosts, on both systems. 

In order to create the InfiniBox hosts objects:

Login to one of the InfiniBox systems using the management console.

Create a new host object. Add the corresponding host's ports (initiators).

Repeat steps 2 until all the ESXi host are created. 

The steps below describe how to create (register) the vSphere cluster on InfiniBox

The simplest method to register a uniform vSphere cluster is by using Host PowerTools for VMware. For instruction see: Registering ESXi hosts and clusters

If Host PowerTools for VMware is used to register the vSphere cluster, sections "Creating InfiniBox Hosts objects" and "Creating InfiniBox Cluster object" can be skipped.Make sure that the vSphere cluster is registered on both systems.

Page 56: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VMSC WITH UNIFORM HOST ACCESS  –  56

4.

1.

2.

3.

4.

Once all hosts are created, repeat the steps above on the other system.

Upon completion, all the vSphere cluster's hosts should exists as an InfiniBox host objects on both system.

16.1.2 Creating InfiniBox Cluster objectOnce all host are created, add them to an InfiniBox cluster object.

The InfiniBox cluster object aggregates host objects and enables to map volumes to all of the cluster's hosts at once. 

In order to create the InfiniBox cluster object:

Select the "Host & Cluster" icon on the left bar.

Click the "Create" button and select "Cluster".

Add all the previously created hosts to the cluster. 

Repeat steps on the other system.

Upon completion, all the vSphere hosts objects should be added to an InfiniBox cluster object on both system.

Page 57: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VMSC WITH UNIFORM HOST ACCESS  –  57

••

•••

•••

16.2 Setting the optimized InfiniBox system for each hostThe InfiniBox system can intelligently hint using ALUA the ESXi hosts about which are the optimal paths to serve I/O to each volume.

ALUA is a standard used for identifying paths prioritization between the storage and hosts, and enables the initiators to query the target about path attributes, such as the paths ALUA state.

This settings is controlled by an InfiniBox Host object option, which sets the host's "Optimized / Non-Optimized" setting.

By default host objects are created as "Optimized".The InfiniBox system hints the ESXi hosts by setting the hosts's mapped volumes paths ALUA state to "Optimized / Non-Optimized".

16.2.1 InfiniBox Host objects Optimize/Non-optimized configuration Configure the InfiniBox Host objects as follow:

On InfiniBox - Site A:Ensure that the hosts which are located on Site A are set to "Optimized".The hosts that are located on Site B should be set to "Non-Optimized".

On InfiniBox - Site B:Ensure that the hosts which are located on Site B are set to "Optimized".The hosts that are located on Site A should be set to "Non-Optimized".

Settings this properly is crucial when configuring vMSC with a Uniform host access, as the ESXi hosts are presented with datastore paths also from the remote InfiniBox system, which are typically less optimal. 

Not required for a Non-uniform configuration.

Page 58: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VMSC WITH UNIFORM HOST ACCESS  –  58

1.

2.

3.

4.

5.

Settings the InfiniBox hosts to Optimized / Non-Optimized 

In order to set a host to "Optimized / Non-Optimized":

Login to the InfiniBox system using the management console.

Select the "Host & Cluster" icon on the left bar.

Right click on a Host object.

Select "Modify Host".

Set the "Path ALUA state" to the proper option. 

Ensure proper configuration for all ESXi hosts on both InfiniBox system. 

The simplest method to set a vSphere host to Optimized/Non-optimize is by using Host PowerTools for VMware.

Page 59: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VMSC WITH UNIFORM HOST ACCESS  –  59

Page 60: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VMSC WITH NON-UNIFORM HOST ACCESS  –  60

17 vMSC with Non-uniform Host AccessWhen configuring non-uniform host access, the ESXi hosts on each site can access the storage only through the local InfiniBox system - the system that exists on the same site (local).

17.1 Configuring the ESXi hosts for Non-uniform accessWhen configuring Active-Active volumes using Non-uniform connectivity it is required to set the ESXi hosts objects on InfiniBox to “ESXi” type.

"ESXi" type - Makes InfiniBox issue a PDL SCSI sense response to the host in case a mapped Active-Active peer no longer synchronized. (cannot serve R/W IO, becomes "offline")If the host type is not set to "ESXi", vSphere HA cannot properly detect that a mapped Active-Active peer is no longer synchronized ("offline").

Therefore, in a Non-uniform environment, vSphere HA will not try to recover affected VMs that are running on hosts which can access only the "offline" peer.

Setting the ESXi hosts objects on InfiniBox to the “ESXi” type is further discussed below on the "Setting up the ESXi Hosts on InfiniBox" section.

Page 61: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VMSC WITH NON-UNIFORM HOST ACCESS  –  61

1.

2.

3.•

••

17.2 Setting up the ESXi Hosts on InfiniBoxConnect the ESXi hosts in each site only to the local InfiniBox systems. 

For each host it is recommended to have one path to each InfiniBox node from two initiators. (6 paths in total)The hosts on each site can access only the local InfiniBox system and therefore will be able to see paths to the datastore only from the local system.

17.2.1 Creating InfiniBox Hosts objectsAfter the hosts are connected, in order to be able to provision storage to the vSphere cluster, a corresponding InfiniBox host object must be created for each of the ESXi hosts.

Each host object contains the host’s initiators ports, which then can be mapped to a volume.

In order to create the InfiniBox hosts objects:

Login to the InfiniBox system on Site A using the management console.

Create a new host object. 

Set the created host object type to "ESXi".This is settable only using InfiniShell. 

Use the following command: "host.set_host_type host=<esxi-host-name> host_type=ESXi"When creating the hosts using InfiniShell it is possible to set the host type on creation, using the "host_type=ESXi" argument. 

Page 62: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VMSC WITH NON-UNIFORM HOST ACCESS  –  62

4.

5.

1.

2.

3.

4.

5.

Repeat steps 2-3 until all the ESXi hosts that reside on the same site are created. 

Repeat the steps on the InfiniBox system in Site B.

Upon completion, all the host in Site A should exist on the system in Site A and all the hosts in Site B should exist on the system in Site B.

Site A:

Site B:

17.2.2 Creating InfiniBox Cluster objectOnce all host are created, add them to an InfiniBox cluster object. The InfiniBox cluster object aggregates host objects and enables to map volumes to all of the cluster's hosts at once. 

In order to create the InfiniBox cluster object:

Login to the InfiniBox system on Site A using the management console.

Select the "Host & Cluster" icon on the left bar.

Click the "Create" button and select "Cluster".

Add all the previously created hosts to the cluster. 

Repeat steps on the system in Site B.

Page 63: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VMSC WITH NON-UNIFORM HOST ACCESS  –  63

Upon completion, all the vSphere hosts objects should be added to an InfiniBox cluster object on both system.

Each InfiniBox cluster represent the hosts which reside on the same site. 

Site A: 

Site B:

Page 64: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

PROVISIONING ACTIVE-ACTIVE DATASTORES  –  64

1.

2.

3.

••

4.•

18 Provisioning Active-Active DatastoresIn case virtual machines are designed to simultaneously run in both sites, it is advised to provision at least two Active-Active datastores.

Login to one of the InfiniBox systems using the management console.

Create two new volume.

Configure Active-Active replication to the remote system on one of the previously created

volumes.Keep the Preferred system option as Local.Upon success, an Active-Active replication is set and a volume peer is created on the remote system.

Configure Active-Active replication to the remote system also on the other volume. This time set the Preferred system option to Remote.

Virtual machines in each site should reside on an Active-Active datastore which his preferred peer is set on the local InfiniBox system. (the system on the same site)

Page 65: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

PROVISIONING ACTIVE-ACTIVE DATASTORES  –  65

5.•

6.

Upon success two Active-Active replications are created. For one "System A" is set as preferred and for the other "System B" .

Map the two Active-Active volume to the previously created InfiniBox cluster object.

Page 66: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

PROVISIONING ACTIVE-ACTIVE DATASTORES  –  66

1.

2.•

3.

4.

••

18.1 Mapping the other peer(s)Login to the other InfiniBox system using the management console.

Map the Active-Active volume peers to the previously created InfiniBox cluster object.It is recommended to map each peer using the same LUN ID in both systems. 

Login to the vSphere Web Client.

Perform a storage rescan on the vSphere cluster.

18.2 Ensuring proper volumes accessEnsure that the hosts properly see the paths to the mapped Active-Active volume.

The paths state and the number of paths to the Active-Active volume depends on the host access type. Optimized paths are presented as "Active (I/O)", which are the paths to the local InfiniBox system. 

As long as there are "Active (I/O)" paths available, All R/W IO will go through these paths which provide optimal performance.

Non-optimized are presented as "Active", which are the paths to the InfiniBox system in the other site. 

18.2.1 Non-uniform Host Access It is recommended that each host will have one path to each of its local InfiniBox nodes from two initiators. (six paths in total)

In the screenshot below, there is an InfiniBox Active-Active volume (storage device) which is mapped to an ESXi host. The Active-Active volume is presented to the host only from the local system, therefore all the paths are in "Active (I/O)" status which indicates they are optimized. 

The Active-Active volumes are now presented to each ESXi host (in both sites), from both InfiniBox systems. (uniform access)  

Page 67: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

PROVISIONING ACTIVE-ACTIVE DATASTORES  –  67

18.2.2 Uniform Host Access It is recommended that each host will have one path to each InfiniBox node (the nodes of both systems) from two initiators. (Twelve paths in total, six to each system)

In the screenshot below, there is an InfiniBox Active-Active volume (storage device) which is mapped to an ESXi host from both systems.

The six "Active (I/O)" paths are the optimized path to the local InfiniBox system where the ESXi host is set as optimized.The other "Active" path are non-optimized to the remote InfiniBox system and will be used only in case all the optimized paths are down.

Page 68: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

PROVISIONING ACTIVE-ACTIVE DATASTORES  –  68

18.3 Creating VMFSCreate a VMFS Datastore over the Active-Active volume (storage device). 

Located the vSphere cluster "New Datastore..." Wizard:

Select the Active-Active volume LUN and walk-through the wizard. 

Page 69: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

PROVISIONING ACTIVE-ACTIVE DATASTORES  –  69

Upon success the created datastore should be mounted on all hosts. 

The same Active-Active datastore can perform reads and writes through both systems simultaneously

Page 70: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VSPHERE CLUSTER CONFIGURATION  –  70

••

••

•••

19 vSphere Cluster ConfigurationThe following settings configuration is recommended for the vMSC vSphere Cluster:

19.1 vSphere Availability Validate that the vSphere HA is turned-on. ("TURN ON vSphere HA" is checked)

Allows automatic virtual machines fail-over in the case of a failure. It is advised to follow VMware best practices for vSphere HA configuration. For more information please refer to VMware documentation.

19.1.1 Failures and ResponsesEnsure that "Enable Host Monitoring" is enabled. 

Host Failure Response: Restart VMs.Datasore with PDL: Power off and restart VMs.

For more information about PDL, refer to the following VMware documentation: https://kb.vmware.com/s/article/2004684

Datasore with APD: Power off and restart VMs - conservative or aggressive restart policy. The recommendation is to choose one of the two options.For more information about APD, refer to the following VMware documentation: https://kb.vmware.com/s/article/2004684

19.2 VM/Host Groups and RulesvSphere HA enables automatic virtual machines fail-over by restarting the virtual machines of a failed host on another host that can access the datastore.

Page 71: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VSPHERE CLUSTER CONFIGURATION  –  71

••

Thus, a virtual machine can reside on a host in Site A, and in case of failure to be recovered (restarted) on a host in Site B.

Therefore, It is recommended and beneficial to configure vSphere HA to choose the preferred hosts on which to restart recovered VMs - typically hosts on the same site.

This is done using a VM/Host rules. 

19.2.1 VM/Host GroupsFirst, create groups for the Hosts and VMs.

Typically two Host groups, each group contains the hosts that reside on the same site.VM groups can be created as desired. 

In the example we will create also two VM groups. One for VMs that should reside on Site A and another one for Site B. 

vSphere HA fail-over and recovery process has nothing special or different when used with InfiniBox Active-Active replication.

Page 72: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VSPHERE CLUSTER CONFIGURATION  –  72

19.2.2 VM/Host RulesOnce the Hosts and VMs groups are created, they can be associated with VM/Host rules.

In the following example, two rules are created to set VMs in group "VM Group Site A" to reside on hosts in group "Hosts Group Site A".

It is recommended to use the “should” rule, so in the case there are no hosts available in the associated group, the rule can be broken.

Page 73: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VSPHERE CLUSTER CONFIGURATION  –  73

Then another rule is created to set VMs in group "VM Group Site B" to reside on hosts in group "Hosts Group Site B".

Page 74: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VSPHERE CLUSTER CONFIGURATION  –  74

19.3 Virtual Machines and Datastore PlacementAs previously discussed, the virtual machines in each site should reside on an Active-Active datastore which his preferred peer is set on the local InfiniBox system. (the system on the same site)

The virtual machines on group A (VMs that are running on hosts in Site A) should reside on the Active-Active datastore that his peer on system A  is set as preferred. (the peer on InfiniBox in Site A)The virtual machines on group B (VMs that are running on hosts in Site B) should reside on the Active-Active datastore that his peer on system B  is set as preferred. (the peer on InfiniBox in Site B)

Page 75: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VSPHERE CLUSTER CONFIGURATION  –  75

19.4 vMSC with Active-Active Datastores Configuration Diagram

19.4.1 Uniform Host Access

Page 76: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

VSPHERE CLUSTER CONFIGURATION  –  76

19.4.2 Non-uniform Host Access

Page 77: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

FAILURE SCENARIOS AND EXPECTED RESPONSE  –  77

••

••

20 Failure Scenarios and Expected ResponseThe following chapter underlines how InfiniBox Active-Active replication and the vSphere environment behave in case of a failure. Failures can be caused by two main factors:

Failure of the storage array, the SAN fabric connectivity or the replication link. Failure of ESXi host(s). 

20.1 ESXi Host FailuresWhen an ESXi host failure is detected by vSphere HA , its virtual machines will be recovered and restarted on other ESXi hosts in the vSphere cluster. This is the typical vSphere HA response, regardless of the specific vMSC uniform or non-uniform host access configuration type.

20.1.1 Multiple Hosts Failure In case all the ESXi hosts in a specific site fail, vSphere HA can quickly recover and restart the failed virtual machine on the ESXi hosts in the remote site.

This high level of resiliency is achieved thanks to the fact that the datastores are undergoing InfiniBox Active-Active replication (stretched) and presented to the ESXi hosts on both sites.

20.2 Storage Array FailuresIn case of a complete storage array or a SAN fabric failure scenario, the vSphere HA response depends on the vMSC host access configuration. (uniform / non-uniform).

20.3 Complete Storage Failures on Non-uniform Configuration On non-uniform configuration, the InfiniBox system in each site presents each Active-Active datastore peer only to the local ESXi hosts that reside in the same site.

Each host can only see paths to the local Active-Active peer. (the peer that resides on the InfiniBox system in same site)

Therefore, if from any reason an InfiniBox system becomes inaccessible for the ESXi hosts in a specific site, vSphere would need to recover the failed virtual machines on the hosts in the remote site. The following failure scenarios in a specific site would lead to that result: 

Loss of all SAN fabric connectivity.Brutally unmaping an Active-Active peer on InfiniBox while virtual machines that reside on that peer are powered-on. Failure of the InfiniBox system.

20.3.1 Failure Scenario ExampleThe following example will explore the scenario of access loss to a peer while virtual machines that reside on that peer are powered-on. Environment configuration:

Page 78: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

FAILURE SCENARIOS AND EXPECTED RESPONSE  –  78

•••

••

••

••

••

•••

•••

••

••

2 InfiniBox systems:System-Site-A: in Site A.System-Site-B: in Site B.

2 Active-Active datastores. "Active-Active_Datastore1": preferred set on System in Site-A.

Reside on Active-Active volume named: "Active-Active_Datastore1""Active-Active_Datastore2": preferred set on System in Site-B.

Reside on Active-Active volume named: "Active-Active_Datastore2"4 Hosts, in 2 sites (Site A/B), divided to two Hosts groups:

Host Group Site A: 2 Hosts Host Group Site B: 2 Hosts

Non-uniform connection:Hosts in site A can only see the peers on the InfiniBox system in Site A.Hosts in site B can only see the peers on the InfiniBox system in Site B.

8 VMs, divided to two VM groups:VM Group Site A: 4 VMs VM Group Site B: 4 VMs

2 VM/Host rules:VMs that are members of the "VM Group Site A" should run on hosts that are members of the "Host Group Site A".VMs that are members of the "VM Group Site B" should run on hosts that are members of the "Host Group Site B".

VMs in group "VM Group Site A" reside on "Active-Active_Datastore1". VMs in group "VM Group Site B" reside on "Active-Active_Datastore2". 

All virtual machines are powered-on. 

VMs that are housed on hosts in Site A and reside on datastore "Active-Active_datastore1"

VMs that are housed on hosts in Site B and reside on datastore "Active-Active_datastore2"

Page 79: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

FAILURE SCENARIOS AND EXPECTED RESPONSE  –  79

Active-Active replication is set.

Failure

System-Site-B experienced a failure which caused a loss of access to the local Active-Active peers. (the peers on Site B of both Active-Active volumes).

Results

Initially:

The virtual machines that are housed on hosts in Site B (reside on "Active-Active_Datastore2"), become inaccessible. 

The virtual machines on Site B are expected to be inaccessible until the APD timeout has been reached (If APD timeout is enabled).

While Virtual machines that are housed on hosts in Site A (reside on "Active-Active_Datastore1"), continue to run without any interruption.

Page 80: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

FAILURE SCENARIOS AND EXPECTED RESPONSE  –  80

••

••

••

Looking at the datastore view from Site B hosts would show that both datastores are inaccessible. Due to the loss of access to the local system) 

While looking at the datastore view from Site A hosts would show that both datastores are still available.Due to the fact that the Active-Active datastores' peers on Site A are still accessible for the hosts in Site A through the local InfiniBox system in Site A.

When the APD timeout is reached (If APD timeout is enabled), vSphere HA will shut down the inaccessible VMs and recover (restart) them on hosts in Site A, which can still access the datastore through the local peer.

The recovered VMs will be back online and powered-on on Site A hosts.

Page 81: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

FAILURE SCENARIOS AND EXPECTED RESPONSE  –  81

Once the InfiniBox system on Site B failure is resolved, the peer will be re-synchronized automatically and the datastores be back again accessible to the hosts in Site B.

20.4 Complete Loss of a Access to a Storage Array on Uniform Configuration 

With a uniform configuration, the InfiniBox system in each site presents its Active-Active datastore peer to the ESXi hosts in both sites.

All the ESXi hosts can access Active-Active datastores through the local and remote peers. (and can see paths to both systems) 

Accordingly, if from any reason one of InfiniBox system becomes inaccessible for the ESXi hosts in a specific site, while the other InfiniBox system is still accessible, the virtual machines will keep and run on their hosts non-disruptively.

Thus, in this case, the downtime until the failed VMs are restarted on the remote hosts by vSphere HA is spared. 

It is recommended to ensure that the VMs are migrated back to hosts in Site B.

Therefore, uniform configuration provides an even greater level of availability for virtual machines, as it can sustained a complete loss of access to an InfiniBox system. 

Page 82: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

FAILURE SCENARIOS AND EXPECTED RESPONSE  –  82

•••

••

••

••

••

••

••

••

••

••

••

In case both InfiniBox systems become inaccessible for all the ESXi hosts in a specific site, the vSphere HA behavior will be similar to the one explored in the non-uniform section.

20.4.1 Failure Scenario ExampleEnvironment configuration:

2 InfiniBox systems:System-Site-A: in Site A.System-Site-B: in Site B.

2 Active-Active datastores. "Active-Active_Datastore1": preferred set on System in Site-A.

Reside on Active-Active volume named: "Active-Active_Datastore1""Active-Active_Datastore2": preferred set on System in Site-B.

Reside on Active-Active volume named: "Active-Active_Datastore2"4 Hosts, in 2 sites (Site A/B), divided to two Hosts groups:

Host Group Site A: 2 Hosts Host Group Site B: 2 Hosts

Uniform connection:Hosts in site A can access both Active-Active peers on the local and remote InfiniBox systems.

The hosts are set as "optimized" on the system in Site A and as "non-optimized" on the system in Site B.

Hosts in site B can access both Active-Active peers on the local and remote InfiniBox systems.The hosts are set as "optimized" on the system in Site B and as "non-optimized" on the system in Site A.

Paths are set to optimized for the local peer and non-optimized for the remote peer respectively. 8 VMs, divided to two VM groups:

VM Group Site A: 4 VMs VM Group Site B: 4 VMs

2 VM/Host rules:VMs that are members of the "VM Group Site A" should run on hosts that are members of the "Host Group Site A".VMs that are members of the "VM Group Site B" should run on hosts that are members of the "Host Group Site B".

VMs in group "VM Group Site A" reside on "Active-Active_Datastore1". VMs in group "VM Group Site B" reside on "Active-Active_Datastore2". 

All VMs that are housed on hosts in Site A and reside on datastore "Active-Active_datastore1" are powered-on. 

Page 83: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

FAILURE SCENARIOS AND EXPECTED RESPONSE  –  83

All VMs that are housed on hosts in Site B and reside on datastore "Active-Active_datastore2" are powered-on. 

Active-Active replication is set.

Page 84: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

FAILURE SCENARIOS AND EXPECTED RESPONSE  –  84

•••

As each host can access both peers, it can see paths to both systems. 

The paths to the local peers are set to optimized, while the paths to the remote peers are set to non-optimized. Looking at the "Connectivity and multipathing" view would show the state of each path:

In this example there are 6 paths to each InfiniBox system, 12 paths in total.Paths with the "Active (I/O)" are the optimized paths to the local peer.

As long as there are "Active (I/O)" paths available, All R/W IO will go through these paths which provide optimal performance. The "Active" (non-optimized) paths, which are the paths to the remote peer, will be used only in case all optimized paths are gone. 

The following example will explore the same a scenario of a complete access loss to a InfiniBox system in a specific site site, while all virtual machines are powered-on.

Page 85: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

FAILURE SCENARIOS AND EXPECTED RESPONSE  –  85

Failure

System-Site-B experienced a failure which caused a loss of access to the local Active-Active peers. (the peers on Site B of both Active-Active volumes).

Results

All the virtual machines on both sites stay online.

Looking at the datastore view from the hosts in Site B would show that both datastores are still accessible. 

Due to the fact they can still access the peer volume from the remote system.

Looking at the "Connectivity and multipathing" view would show that the paths to the peer on Site B are marked as "Dead" - cannot perform reads or writes I/O. 

Page 86: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

FAILURE SCENARIOS AND EXPECTED RESPONSE  –  86

••

Until the failure on the InfiniBox system in Site B is resolved, the VMs that are running on hosts in Site B are using the "non-optimized" paths which perform I/O directly to the remote InfiniBox system.

This would cause relativity larger latency.In cases where the failure on Site B is expected to persist a long period of time, consider to migrate the VMs that are running on hosts in Site B to the hosts in Site A, in order to spare the I/O travel addition. 

Once the failure on the InfiniBox system in Site B is resolved, the peer will be re-synchronized and accessible again to the hosts on both sites. vSphere then would automatically use only the optimized paths to each of the datastore.  

Page 87: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

HOW INFINIBOX HANDLES FAILURES  –  87

21 How InfiniBox Handles FailuresInfiniBox has two mechanism to handle failures for Active-Active replication: Witness and preferred system.

If an InfiniBox system becomes unavailable, e.g. power outage of the entire site, the peer system will provide access to all the volumes. If the replication link between the systems fails, then datastores will continue to serve I/O on one of the systems. Each datastore has a property in InfiniBox that defines its preferred-system, which will remain online. 

21.1 InfiniBox WitnessThe witness is an arbitrator entity residing in a 3rd site (separate from the two InfiniBox systems involved in Active-Active replication), that acts as quorum in case of replication link failures. The witness is a lightweight stateless software deployed as a VM. 

If the witness is down or inaccessible, replication link failure will result in InfiniBox systems keeping volumes online according to their preferred-system settings.

21.2 Preferred system Each volume that is undergoing Active-Active replication has a definition for preferred system, which the witness uses to make correct decisions. 

If the witness is not available to the systems, the decision on which side stays active will be done per replica based on the preferred system.

21.3 Storage Failover InfiniBox Active-Active replication failover is fully automatic and does not require any storage administrator intervention.

21.4 Storage Replication Resynchronization and RecoveryInfiniBox Active-Active recovery is completely automatic; no storage administrator intervention is necessary to trigger a re-sync and recover replication.

If the InfiniBox systems got disconnected, the replication will internally fallback to async mode. Once the connectivity between the systems recovers, synchronization jobs will start replicating the missing data to the lagging system. During this time, from disconnection and through the re-sync progress the Active-Active volumes on the synchronized system serve I/O operations, while the remote side will be in lagging state until all data is synchronized between the volumes. 

Once the volumes are nearly in sync, they will smoothly transition to Sync replication mode, with no I/O disruption. The host paths to the lagging side will be automatically restored, allowing the hosts to perform I/O operations through both systems again.

Page 88: VMware vSphere Metro Storage Cluster with INFINIDAT InfiniBox … · VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION 3 8.1 vSphere Availability

VMWARE VSPHERE METRO STORAGE CLUSTER WITH INFINIDAT INFINIBOX ACTIVE-ACTIVE REPLICATION

INFINIBOX COMPONENTS FAILURES    –  88

22 InfiniBox Components Failures  The following table describes the InfiniBox storage accessibility in a different failure scenarios when using Active-Active replication:

Scenario InfiniBox System-A

InfiniBox System-B

Replication Link

Witness Active-Active Volumes Access

Optimal UP UP UP UP Volumes are available through both systems

Witness is down UP UP UP Down Volumes are available through both systems

Replication Link is down

UP UP Down UP Volumes are available through the preferred system

System-A is down

Down UP UP UP Volumes are available through System-B

Both systems are down

Down Down N/A N/A Volumes are not available

*Assuming that the Active-Active replicas were in a "Synchronized" state at the moment of failure and the systems link is in witness resiliency.