7/27/2019 BCO2382-VMware vSphere HA Recommendations to Maximize Virtual Machine Uptime_Final_US.pdf
1/75
VMware vSphere HA
Recommendations to Maximize Virtual Machine Uptime
Josh Gray, VMware, Inc.
Jeff Hunter, VMware, Inc.
INF-BCO2382
#vmworldinf
Disclaimer
This session may contain product features that are
currently under development.
This session/overview of the new technology represents
no commitment from VMware to deliver these features in
any generally available product.
Features are subject to change, and must not be included in
contracts, purchase orders, or sales agreements of any kind.
Technical feasibility and market demand will affect final delivery.
Pricing and packaging for any new technologies or features
discussed or presented have not been determined.
High Availability is Part of IT Business Continuity
Just a Few Clicks to Higher Availability
Turn ON vSphere HA
OK
Global Support Services (GSS)
Global coverage: 24x7, 365 days/year
6 Support Centers
Support offices: Bangalore, India; Tokyo, Japan; Cork, Ireland; Burlington, Canada; Palo Alto, CA; Broomfield, CO
Local language support: Spanish, Portuguese, French, German, Japanese, Chinese
1000+ Support Engineers
Follow-the-sun support for Severity 1 issues
Support relationships with 100% of the Fortune 100; 99% of Fortune 500
Recent Enhancements
vSphere 5.0
Major Redesign
Fault Domain Manager (FDM)
vSphere 5.1
Minor Updates
Recommendations: Networking
Redundant Management Network
Fewest hops possible
Route based on originating port ID
Failback policy = No
Enable PortFast, Edge, etc.
MTU size the same
Keep things simple
Recommendations: Networking
Consistent portgroup names, network labels
Disable Host Monitoring during network maintenance
Use Maintenance Mode
Separate subnet for vSphere HA
Specify additional network isolation address
Each host can communicate with all other hosts
Keep things simple
Recommendations: Networking
Advanced Configuration Options
das.allowNetwork[0-9]=
das.isolationAddress[0-9]=
das.useDefaultIsolationAddress= (true/false)
das.failuredetectiontime (not supported in vCenter 5.x)
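The advanced options listed above are plain key/value settings. As a rough illustration only, the sketch below assembles them into a dictionary using the option names from the slide. The helper name `build_ha_options`, the example addresses, and the network label are hypothetical and are not part of any VMware API; applying the options to a cluster is out of scope here.

```python
def build_ha_options(isolation_addresses, allowed_networks=(),
                     use_default_isolation_address=True):
    """Assemble vSphere HA advanced options as a plain dict.

    Illustrative helper only (not a VMware API). Option names follow the
    slide: das.isolationAddress[0-9], das.allowNetwork[0-9],
    das.useDefaultIsolationAddress.
    """
    # The [0-9] suffix leaves room for at most ten entries per option.
    if len(isolation_addresses) > 10 or len(allowed_networks) > 10:
        raise ValueError("at most 10 entries are possible with a [0-9] suffix")

    options = {}
    for index, address in enumerate(isolation_addresses):
        options[f"das.isolationAddress{index}"] = address
    for index, network in enumerate(allowed_networks):
        options[f"das.allowNetwork{index}"] = network
    options["das.useDefaultIsolationAddress"] = (
        "true" if use_default_isolation_address else "false"
    )
    return options


# Hypothetical values: two extra isolation addresses, one allowed network.
example = build_ha_options(
    ["192.0.2.1", "192.0.2.2"],
    allowed_networks=["Management Network"],
)
for key, value in sorted(example.items()):
    print(f"{key}={value}")
```

das.failuredetectiontime is deliberately left out, since the slide marks it as not supported in vCenter 5.x.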
Recommendations: Storage
Implement multiple paths
HBAs, storage processors (SPs), NICs, switches
Appropriate multipathing policy
Recommendations: Storage
Storage Heartbeats
HA selects two datastores by default
Override auto-selected datastores if necessary
HA Events (How to Avoid Problems)
Possible HA Events:
Host failure
Network partition
Host isolation
HA Events: Host Failures
HA Events: Network Partition
Recommendations: Network Partition
Symptoms: Network Partition
[Figure sequence: cluster diagram during a network partition; labels: Master, New Master, New Master]
HA Events: Host Isolation
Host Isolation Policies:
Leave Powered On
Power Off
Shutdown
Which Policy? (How to Avoid Problems)
Depends. (on HOW You Want to Avoid Problems)
Likelihood.
Recommendations: Isolation Response
Host will retain        VMs will retain         Recommended                    Rationale
access to datastores?   access to VM network?   isolation policy
Likely                  Likely                  Leave Powered On               VM is running fine, why power it off?
Likely                  Unlikely                Leave Powered On or Shutdown   Allow HA to restart on hosts that are not isolated, likely to have access to storage
Unlikely                Likely                  Power Off                      Avoid having two instances of the same VM on the network
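The isolation response table above amounts to a small lookup keyed on two yes/no judgments. A minimal sketch follows, assuming "likely" maps to True and "unlikely" to False. The function name is hypothetical, and the combination the slide does not list (both unlikely) is returned as None rather than guessed.

```python
from typing import Optional

# Key: (host likely to retain datastore access, VMs likely to retain VM network access)
# Value: recommended isolation policy, as listed on the slide.
ISOLATION_POLICY = {
    (True, True): "Leave Powered On",
    (True, False): "Leave Powered On or Shutdown",
    (False, True): "Power Off",
}


def recommended_isolation_policy(datastore_access_likely: bool,
                                 vm_network_access_likely: bool) -> Optional[str]:
    """Return the slide's recommendation, or None for a case the slide omits."""
    return ISOLATION_POLICY.get((datastore_access_likely, vm_network_access_likely))


print(recommended_isolation_policy(True, True))
print(recommended_isolation_policy(False, True))
```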
Admission Control (How to Avoid Problems)
Admission Control Policies:
Static number of hosts
Percentage of cluster resources
Dedicated failover hosts
Static Number of Hosts Admission Control Policy
Recommendations: Admission Control
Number of Hosts (Host Failures Cluster Tolerates)
Each host: 4 CPUs x 2.40 GHz, 16 GB memory
Cluster: 38 GHz, 64 GB memory
VM reservation: 2 GHz CPU, 1024 MB memory
VM reservation: 1 GHz CPU, 2048 MB memory
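The host sizes and VM reservations above are the inputs to the slot calculation this policy uses. The sketch below works through that arithmetic under stated assumptions: a four-host cluster (inferred from 64 GB total at 16 GB per host), one host failure tolerated, and no allowance for VM memory overhead or hypervisor overhead, both of which reduce real slot counts. All function names are hypothetical.

```python
def slot_size(reservations):
    """Slot = largest CPU reservation and largest memory reservation.

    reservations: list of (cpu_mhz, mem_mb) tuples, all nonzero.
    Memory overhead is ignored in this simplified sketch.
    """
    cpu_mhz = max(cpu for cpu, _ in reservations)
    mem_mb = max(mem for _, mem in reservations)
    return cpu_mhz, mem_mb


def slots_per_host(host_cpu_mhz, host_mem_mb, slot):
    """A host holds as many slots as its scarcer resource allows."""
    slot_cpu, slot_mem = slot
    return min(host_cpu_mhz // slot_cpu, host_mem_mb // slot_mem)


def slots_after_failures(hosts, slot, host_failures_tolerated):
    """Slots left if the hosts with the most slots fail (worst case)."""
    per_host = sorted(slots_per_host(cpu, mem, slot) for cpu, mem in hosts)
    surviving = per_host[: len(per_host) - host_failures_tolerated]
    return sum(surviving)


# Values from the slides: two VM reservations, hosts of 4 x 2.40 GHz and 16 GB.
vm_reservations = [(2000, 1024), (1000, 2048)]
hosts = [(4 * 2400, 16 * 1024)] * 4  # four hosts is an inference, see above

slot = slot_size(vm_reservations)
print(slot)                                  # (2000, 2048)
print(slots_per_host(*hosts[0], slot))       # 4
print(slots_after_failures(hosts, slot, 1))  # 12
```

The slot takes its CPU size from one VM and its memory size from the other, so a single VM with a large reservation can sharply reduce the number of slots per host.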
vSphere Windows Client
Override default behavior: sets a cap on the slot size
vSphere Web Client
Override default behavior: sets the exact size. Important difference.
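The difference between the two clients' override behavior can be shown with one function. This is a sketch of the distinction as the slides state it (a cap in the Windows client, an exact size in the Web Client); the function name, the mode strings, and the numbers are hypothetical.

```python
def effective_slot_value(computed, override, mode):
    """Apply a slot-size override to one dimension (CPU MHz or memory MB).

    mode "cap":   the override is an upper bound on the computed value.
    mode "exact": the override replaces the computed value outright.
    """
    if mode == "cap":
        return min(computed, override)
    if mode == "exact":
        return override
    raise ValueError(f"unknown mode: {mode!r}")


computed_cpu_mhz = 2000  # e.g. the largest CPU reservation in the cluster

# With an override below the computed value, both modes agree.
print(effective_slot_value(computed_cpu_mhz, 1000, "cap"))    # 1000
print(effective_slot_value(computed_cpu_mhz, 1000, "exact"))  # 1000

# With an override above the computed value, they diverge.
print(effective_slot_value(computed_cpu_mhz, 3000, "cap"))    # 2000
print(effective_slot_value(computed_cpu_mhz, 3000, "exact"))  # 3000
```

The two modes only diverge when the override is larger than the computed slot size: an exact override can then make slots bigger than any reservation requires, which lowers the slot count.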
Recap:
Static Number of Hosts Admission Control Policy
% of Cluster Resources Admission Control Policy
Recommendations: Admission Control
Percentage of cluster resources
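A rough sketch of the arithmetic behind the percentage policy, as generally documented for vSphere HA: current failover capacity is the unreserved share of total cluster resources, and a power-on is admitted only if that share stays at or above the configured percentage for both CPU and memory. The 25% setting, the function names, and the example VM are assumptions for illustration; overheads and default minimum reservations are ignored.

```python
def failover_capacity_pct(total, reserved):
    """Unreserved share of a resource, as a percentage of the total."""
    return (total - reserved) / total * 100.0


def admits_power_on(total_cpu_mhz, total_mem_mb,
                    reserved_cpu_mhz, reserved_mem_mb,
                    new_cpu_mhz, new_mem_mb,
                    cpu_pct_reserved_for_failover,
                    mem_pct_reserved_for_failover):
    """True if powering on the new VM keeps both capacities above the floor."""
    cpu_after = failover_capacity_pct(total_cpu_mhz, reserved_cpu_mhz + new_cpu_mhz)
    mem_after = failover_capacity_pct(total_mem_mb, reserved_mem_mb + new_mem_mb)
    return (cpu_after >= cpu_pct_reserved_for_failover
            and mem_after >= mem_pct_reserved_for_failover)


# Cluster totals from the earlier slides: 4 hosts x 9600 MHz, 4 hosts x 16 GB.
total_cpu, total_mem = 38400, 65536
# Reservations already in place: 2 GHz/1024 MB plus 1 GHz/2048 MB.
reserved_cpu, reserved_mem = 3000, 3072

# Hypothetical new VM reserving 2 GHz and 1 GB, with 25% held back for failover.
print(admits_power_on(total_cpu, total_mem, reserved_cpu, reserved_mem,
                      2000, 1024, 25, 25))
```

Because the check works on aggregate reservations rather than slots, one VM with a large reservation does not distort the capacity estimate the way it does under the static number of hosts policy.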
Dedicated Failover Hosts Admission Control Policy
Which Do I Use?!?!
Recommendations: Admission Control
Basic design principle: Do the math, and take customer requirements into account. If you need flexibility, a Percentage is the way to go.
Frank Denneman & Duncan Epping
VMware vSphere 5 Clustering Technical Deepdive
vSphere HA VM Monitoring
VM Monitoring restarts a VM if:
VMware Tools heartbeat is not received, and
No network or disk activity occurs within the I/O stats interval
Default interval is 120 seconds; customize in the vSphere Web Client
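The restart condition above combines two checks: a missed VMware Tools heartbeat, then a look at recent I/O before acting. A minimal sketch of that decision follows. The 120-second I/O stats interval is from the slide; the function name, the `failure_interval_s` parameter, and its example value are assumptions for illustration.

```python
def should_restart_vm(seconds_since_heartbeat, failure_interval_s,
                      seconds_since_last_io, io_stats_interval_s=120):
    """Decide whether VM Monitoring would restart the VM.

    The VM is restarted only when the heartbeat has been missing longer
    than the failure interval AND there has been no network or disk
    activity within the I/O stats interval.
    """
    heartbeat_lost = seconds_since_heartbeat > failure_interval_s
    no_recent_io = seconds_since_last_io >= io_stats_interval_s
    return heartbeat_lost and no_recent_io


# Heartbeat missing, but the VM did disk I/O 10 seconds ago: leave it alone.
print(should_restart_vm(60, 30, 10))
# Heartbeat missing and no I/O for 3 minutes: restart.
print(should_restart_vm(60, 30, 180))
```

The I/O check guards against restarting a VM that is still working but whose VMware Tools service has stopped sending heartbeats.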
vSphere HA Application Monitoring
3rd-Party Solutions
Symantec ApplicationHA
Neverfail vAppHA
Application Awareness API open with vSphere 5.0
Download VMware GuestAppMonitor SDK with 5.0
Download VMware Guest SDK for vSphere 5.1
vSphere HA Futures
VMware vSphere HA Today
Storage interconnect most commonly queried KB issue
Assumes storage connected on other hosts
Improvements with vSphere 5.0 U1 and 5.1
Virtual Machine Component Protection (VMCP)
Fine-grained controls for VM restart policy
Queries destination host(s) for storage health
Demo in VMware booth on show floor
vSphere HA Futures
VMware vSphere Fault Tolerance (FT) Today
Protects only VMs with 1 vCPU
Many mission-critical apps require multiple vCPUs
SMP Fault Tolerance (FT)
Protect VMs that have more than one vCPU
Customer Support Day Events
Coming to a location near you: sharing of VMware best practices!
Support Days are a collaboration between VMware Support, Sales,
and customers; you learn directly from the experts
Topics are driven by customer input, and typically include:
Best practices
Tips/tricks
Top issues
Product roadmaps/demos
Certification offerings
http://www.vmware.com/go/supportdays
VMware GSS: Important Links
Blogs
Support Insider: blogs.vmware.com/kb
KBTV: blogs.vmware.com/kbtv
KB Digest: blogs.vmware.com/kbdigest
Twitter
@vmwarecares: twitter.com/vmwarecares
@vmwarekb: twitter.com/vmwarekb
Facebook
https://www.facebook.com/vmwkb
Communities
communities.vmware.com
YouTube
KBTV: youtube.com/user/vmwarekb
Support and Downloads: vmware.com/support
Technical Support Welcome Guide: vmware.com/go/supportguide
Get Support via My VMware: my.vmware.com/group/vmware/get-help
Licensing Help Center: vmware.com/support/licensing
Knowledge Base: kb.vmware.com
Customer Support Days: vmware.com/go/supportdays
Renewals: vmware.com/go/renew
Customer Advocacy: [email protected]
Support Centers: vmware.com/support/product-support
FILL OUT
A SURVEY
EVERY COMPLETE SURVEY
IS ENTERED INTO
DRAWING FOR A
$25 VMWARE COMPANY
STORE GIFT CERTIFICATE
INF-BCO2382