Queensland University of Technology CRICOS No. 00213J VMware as implemented by the ITS department,...

19
Queensland University of Technology CRICOS No. 00213J VMware as implemented by the ITS department, QUT Scott Brewster 7 December 2006

Transcript of Queensland University of Technology CRICOS No. 00213J VMware as implemented by the ITS department,...

Page 1: Queensland University of Technology CRICOS No. 00213J VMware as implemented by the ITS department, QUT Scott Brewster 7 December 2006.

Queensland University of Technology

CRICOS No. 00213J

VMware as implemented bythe ITS department, QUT

Scott Brewster

7 December 2006

Page 2: Queensland University of Technology CRICOS No. 00213J VMware as implemented by the ITS department, QUT Scott Brewster 7 December 2006.

CRICOS No. 00213Ja university for the worldrealR

Note

• IT services at QUT are provided primarily by the central ITS department and additionally by the IT departments of various faculties and divisions.

• This presentation focuses on the VMware implementation managed by the central ITS department.

(There are other VMware implementations at QUT managed by faculty IT departments.)

Page 3: Queensland University of Technology CRICOS No. 00213J VMware as implemented by the ITS department, QUT Scott Brewster 7 December 2006.

CRICOS No. 00213Ja university for the worldrealR

Overview

• Why VMware?• VMware software• Physical hardware

– Host hardware– Network hardware– Storage hardware

• Virtual machine configuration• Guest operating-systems• Backup of virtual machines• VirtualCenter• Future directions

Page 4: Queensland University of Technology CRICOS No. 00213J VMware as implemented by the ITS department, QUT Scott Brewster 7 December 2006.

CRICOS No. 00213Ja university for the worldrealR

Why VMware?

• Server consolidation through server virtualisation– Relocating instances of operating-systems on

multiple under-utilised physical servers to multiple virtual machines on a single physical server

– Test and development environments are key targets for virtualisation

Page 5: Queensland University of Technology CRICOS No. 00213J VMware as implemented by the ITS department, QUT Scott Brewster 7 December 2006.

CRICOS No. 00213Ja university for the worldrealR

VMware software

Timeframe:• Late-2005: Initial deployment: 6 hosts running ESX Server 2.5.2• Mid-2006:

– Installed ESX Server 3.0 on 8 new hosts– Migrated virtual machines from 6 original hosts:

• Manually shutdown and migrated existing virtual machines one at a time from the ESX Server 2.5.2 hosts to the new ESX Server 3.0 hosts, leaving all ESX Server 2.5.2 hosts empty of virtual machines. Unfortunately required virtual machine downtime!

– Re-installed ESX Server 3.0 on the original 6 hosts• Late-2006: Upgraded all hosts to ESX Server 3.0.1

– Used VMotion to migrate all virtual machines from a given host prior to its updating to ESX Server 3.0.1. No virtual machine downtime required!

• Now: Another 8 new hosts awaiting installation of ESX Server 3.0.1

Page 6: Queensland University of Technology CRICOS No. 00213J VMware as implemented by the ITS department, QUT Scott Brewster 7 December 2006.

CRICOS No. 00213Ja university for the worldrealR

Physical hardware

• VMware implementation requires three key types of physical hardware: hosts, a network, and shared storage– Hosts:

• 22 Hewlett-Packard (HP) ProLiant-series servers

– Network:• 1000 Mb/s Ethernet

• Cisco and Nortel network infrastructure

– Storage:• Local boot disks

• Shared storage provided by SAN

• SAN consists of HP storage arrays and fibre channel switches

Page 7: Queensland University of Technology CRICOS No. 00213J VMware as implemented by the ITS department, QUT Scott Brewster 7 December 2006.

CRICOS No. 00213Ja university for the worldrealR

Host hardware

• 22 physical hosts dedicated to VMware implementation:– 4 HP ProLiant DL380 G4

• 2 3.4 GHz Intel Xeon CPU’s• 5 GiB memory• 2 200 MiB/s Fibre channel (200-M5-SN-I) ports• 4 1000 Mb/s Ethernet (1000BASE-T) ports

– 10 HP ProLiant DL385 G1• 2 2.2 GHz AMD Opteron (dual core) CPU’s• 9 GiB memory• 2 400 MiB/s Fibre channel (400-M5-SN-I) ports• 4 1000 Mb/s Ethernet (1000BASE-T) ports

– 8 HP ProLiant BL465c G1• 2 2.6 GHz AMD Opteron (dual core) CPU’s• 14 GiB memory• 2 400 MiB/s Fibre channel (400-M5-SN-I) ports• 4 1000 Mb/s Ethernet (1000BASE-T) ports

Page 8: Queensland University of Technology CRICOS No. 00213J VMware as implemented by the ITS department, QUT Scott Brewster 7 December 2006.

CRICOS No. 00213Ja university for the worldrealR

Network hardware

• Each host has 4 1000 Mb/s network connections:1. IP subnet 131.181.117.128/25 for the service console2. IP subnet 10.0.0.0/8 on a dedicated VLAN for VMotion3. IP subnet 131.181.118.0/24 or 131.181.117.0/25 for use by

virtual machines4. Additional connection identical to (3) above, for redundancy.

• Some hosts have an extra 2 1000 Mb/s network connections:5. IP subnet 131.181.108.0/24 or 131.181.107.0/24 for use by

virtual machines6. Additional connection identical to (5) above, for redundancy.

Page 9: Queensland University of Technology CRICOS No. 00213J VMware as implemented by the ITS department, QUT Scott Brewster 7 December 2006.

CRICOS No. 00213Ja university for the worldrealR

Network hardware

• Now: External switch tagging (EST) mode

Vswitch Vswitch Vswitch Vswitch

131.181.117.128/25 10.0.0.0/8 131.181.108.0/24 131.181.118.0/24

Physical network connections

Serviceconsole

Vmotionmodule

Virtual machines

Page 10: Queensland University of Technology CRICOS No. 00213J VMware as implemented by the ITS department, QUT Scott Brewster 7 December 2006.

CRICOS No. 00213Ja university for the worldrealR

Network hardware

• Currently need access to four IP subnets just for virtual machines with desired access to even more subnets.

• Intention is to use virtual switch tagging (VST) mode– Allows virtual machines to access any subnet– Provides redundancy for all connections (including

Service Console and Vmotion)– Allows Vmotion between more ESX Server hosts

Page 11: Queensland University of Technology CRICOS No. 00213J VMware as implemented by the ITS department, QUT Scott Brewster 7 December 2006.

CRICOS No. 00213Ja university for the worldrealR

Network hardware

• Desired: Virtual switch tagging (VST) mode

Vswitch

131.181.117.128/25 10.0.0.0/8 131.181.108.0/24 131.181.118.0/24

Physical trunk connections

Serviceconsole

Vmotionmodule

Virtual machines

Page 12: Queensland University of Technology CRICOS No. 00213J VMware as implemented by the ITS department, QUT Scott Brewster 7 December 2006.

CRICOS No. 00213Ja university for the worldrealR

Storage hardware

• Hosts boot from local disks:– Local disks (all SCSI) are configured into a RAID-1 logical disk.– Our non-blade servers use an extra local disk as a hot spare.

• All other storage is shared and presented from a SAN:– Hosts have dual 200 MiB/s (or 400 MiB/s for newer hosts) fibre

channel connections to the SAN one to each SAN fabric.(QUT has two identical SAN fabrics for redundancy.)

– HP Storage arrays (EVA8000 in this case) provide shared SAN LUN’s to the hosts.

– SAN LUN’s for use by VMware are 500 GiB RAID-5 LUN’s.

Page 13: Queensland University of Technology CRICOS No. 00213J VMware as implemented by the ITS department, QUT Scott Brewster 7 December 2006.

CRICOS No. 00213Ja university for the worldrealR

Storage

• Each SAN LUN provides the backing for a single ESX datastore.

• Datastores can span SAN LUN’s but we haven’t tried this.

• In turn, a datastore can be formatted with the VMFS3 filesystem.

• Virtual machine’s virtual disks are backed by files in VMFS3 filesystems.

• We keep all of a virtual machine’s virtual disks on the same datastore.

Page 14: Queensland University of Technology CRICOS No. 00213J VMware as implemented by the ITS department, QUT Scott Brewster 7 December 2006.

CRICOS No. 00213Ja university for the worldrealR

Virtual machine configuration

• Currently hosting 64 virtual machines• CPU:

– Majority of virtual machines configured with a single “virtual” CPU– Some are configured with dual “virtual” CPU’s

• Memory:– Majority are configured with 512 MiB or less– Some use 1 GiB or more

• Network:– All currently use a single virtual network interface

• Storage:– Most have a relatively small boot virtual disk with one or more large data

virtual disks– Some have a larger combined boot/data virtual disk

Page 15: Queensland University of Technology CRICOS No. 00213J VMware as implemented by the ITS department, QUT Scott Brewster 7 December 2006.

CRICOS No. 00213Ja university for the worldrealR

Guest operating-systems

• Red Hat Enterprise Linux 4– 29 virtual machines running this OS– Even physical host hardware cannot always keep up with the

default system timer rate of 1000 clock interrupts/s. A custom kernel is therefore required to reduce this rate to 100 interrupts/s for virtual machines.

– Virtual machine is created manually by system-administrator.– Operating-system is then installed using network-based Kickstart

process from the university’s Red Hat Satellite. Custom scripts install additional QUT-specific software and customisation.

– The host is automatically registered for updates as part of the Kickstart process.

Page 16: Queensland University of Technology CRICOS No. 00213J VMware as implemented by the ITS department, QUT Scott Brewster 7 December 2006.

CRICOS No. 00213Ja university for the worldrealR

Installation of guest operating-systems

• Microsoft Windows 2003 Server– 35 virtual machines running this OS– Clock interrupts already occur at less than 100 interrupts/s, so

no customisation of the system timer is required.– Virtual machine is created by cloning a virtual machine template

which has previously been manually installed from a Windows installation CD. The template is configured to both run Sysprep and add the instance to the WSUS server for updates.

– The system-administrator then modifies the newly created virtual machine if extra disks, memory, etc. are required.

Page 17: Queensland University of Technology CRICOS No. 00213J VMware as implemented by the ITS department, QUT Scott Brewster 7 December 2006.

CRICOS No. 00213Ja university for the worldrealR

Backup of virtual machines

• No backup of ESX Server hosts is made:– Virtual machines are stored on the shared SAN LUN’s and can

be restarted from a different ESX Server host if an ESX host is lost.

• Each virtual machine is backed-up traditionally using a network-backup agent:– If a virtual machine is lost is must be recreated and restored

from tape.

• The shared SAN LUN’s are not backed-up:– If a shared SAN LUN is lost, all virtual machines it contained

must be recreated and restored from tape.

Page 18: Queensland University of Technology CRICOS No. 00213J VMware as implemented by the ITS department, QUT Scott Brewster 7 December 2006.

CRICOS No. 00213Ja university for the worldrealR

VirtualCenter

• VirtualCenter version– Late-2005: Initial deployment used VirtualCenter 1.3.1– Mid-2006: Fresh installation of VirtualCenter 2.0– Late-2006: Upgrade to VirtualCenter 2.0.1

• Client: Only supported on Windows– Linux users have to use Terminal Services client to first connect to Windows host– Virtual consoles become unreliable when this is done – key-press and key-release events

are delayed causing unwanted repetition on virtual consoles• Server: Only supported on Windows

– Installed on a physical host• License server

– Dedicated license server running on the same physical host as the VirtualCenter server• VirtualCenter database

– Oracle database running under Linux on a physical host• Vmotion

– Separately licensed and additional cost, but essential tool in our experience– Allows on-line migration of virtual machines between physical hosts

Page 19: Queensland University of Technology CRICOS No. 00213J VMware as implemented by the ITS department, QUT Scott Brewster 7 December 2006.

CRICOS No. 00213Ja university for the worldrealR

Future directions

• Review virtual machine backup– Current backup strategy does nothing to reduce the

number of costly network backup licenses required– Network backups generate a lot of extra network

traffic, which is undesirable on virtual machines

• Configuration of resource pools– Currently little consideration is being given to

guaranteeing resources for virtual machines– Appropriately configured resource pools should help