The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and...

26
The Twelve Pillars of IBM z/VM System Management Alan Altmark, IBM Senior Managing z/VM Consultant February 2019

Transcript of The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and...

Page 1: The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and coordination with CP SIGNAL ... 2016, 2019 20. Alert Management •Performance •User

The Twelve Pillars of IBM z/VM System Management

Alan Altmark, IBM

Senior Managing z/VM Consultant

February 2019

Page 2: The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and coordination with CP SIGNAL ... 2016, 2019 20. Alert Management •Performance •User

Notes

References to IBM products, programs, or services do not imply that IBM intends to make these available in all countries in which IBM operates. Any reference to an IBM product, program, or service is not intended to state or imply that only IBM's product, program, or service may be used. Any functionally equivalent product, program, or service that does not infringe on any of the intellectual property rights of IBM may be used instead. The evaluation and verification of operation in conjunction with other products, except those expressly designed by IBM, are the responsibility of the user.

IBM, the IBM logo, ibm.com, z/VM, OMEGAMON, and GDPS are trademarks or registered trademarks of International Business Machines Corporation, registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at "Copyright and trademark information" at www.ibm.com/legal/copytrade.shtml.

Technical content Copyright © IBM Corporation, 2016, 2019

Copyright © IBM Corporation, 2016, 2019 2

Page 3: The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and coordination with CP SIGNAL ... 2016, 2019 20. Alert Management •Performance •User

OK, so they’re more like rectangles….

Copyright © IBM Corporation, 2016, 2019 3

z/VMlifecycle

Virtual server lifecycle

Real resource management

Performance management

Security Networking

AutomationAlert

ManagementChargebackOperations

High Availability

Disaster Recovery

Page 4: The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and coordination with CP SIGNAL ... 2016, 2019 20. Alert Management •Performance •User

z/VM Hypervisor Lifecycle

• Installation

• System cloning

• Customization

• Patching: RSU, COR, SPE, fix-test

• PSP bucket

• System upgrades

• Decommission

• Practice, practice, practice dumps and patches• Once a quarter• You never know when you’ll need it in a hurry!

Copyright © IBM Corporation, 2016, 2019 4

Page 5: The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and coordination with CP SIGNAL ... 2016, 2019 20. Alert Management •Performance •User

z/VM Hypervisor Lifecycle

• Recommended Service Upgrade (RSU)• Selected preventive service PTFs• Cumulative• Executables pre-built• Conservative content

• Corrective service• Individual PTFs• Fix-test for an open APAR• Needed to be current

• New Function (NF) APARs• Shipped as COR service• MAY eventually be placed on RSU

Copyright © IBM Corporation, 2016, 2019 5

Page 6: The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and coordination with CP SIGNAL ... 2016, 2019 20. Alert Management •Performance •User

Virtual Server Lifecycle

• Virtual machines• Naming conventions

• Provisioning• Add / Change / Delete• Directory manager (e.g. DIRMAINT)• ICP, IBM wave

• Operations• Start / Stop / Pause / Move

Copyright © IBM Corporation, 2016, 2019 6

Page 7: The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and coordination with CP SIGNAL ... 2016, 2019 20. Alert Management •Performance •User

Real Resource Management

• Memory and CPU• Can resources be moved or are they tied to the partition?

• Dynamic I/O changes• IODF creation, distribution, and activation• HCD v. HCM v. DPM v. z/OS

• Hardware upgrades (concurrent or off/on)

• Storage migration

• Threshold and status alerts (operator)

Copyright © IBM Corporation, 2016, 2019 7

Page 8: The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and coordination with CP SIGNAL ... 2016, 2019 20. Alert Management •Performance •User

Performance Management

• Performance management • Real-time• Historical• Capacity planning• Advisories• Threshold and status alerts

• Performance Toolkit: Web, 3270

• OMEGAMON

• zVPS

Copyright © IBM Corporation, 2016, 2019 8

Page 9: The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and coordination with CP SIGNAL ... 2016, 2019 20. Alert Management •Performance •User

Security - Policy

• Policy is a roadmap for implementation• Multi-factor authentication?• Password phrases?

• What you are supposed to accomplish, not how• Follow on with an implementation guide that is z/VM specific

• Policy should avoid implementation-specific naming• “Must use the Foo Facility” can lock out platforms that have same functionality,

but don’t call it “Foo”

• Virtualization services are substantially different from traditional OS services

Copyright © IBM Corporation, 2016, 2019 9

Page 10: The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and coordination with CP SIGNAL ... 2016, 2019 20. Alert Management •Performance •User

Security - Operational

• Provisioning• Person or workload?

• Naming convention v. Groups

• Authorizations & privileges

• Emergencies & Error recovery• Break glass

• Database failure

• ESM failure

Copyright © IBM Corporation, 2016, 2019 10

Page 11: The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and coordination with CP SIGNAL ... 2016, 2019 20. Alert Management •Performance •User

Security - Operational

• What to audit?

• Detecting configuration changes

• Penetration testing

Copyright © IBM Corporation, 2016, 2019 11

Page 12: The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and coordination with CP SIGNAL ... 2016, 2019 20. Alert Management •Performance •User

Security - Data

• Encryption• In flight: TLS servers (no self-signed certs, please)• At rest: pervasive encryption• Upgrade your TN3270E clients

• Resource access controls• External security manager• Access rights review• Based on group membership

• Residual data management• Clear, purge, destroy

Copyright © IBM Corporation, 2016, 2019 12

Page 13: The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and coordination with CP SIGNAL ... 2016, 2019 20. Alert Management •Performance •User

Networking

• TCP/IP and VSWITCH• Bridge, HiperSockets, VLANs

• Planning

• Bandwidth (capacity)

• Hardware (cables, ports, switches, power)

• Alert generation (failure, thresholds, drops)

• Dropped packet diagnostics

Copyright © IBM Corporation, 2016, 2019 13

Page 14: The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and coordination with CP SIGNAL ... 2016, 2019 20. Alert Management •Performance •User

Networking

• “Just because it’s valid doesn’t mean it’s wise”- Sir Alan, Lord of the Protocols

• Configuration review• Backup port on same chpid• All OSAs plugged into same physical switch• Link aggregation / portchannel / bonding

• Vulnerability assessments

• Don’t forget about RoCE!

Copyright © IBM Corporation, 2016, 2019 14

Page 15: The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and coordination with CP SIGNAL ... 2016, 2019 20. Alert Management •Performance •User

High Availability

• Single component outage• Planned or unplanned• Human or machine

• Clusters• Application, Database, Hypervisor, Server, Storage• All components eventually fail

• Storage mirrors: GDPS® and hyperswap

• Network link aggregation

• Alert generation (predictive, FYI, or unexpected)

• Archive / Retrieve (human failure)

Copyright © IBM Corporation, 2016, 2019 15

Page 16: The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and coordination with CP SIGNAL ... 2016, 2019 20. Alert Management •Performance •User

Hyperswap

Copyright © IBM Corporation, 2016, 2019 16

z/VM

SecondaryPrimary

Page 17: The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and coordination with CP SIGNAL ... 2016, 2019 20. Alert Management •Performance •User

Hyperswap

Copyright © IBM Corporation, 2016, 2019 17

z/VM

SecondaryPrimaryX

Page 18: The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and coordination with CP SIGNAL ... 2016, 2019 20. Alert Management •Performance •User

Single System Image

Copyright © IBM Corporation, 2016, 2019 18

Shared volumes

Multiple CTCs for ISFC-based

SSI communications

Common LAN for guest IP communications(optionally, with shared SAN for guest FCP connections)Non-shared volumes

Member 3

Member 1

Member 4

Member 2

Page 19: The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and coordination with CP SIGNAL ... 2016, 2019 20. Alert Management •Performance •User

Disaster Recovery

• Multiple component failure• Includes complete site failure

• Asynchronous storage mirror

• Alternate machine, if you don’t have another one nearby

• GDPS-managed region swap

• Alternate configuration management• Return home• Planned test

• Backup / Restore

Copyright © IBM Corporation, 2016, 2019 19

Page 20: The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and coordination with CP SIGNAL ... 2016, 2019 20. Alert Management •Performance •User

Automation

• System operator console• Tourist information or Important?• Response to prompts (e.g. from security server)• Use SECUSER - don’t run software IN the OPERATOR virtual machine

• Console log recording• All guests• Keep console data out of z/VM spool!

• Generate enterprise alerts

• File and command distribution

• Coordination of Flashcopy / Backup

• Startup / Shutdown staging and coordination with CP SIGNAL

Copyright © IBM Corporation, 2016, 2019 20

Page 21: The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and coordination with CP SIGNAL ... 2016, 2019 20. Alert Management •Performance •User

Alert Management

• Performance

• User revocation (potential intrusion attempt)

• Predictive hardware failure

• FYI

• Enterprise event management integration• IBM, CA, HP, Microsoft, ….• Syslog forwarding• SNMP traps

• Locally generated e-mail

Copyright © IBM Corporation, 2016, 2019 21

Page 22: The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and coordination with CP SIGNAL ... 2016, 2019 20. Alert Management •Performance •User

Operations

• IPL: Standalone program loader (SAPL)• LOADPARM• IPL parameters• SALIPL to set defaults

• SHUTDOWN / SHUTDOWN REIPL

• SNAPDUMP

• System restart full dump

• Stand-alone dump

• Practice, practice!

Copyright © IBM Corporation, 2016, 2019 22

Page 23: The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and coordination with CP SIGNAL ... 2016, 2019 20. Alert Management •Performance •User

Operations

• HMC• Get your own ID

• OSA-ICC• “Gotta have it”

• Auto-IPL

• Recording: EREP (OFF), Symptom (OFF), ACCOUNT (if needed)

• Backup / Restore

• Service virtual machine (SVM) data collection and archive (sweep)

Copyright © IBM Corporation, 2016, 2019 23

Page 24: The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and coordination with CP SIGNAL ... 2016, 2019 20. Alert Management •Performance •User

Chargeback

• Unit price per virtual server

• Unit price per CPU, disk, network

• Consumption (CPU) [hint: Don’t do this!]

• Premium charge for premium service

• See OPERACCT for accounting records

• No requirement to bill• May be used to calculate TCO for comparisons• Not using? Then turn off ACCOUNT record recording.

Copyright © IBM Corporation, 2016, 2019 24

Page 25: The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and coordination with CP SIGNAL ... 2016, 2019 20. Alert Management •Performance •User

The goal is in sight!

Copyright © IBM Corporation, 2016, 2019 25

z/VMlifecycle

Virtual server lifecycle

Real resource management

Performance management

Security NetworkingHigh

AvailabilityDisaster Recovery

AutomationAlert

ManagementChargebackOperations

Page 26: The Twelve Pillars of z/VM System Management · 2020. 5. 15. · •Startup / Shutdown staging and coordination with CP SIGNAL ... 2016, 2019 20. Alert Management •Performance •User

Contact information

Alan AltmarkSenior Managing z/VM Consultant

IBM Systems Lab Servicesz Systems Delivery Practice

IBM1701 North StreetEndicott, NY 13760

Mobile 607 321 7556Fax 607 429 3323

Email: [email protected]

Copyright © IBM Corporation, 2016, 2019 26

Evaluations, please!