Data Center Capacity Management 1 1.ppt - Square · PDF fileData Center Capacity...
Transcript of Data Center Capacity Management 1 1.ppt - Square · PDF fileData Center Capacity...
WebinarWebinar
Data Center Capacity ManagementData Center Capacity Management
David Cuthbertson DirectorDavid Cuthbertson, DirectorSquare Mile Systems Ltd
Capacity Management Why?Capacity Management - Why?
1 Standards have been1. Standards have been referenced for the design, implementation and testingimplementation and testing of data centers – TIA942
2. Good components have been chosen - which rarely go wronggo wrong
3. Why does the data center3. Why does the data centersometimes run out of capacity?delay installation of critical kit?
Square Mile BackgroundSquare Mile Background
• Develop AssetGen toolsets training and B i P• Develop AssetGen toolsets, training and techniques for operational management of complex IT infrastructure
• Focus areas Services
Business ProcessesDepartmental, Company
• Focus areas– Connectivity management– Data center management
S i d CMDB iApplications
PC server mainframe SOA
ServicesEnd user, infrastructure, supplier
– Service and CMDB mapping– System change impact analysis – Documentation techniques
I f i li i
Virtual InfrastructureNetwork, Servers, Storage, DBMS
PC, server, mainframe, SOA
– Infrastructure visualisation • Use Visio automation for both
diagramming and reporting toolHardware Infrastructure
Network, Servers, UPS, Storage, Other
Fixed Infrastructure(Cabling, Power, Cabinets, Buildings)
Best Practices TrainingBest Practices Training
• Practical Data Centre Management (2 days)– Managing the facility and external teams (ITIL, ISO27001, BS25999)
• How to Map Services and Systems (1 day)– Communicating change / incident impact and dependencies (ITIL)g g p p ( )
• Visio for IT Professionals (2 days)– Training course on Visio diagramming and automation techniques
• Creating and Maintaining Visio Infrastructure Diagrams• Creating and Maintaining Visio Infrastructure Diagrams – One day iIn-house workshop on Visio diagram automation
Changing Requirements (Device)Changing Requirements (Device)
The Server requires
1 Power connection
1 or 2 network cables1 or 2 network cables
1 KVM connection
Server
Todays Requirements (Device)Todays Requirements (Device)
Other
Core Switch
SANArray
PowerPDU
Other
DD
P KVMEdge
Switch
SANSwitch
PowerStrip
KVMVideo
1-6 0-4 1-41-6
ServerMin 3Max 20+
Changing Requirements (Environment)Changing Requirements (Environment)
• Density– More servers in a rack (42 1u servers = 84 network cables)
More ports on a switch– More ports on a switch
• Cabling technology – Cat6A cable takes up more space fibre used for SAN– Cat6A cable takes up more space, fibre used for SAN
• Power– Multiple power inputs (HP blade server up to 6) & loadingMultiple power inputs (HP blade server up to 6) & loading
• Cooling– Must keep within cooling limits and not block airflowp g
• Risk– One box may have multiple servers so errors are multiplied
Changing Requirements (Business)Changing Requirements (Business)
• Centralised planning of changes– Local knowledge of connectivity resources required
R d th t f h• Reduce the costs of change– Less manual effort, optimise use of hardware assets
• Faster turn round of tasks• Faster turn round of tasks– Use of workflow & virtual systems (VLAN, storage, servers)
• Out tasking physical changes• Out tasking physical changes– Formal instructions / work orders to 3rd parties
• Reduce risk of disruptionReduce risk of disruption– More resilience and alternate paths required
Why Manage Capacity?Why Manage Capacity?
B iBusiness
IT
Computing Needs
InfrastructureThe Data Center team can’t predict
Data Centerhosting and other requirements!
- Highlight investment required- React to changes g
How Are Most Data Centers Managed?How Are Most Data Centers Managed?
• Informal / formal processes• Site survey, pre-installation checks, audits • Ownership is often assigned locally• Create knowledge sets as individuals or within teams
– MS Office - Excel, Visio, Word, Notes, Sharepoint, Access– Specialist toolsets AssetGen, Aperture, Nlyte and others
• Or give the problem to someone else• Or give the problem to someone else– Host, outsource, out task.
Capacity Management in PracticeCapacity Management in Practice
1. How you decide where to put equipment2. When to say no (or yes)
PROLIANTPROLIANT
PROLIANTPROLIANT
– Exceed technical design or operational limits– Doesn’t conform to the capacity management plan– Not optimal use of available resource
PROLIANTPROLIANT
– Not optimal use of available resource
3. Establishing authority and ownership– Allocation of resources and funding
440
I
CRITICAL
MAJOR
MINOR
USER
microsystems
440
I
CRITICAL
MAJOR
MINOR
USER
microsystemsAllocation of resources and funding– Decommissioning and moving
4. Confidence in service provisionUID
HPProLiant
DL380 G5
1 2
1 2 3 4 5 6 7 8
POWERSUPPLY SUPPLY
POWER
SPAREONLINE
MIRROR
CAGERISERPCI
PROC PR OC
LOCKINTER
TEMPOVER
FANS
PP
MP
PM
DIMMS
UID
HPProLiant
DL380 G5
1 2
1 2 3 4 5 6 7 8
POWERSUPPLY SUPPLY
POWER
SPAREONLINE
MIRROR
CAGERISERPCI
PROC PR OC
LOCKINTER
TEMPOVER
FANS
PP
MP
PM
DIMMS
COMPACTCOMPACT
p– Everything is working within design limits– Failover or resilience will work as required
1 2 3 4 5 6 7 8 17 18 19 20 21 22 23 249 10 11 12 13 14 15 161 2 3 4 5 6 7 8 17 18 19 20 21 22 23 249 10 11 12 13 14 15 16
COMPACTCOMPACT
Practical IssuesPractical Issues
• Who owns the problem of creating and maintaining an end to end data centre capacity management system?
Facilities?– Facilities?– IT Data Centre teams?– Platform teams?
Is this going to be solved by ITIL configuration management and a
– Service management?– Development teams?
g gCMDB? – NO!
• Where do you start?– People
P– Process – Toolsets
Capacity Management What do I Mean?Capacity Management – What do I Mean?
Performance ManagementMonitoring & AnalysisTuning / Optimisiing
Demand ManagementRequests / Reservation
Reviews
Modelling Capacity CommunicationPlacement options, current limits
Heat flow, resilience, fail overCapacity plan, energy usage
Reporting (Current and Future)
Workflow SystemLife cycle controls
Resource management
Monitoring SystemsMonitoring of current state
AlertingInfrastructure Resource management Alerting
DocumentationSpacePower
CoolinggConnectivity
InventoryProjects
Allocating a Server Location
• Building
Allocating a Server Location
• Building• Room• Row• Rack or tile• Height• Weight• Power• Network connectivity• Network connectivity• SAN connectivity• Heat• Cost
© Square Mile Systems14
Define Your Provisioning LifecycleDefine Your Provisioning LifecycleBusiness Projects
Budgetary
BusinessRequirement
Data Center
ResponseProceed
DetailedPl
Log Request
PlansAuthorised
ProjectStart
PlacementStart
BuildPhase Update all
ProjectCompletion
Phase Update all systems
Define the Roles/ResponsibilitiesDefine the Roles/Responsibilities
Design Install Change Maintain Monitor Document
Power
Cooling
Racks
Cabling
ServersServers
Storage
Network
Mid-range
© Square Mile Systems16
Define The Process Flow Swim ChartDefine The Process Flow – Swim ChartProjects
DeploymentPlanning
DesignTeam
ChangeMgmt
Build Team
StdComponents
Request
Requestform
No
Outline
Design Review
BookC
formYes
Forward bookingschedule
Design Doc
UpdateCapacity Plan
Contractor
Allocateddates
QA CheckDeployment
design
Change from reservedto allocated
Accept/Reject Release DetailedPlan Schedule Change
ConfirmContractors
Handover
Accept
Reject
© Square Mile Systems17
Contractors
CFD Heat Flow ModellingCFD Heat Flow Modelling
© Square Mile Systems18
Making Capacity Management EasierMaking Capacity Management Easier
• Develop common processes and standards for changes– Naming and labelling
Easier project / operations handover– Easier project / operations handover– Reservation of capacity and connectivity in operational systems
• Focus on reducing the number of data setsFocus on reducing the number of data sets– Spreadsheets– Databases– Diagrams– Documents
M it i t– Monitoring systems– Workflow tools
AssetGen TechniquesAssetGen Techniques
Coordinated database multiple viewpoints automated diagrammingCoordinated database, multiple viewpoints, automated diagramming
Capacity and connectivity reportsServicesSoftwareServers
p y y p
Change impact analysis and audit trails
Visio
ServersStorageCabinetsNetworks
Spreadsheet outputs
Service and Architecture diagrams
Excel
LAN/SAN/WAN/Power diagramsVisio
VisioNetworksCablingPowerV i
Service and Architecture diagrams
Visio
VoiceRack, floor plans
Visio Diagrams From AssetGenVisio Diagrams From AssetGen
Rack Position
Floor PlanService impact
BLADE_BIRM01
UK
_
UK
_
UK
_
UK
_
UK
__BIRM
01_BLADE-01
_BIRM
01_BLADE-02
_BIRM
01_BLADE-03
_BIRM
01_BLADE-04
BLADE-BIR
M01.BLAD
E-SW1
BLADE-BIR
M01.BLAD
E-SW2
_BIRM
01_BLADE-05
UK_BIR
M01_BLAD
E-09
UK_BIR
M01_BLAD
E-10
UK_BIR
M01_BLAD
E-12
H/W Build
Power SupplyPower Supply
Network Connections
Data Graphics Rack FunctionData Graphics - Rack FunctionColour coding
01-01 01-02 01-03 01-04 01-05 01-06 01-07 01-08 01-09 01-10
g
Network Network Server Server Server Server Server Storage Storage Mixed
02-01
N t k
02-02
N t k
02-03
S
02-04
S
02-05
S
02-06
Mi d
02-07
S
02-08
S
02-09
Unalloct d
02-10
N t kNetwork Network Server Server Server Mixed Server Server ated Network
03-01
Network
03-02
Network
03-03
Server
03-04
Server
03-05
Server
03-06
Server
03-07
Server
03-08
Server
03-09
Server
03-10
Unallocated
Visio 2007 linked to structured data
Network Network Server Server Server Server Server Server Server ated
Data Graphics Racks by Power FeedData Graphics - Racks by Power FeedBar chart
01-01
32 11
01-02
32 14
01-03
32
01-04
4833
01-05
32 15
01-06
32 12
01-07
48 43
01-08
32 11
01-09
3210
01-10
3211 1420
15 12 11 10 20
Breaker Actual
02-01
326
02-02
323
02-03
321
02-04
322
02-05
3222
02-06
323
02-07
321
02-08
321
02-09
321
02-10
32206 3 1 2 22 3 1 1 1 20
03-01
0 0
03-02
0 0
03-03
0 0
03-04
0 0
03-05
0 0
03-06
0 0
03-07
0 0
03-08
0 0
03-09
0 0
03-10
0 0
Visio 2007 linked to structured data
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
Data Graphics Racks Exceeding PowerData Graphics - Racks Exceeding PowerIcons show where power / cooling limits exceed design
01-01 01-02 01-03 01-04 01-05 01-06 01-07 01-08 01-09 01-10
1095 2855 3705 5655 2755 2055 7455 1705 1455 7055
02-01 02-02 02-03 02-04 02-05 02-06 02-07 02-08 02-09 02-10
1095 505 2205 1005 7105 7205 1555 2255 2505 7105
03-01 03-02 03-03 03-04 03-05 03-06 03-07 03-08 03-09 03-10
Visio 2007 linked to structured data
2000 0 350 350 0 0 0 350 0 0
Techniques Used by AssetGenTechniques Used by AssetGen
• One update for a component affects – Inventory
Connectivity– Connectivity– Capacity reports - rack space, LAN, SAN, Power
• One repository to collate technical and commercial data
Capacity Management ReviewCapacity Management Review
Performance ManagementMonitoring & AnalysisTuning / Optimisiing
Demand ManagementRequests / Reservation
Reviews
Modelling Capacity CommunicationPlacement options, current limits
Heat flow, resilience, fail overCapacity plan, energy usage
Reporting (Current and Future)
Workflow SystemLife cycle controls
Resource management
Monitoring SystemsMonitoring of current state
AlertingInfrastructure Resource management Alerting
DocumentationSpacePower
CoolinggConnectivity
InventoryProjects
Webinar SummaryWebinar Summary
A b tt d t di f it t i d• A better understanding of capacity management issues and techniques.
• Gained a brief understanding of Square Mile techniques and AssetGen technology
• See the videos on AssetGen web site
N t bi• Next webinars are – Integrating Excel/Visio with data centre management toolsets– The Easy way to build a CMDB– Spreadsheet Chaos! – the problem of infrastructure documentation
Data Centre Capacity ManagementData Centre Capacity Management
CDavid CuthbertsonSquare Mile Systems Ltd
[email protected]+44 (0)870 950 465144 (0)870 950 4651
www.squaremilesystems.com
www.assetgen.com