Connect. Communicate. Collaborate Place your organisation logo in this area GN2 Multidomain...
-
Upload
precious-bice -
Category
Documents
-
view
217 -
download
2
Transcript of Connect. Communicate. Collaborate Place your organisation logo in this area GN2 Multidomain...
Connect. Communicate. Collaborate
Place your organisation logo in this area
GN2 Multidomain Monitoring Service: Serving IP NOCs
Nicolas Simar, DANTE
APM Meeting, Utrecht
24th of November 2006
Connect. Communicate. CollaborateAgenda
• Provide the general concepts of the Multi Domain Monitoring service.– Set the scenes.– You’ll use it soon!– The Support that will be offered to you.
• Demo the visualisations– Provide feedback
• Explain the next steps and what your role will be:– Taking part to the Pilot and Prototype. – Using the tools.– What metric and services will be available – validate the first
choice.
Connect. Communicate. CollaborateWhat is JRA1?
• JRA1 (Performance Measurement and Management) main objective is to build a multi-domain monitoring framework inter-operable across which is the basis to offer a Multi-Domain Monitoring (MDM) Service.
• Consists of the following main parts:• Design and develop the framework (perfSONAR).
• Integrate measurement tools and databases within the perfSONAR framework.
• Build user visualisation tools using the perfSONAR framework.
• There are about 25 participants (12.5 FTE), from 17 organisations.– Main partners are CARNet, CESNET, Cynet, Dante, DFN, NORDUnet,
PSNC.
Connect. Communicate. CollaborateperfSONAR philosophy
Connect. Communicate. CollaborateWhat is perfSONAR?
• perfSONAR is a consortium of organisations who seek to build network performance middleware that is inter-operable across multiple networks.
• perfSONAR is a protocol. – SOAP XML messages and following the Open Grid Forum (OGF)
Network Measurement Working Group (NM-WG).
• perfSONAR is, an example set of code (implementation of web-services using the perfSONAR protocol).
Connect. Communicate. CollaboratePerfSONAR Web-Services
• The framework takes care of the data movement.• It covers the following perfSONAR web-services
– Auth Service (JRA5)
– Autz Service
– Lookup Service (LS)
– Measurement Archives services (MA)• RRD MA, SQL MA, Hades MA
– Measurement Point services (MP)• BWCTL MP, SSH/Telnet MP, CLI MP (I2), L2 status MP (JRA4)
– Topology Service (TopS, cNIS – SA3).
• Allows diversity on the measurement layer and on the visualization layer.
Connect. Communicate. CollaborateperfSONAR philosophy
Connect. Communicate. Collaborate
Multi-Domain Monitoring Service (MDM)
• User : role – group of people making use of a MDM Service. – There may be several categories of users having different needs.
• An MDM service is an access to a set of metrics or functionalities offered to a group of users by several networks using the perfSONAR protocol.
• An MDM service is offered by deploying on a set of perfSONAR web-services and/or visualisations.
• E2E really means Edge to Edge, not End to End (unless end institutions buy into it).
Connect. Communicate. Collaborate
Multi-Domain Monitoring Service
OwnOwnUser
GN2Visualisation
perfSONAR SOAP XML + JRA5 AA
GN2Visualisation
Domain A
BWCTL MPOWD MALookup
Domain B
BWCTL MPOWD MALookup
Domain C
BWCTL MPOWD MALookup
UserVisualisation
perfSONAR SOAP XML + JRA5 AA
Connect. Communicate. Collaborate
Multi-Domain Monitoring Service
• Multi-Domain Monitoring Service– Access to a set of monitoring functionalities (e.g. accessing metric
or performing tests) offered to a group of users accessible directly through an XML SOAP interface (perfSONAR protocol) or through a visualisation tools.
– Based on an underlying set of perfSONAR web-services.
• perfSONAR web-service– Web service (providing data or allowing to perform an action) using
the XML NM-WG. The perfSONAR web-services are the basic building blocs of a MDM service.
Connect. Communicate. CollaborateUsers Segmentation
Advance Trouble-shooting
Trouble-shooting
Service Health Check
Project Trouble-shooting
Project Service Health check
SLA Verificat
ion
Tailored Added Value
Functional
Yes Yes Yes
Yes Yes Yes
[optional] [optional] Yes Yes
[optional] [optional] Yes Yes
[optional] [optional] Yes Yes
Yes Yes Yes
[optional]
Security
Layer3 ProjectPIP Project
End-UserNetwork Researcher
NREN non technical Staff
User group and their Monotoring Data Usage.
PERTNOCLayer2 Project
Connect. Communicate. CollaborateMDM Service Benefits
• For the NOCs – NRENs, EU RENs, GÉANT2 (Abilene(?), ESnet(?), RNP(?), etc). – In DJ1.1.1
• NOCs encounter 5-10% of the problems involving coordination of between multiple domains.
– E2E services/IP packets don’t stop at the boundaries of a domain.– To have an E2E view.
• In particular when offering added value E2E services. • Link capacity, link utilisation, packet drops, topology.
– To have in multiple domain on stand-by tools to perform basic tests.• TCP throughput, link utilisation, delay, looking glass.
– To have the capability of finding out where the tools are located.– To answer the question “End system vs network based problem?”– Send tests results easily.– Save time.
Connect. Communicate. CollaborateMDM Service Benefits
• PERT– Similar than for the NOCs.
• L2 project users (LHC OPN, DEISA, eVLBI).– Can see the health of their service.– Verify SLA.– Integrate the data within their own tools.
• L3 project users (EGEE, eVLBI).– Can see the health of their service.– Verify SLA.– Integrate the data within their own tools– We can provide them added value services (traffic matrix between project
sites).
• End-users when appropriate tools will be made available.– Empowering the network users: indication about the network.– Work not started.
Connect. Communicate. CollaborateGoing Operational• Pre-roll Out – define and set-up support structure now – March 07.• Pilot – April 07 – August 07 – 5 RENs + GÉANT2
– For NOC and PERT (no AA)– Understand the issues of going operational.– Validate the support structure, get feedback for next phase.– Release in January, deployment training in February.
• Prototype – October 07 – February 08 – 11 RENs + GÉANT2– For NOC, PERT and a limited number of projects.– Verify the MDM SLA.– Dedicated support team.– Verify how to provide the service to external parties.– Test the turn key solution.
• Operation – April 08– More RENs, closer to end-institution.– More projects supported.
Connect. Communicate. CollaborateMDM Service Pilot portfolio
MetricperfsonarUI
CNM NEMOVisualperfSON
AR
JRA4 L2
visualisation
XML access
L3 link utilisation Historical Yes Yes Yes Yes
L3 link capacity Historical Yes Yes Yes Yes
L2 circuit status Latest Yes Yes
L2 circuit status Historical Yes Yes
OWD, IPDV, OWPL, Historical Yes Yes Yes
traceroute Historical Yes
Delay RTT On-demand Yes Yes
show command On-demand Yes Yes
Traceroute On-demand Yes Yes
Achievable throughput (TCP) On-demand Yes Yes
UDP throughput On-demand Yes Yes
Service discovery Yes Yes Yes
Service
Telnet/SSH MP
BWCTL MP
Lookup Service
L2 status MP(*)
RRD MA or SQL MA
SQL MA (*)
Hades MA
(*) L2 status MP or SQL MA
Connect. Communicate. Collaborate
MDM Service Prototype portfolio
Prototype/Functionalityperfsonar
UICNM NEMO
VisualperfSONAR
JRA4 L2 visualisati
on
XML access
L3 link utilisation Historical Yes Yes Yes Yes YesL3 link capacity Historical Yes Yes Yes Yes YesL3 Interface Output drops Historical Yes Yes Yes Yes YesL3 Interface Input drops Historical Yes Yes Yes Yes Yes
L2 status MP L2 circuit status Latest Yes YesL2 circuit status Historical Yes YesOWD, IPDV, OWPL Historical Yes Yes Yes
Achievable throughput (TCP) Historical Yes Yes YesUDP throughput Historical Yes Yes Yestraceroute Historical Yes Yes Yes
Delay RTT On-demand Yes Yesshow command On-demand Yes Yes
Traceroute On-demand Yes YesAchievable throughput (TCP) On-demand Yes YesUDP throughput On-demand Yes YesOWD, IPDV, OWPL On-demand Yes Yes
Lookup Service Service discovery Yes Yes Yes Yes Yes
Topology Serivce Topology information Yes Yes Yes YesAuthentication Service Yes Yes Yes Yes YesAuthorisation Service Yes Yes Yes Yes Yes
RRD MA or SQL MA
SQL MA
Hades MA
Telnet/SSH MP
Service
BWCTL MP
Hades MP
AutzAuth or GiDP
In orange, the additional foreseen functionality from the prototype over the Pilot.
Connect. Communicate. CollaborateTaking part to the Pilot
• Deploy the web-services and provide the appropriate data.• Set-up an MDM Level2 support, provide an operational
service.– Ensuring availability of the web-services (Monitor the web-
services), reporting problems following the MDM service procedures.
• Having the NOC and PERT using the infrastructure, solving issues thanks to it and providing feedback.– Training the NOC and PERT.
• Validate the Service at the end of the phase.– Tools, metrics, services.
Connect. Communicate. Collaborate
MDM Web-services (Pilot phase)
Metric/Functionality
L3 link utilisation HistoricalL3 link capacity Historical
L2 circuit status Latest L2 circuit status Historical
OWD, IPDV, OWPL Historical
traceroute Historical
Delay RTT On-demand
show command On-demand
Traceroute On-demand
Achievable throughput (TCP) On-demand
UDP throughput On-demandService discovery
Service
Telnet/SSH MP
BWCTL MP (3 instances per REN)
Lookup Service
L2 status MP(*)
RRD MA or SQL MA
SQL MA (*)Hades MA (3 tool deployment per REN)
(*) To offer L2 status information, you can either chose the L2 status MP or SQL MA.An NRENs will only provide L2 status information when offering L2 circuits to LHC and DEISA.
Connect. Communicate. CollaborateMDM Service Support
• Infrastructure to support the perfSONAR web-services and the visualisation tools used by the MDM will be set-up.– For the deployers: installation, configuration, incident, monitoring.– For users: installation, utilisation.
Users(NOC, PERT,
Projects)
Deployers(RENs)
SLA
User Service DeskISS
SLADeployer Service
Desk
Connect. Communicate. CollaborateMDM Service Support• Level1 – Service Desk (ISS)
– Help to install, configure the tools, run reachability tests, help on usability, track the RFE, forward problem to proper person, log the requests, update the documentation, track bugs. This is a central function (rotating member or group of people - ownership).
• Level2 – Administrator (RENs)– Administrator of the machines where the services are installed. The
function lies within the providers. They are in charge of taking care of the security of the services, of their availability (up) and reachability (no firewall, etc). The service should be available 24/7.
• Level3 – Developers (3 years subcontract).– The JRA1 developers who have build the services. They are in charge of
implementing new features and fixing bugs and of answering the query forwarded by level1.
• The three levels of support will be available to both the users and the deployers.
Connect. Communicate. CollaborateMDM Service Support
• A turn key solution service could be provided for the web-services of a MDM service or part of it.– HW bought.– Web-services installed, monitored and managed on the REN
behalves. – REN would still have to do a little bit.
• More information about the MDM service in January– Transition to Service session on Tuesday afternoon. – What question have you got to be answered during that session?
Connect. Communicate. CollaborateVisualisations
Advance Trouble-shooting
Trouble-shooting
Service Health Check
Project Trouble-shooting
Project Service Health check
SLA Verificati
on
Tailored Added Value
Functional
Yes Yes Yes Yes Yes v
Layer2 ProjectLayer3 ProjectPIP Project
Yes Yes [o]+[a] Yes v Yes v
Yes Yes Yes Yes Yes Yes
User group and their Monotoring Data Usage.
PERTNOC
perfsonarUICNM
Visualisation Tools… … and their usages.
NEMO
VisualperfSONARJRA4 E2E L2 visualisation
In Red: Targeted for the Pilot. In Orange: Probably targeted for the Prototype (in addition to the Pilot ones)To find out what user group will use as visualisation tool, chose one type of usageand find out, in the same column in the second table the tools available for this usage.