EMBL€Identity€&€Access€Management - e-IRGe-irg.eu/documents/10920/272661/6+embl_e-irg... ·...

25
EMBL Identity & Access Management Rupert Lück IT Services EMBL Heidelberg e-IRG Workshop Zürich Apr 24th 2008

Transcript of EMBL€Identity€&€Access€Management - e-IRGe-irg.eu/documents/10920/272661/6+embl_e-irg... ·...

Page 1: EMBL€Identity€&€Access€Management - e-IRGe-irg.eu/documents/10920/272661/6+embl_e-irg... · EMBL€Services More€than 2000€Facility€Usersper€year use€the€radiation€sources€for€structural€biology

EMBL Identity & Access Management

Rupert LückIT ServicesEMBL Heidelberg

e­IRG Workshop ZürichApr 24th 2008

Page 2: EMBL€Identity€&€Access€Management - e-IRGe-irg.eu/documents/10920/272661/6+embl_e-irg... · EMBL€Services More€than 2000€Facility€Usersper€year use€the€radiation€sources€for€structural€biology

2IT ServicesHEIDELBERG

Outline

EMBL Overview

Identity & Access Management for EMBL

– IT Requirements & Strategy– Project Goal and Features– Defining the scope– Integrated User Management– Benefits

Page 3: EMBL€Identity€&€Access€Management - e-IRGe-irg.eu/documents/10920/272661/6+embl_e-irg... · EMBL€Services More€than 2000€Facility€Usersper€year use€the€radiation€sources€for€structural€biology

3IT ServicesHEIDELBERG

EMBL

European Molecular BiologyLaboratorySupported by 20 MemberStates (+1 associated:    )1500 staff & researchersfrom 60 nations

Page 4: EMBL€Identity€&€Access€Management - e-IRGe-irg.eu/documents/10920/272661/6+embl_e-irg... · EMBL€Services More€than 2000€Facility€Usersper€year use€the€radiation€sources€for€structural€biology

4IT ServicesHEIDELBERG

Heidelberg, DE:Main Lab, Basic molecularbiology research

Hinxton, UK:EBI, Bioinformatics databases,research & services

Grenoble, F &Hamburg, DE:Structural Biology

Monterotondo, I:Mouse Biology

EMBL Sites

Page 5: EMBL€Identity€&€Access€Management - e-IRGe-irg.eu/documents/10920/272661/6+embl_e-irg... · EMBL€Services More€than 2000€Facility€Usersper€year use€the€radiation€sources€for€structural€biology

5IT ServicesHEIDELBERG

Flagship Lab for Basic Research in Molecular BiologyInstrumentation & Technology DevelopmentServicesAdvanced TrainingTechnology transfer

EMBL’s Mission

Page 6: EMBL€Identity€&€Access€Management - e-IRGe-irg.eu/documents/10920/272661/6+embl_e-irg... · EMBL€Services More€than 2000€Facility€Usersper€year use€the€radiation€sources€for€structural€biology

6IT ServicesHEIDELBERG

Systems Biology: From Molecules to Organisms

Genome EmbryoCell

Fruitfly

Protein/DNA

Mouse Human Development,Ageing, Disease

Page 7: EMBL€Identity€&€Access€Management - e-IRGe-irg.eu/documents/10920/272661/6+embl_e-irg... · EMBL€Services More€than 2000€Facility€Usersper€year use€the€radiation€sources€for€structural€biology

7IT ServicesHEIDELBERG

Systems Biology

Understand Cell Function as a dynamic biological system– Away from one gene –one function concept– Towards quantitative understanding of living systems

Involves– Interdisciplinary Research across scientific domains

Collaboration infrastructuresData sharing & data integration

– Quantitative studies & Integration of information– Technologically complex experimentation– Computational approaches

modeling and simulationHighly compute and storage intensive (Grid technology)

Page 8: EMBL€Identity€&€Access€Management - e-IRGe-irg.eu/documents/10920/272661/6+embl_e-irg... · EMBL€Services More€than 2000€Facility€Usersper€year use€the€radiation€sources€for€structural€biology

8IT ServicesHEIDELBERG

Instrumentation & Technology Development

NG Sequencing, microarrays, databases, screens…Light Microscopy (4D confocal microscopy, cell assays screening, … )Electron & Synchrotron tomographyHigh throughput proteomics and structure analysisModelling of biological processessmall animal imaging

Large amounts of heterogeneous data (PetaByte+ range)Significant needs for Network, Compute & Storage ResourcesScalability of IT

Page 9: EMBL€Identity€&€Access€Management - e-IRGe-irg.eu/documents/10920/272661/6+embl_e-irg... · EMBL€Services More€than 2000€Facility€Usersper€year use€the€radiation€sources€for€structural€biology

9IT ServicesHEIDELBERG

EMBL Services

More than 2000 Facility Users per yearuse the radiation sources for structural biology

More than 200,000 scientists per yearfrom all life sciences branchesuse the EMBL bioinformatic data resources

More than 1000 visitors per yearbenefit from state­of­the­art equipmentlearn new techniquescarry out collaborative projects

Page 10: EMBL€Identity€&€Access€Management - e-IRGe-irg.eu/documents/10920/272661/6+embl_e-irg... · EMBL€Services More€than 2000€Facility€Usersper€year use€the€radiation€sources€for€structural€biology

10IT ServicesHEIDELBERG

0102030405060708090

100

Jun­

82

Jun­

84

Jun­

86

Jun­

88

Jun­

90

Jun­

92

Jun­

94

Jun­

96

Jun­

98

Jun­

00

Jun­

02

Jun­

04

EBI Services

Reference site for biological data– 150 different databases– 120+ different tools.– 9 different data submission systems.– 8 major query interfaces.

User base– Rapidly growing– > 100.000 different Users / Month– Scientific community– Pharma & Biotech Industry

Trends– Rapid growth of data– Faster than Moore’s law

=> Service oriented architectureWeb Service based accessDatabase FederationGrid approach

00.20.40.60.8

11.21.41.61.8

2

1999 2000 2001 2002 2003 2004 2005

year

web

 reqs

/day

 (mill

ions

)

EBI web requests / day(millions)

[ Source: Peter Stoehr, EBI ]

Gba

ses EMBL­Bank

Growth in Gbases1982­2005

Page 11: EMBL€Identity€&€Access€Management - e-IRGe-irg.eu/documents/10920/272661/6+embl_e-irg... · EMBL€Services More€than 2000€Facility€Usersper€year use€the€radiation€sources€for€structural€biology

11IT ServicesHEIDELBERG

Outline

EMBL Overview

Identity & Access Management for EMBL

– IT Requirements & Strategy– Project Goal and Features– Defining the scope– Integrated User Management– Benefits

Page 12: EMBL€Identity€&€Access€Management - e-IRGe-irg.eu/documents/10920/272661/6+embl_e-irg... · EMBL€Services More€than 2000€Facility€Usersper€year use€the€radiation€sources€for€structural€biology

12IT ServicesHEIDELBERG

IT Requirements & Strategy

IT Requirements– Collaboration IT Environment

to support Interdisciplinary research– Scalability, Efficiency & Reliability

of IT infrastructure and processes

Strategy– Institution­wide Collaboration Platform– Identity & Access management solution– Consolidation– IT Standards

Page 13: EMBL€Identity€&€Access€Management - e-IRGe-irg.eu/documents/10920/272661/6+embl_e-irg... · EMBL€Services More€than 2000€Facility€Usersper€year use€the€radiation€sources€for€structural€biology

13IT ServicesHEIDELBERG

Project: Identity & Access Management for EMBL

Project goal– Provide an EMBL­wide user database  – EMBL Network Passport

Key features– Based on an LDAP– Identity management and provisioning infrastructure– Unified Login and Single­Sign­On where reasonable– Automated fine­grained provisioning of resources to different user

populations– Balanced implementation effort and cost– Future flexibility

Page 14: EMBL€Identity€&€Access€Management - e-IRGe-irg.eu/documents/10920/272661/6+embl_e-irg... · EMBL€Services More€than 2000€Facility€Usersper€year use€the€radiation€sources€for€structural€biology

14IT ServicesHEIDELBERG

Defining the scope

ResourcesUser & Client populationsAccess rolesIT Security domains

Page 15: EMBL€Identity€&€Access€Management - e-IRGe-irg.eu/documents/10920/272661/6+embl_e-irg... · EMBL€Services More€than 2000€Facility€Usersper€year use€the€radiation€sources€for€structural€biology

15IT ServicesHEIDELBERG

IT Resource Landscape

HPC Clusters– Several 1000 CPU cores– mainly in Heidelberg and at the EBI– NIS

Storage Systems– > 700 TByte primary storage– on NetApp and BlueArc NAS– 3 PB secondary storage– NIS, AD

Network– WLAN (Radius)– VPN (Radius)– Multiple VLANs– Inter­campus VPN

Applications– Small to enterprise level application

server based– Web apps and native clients– Scientific and commercial line of

business systems– LDAP, individual access silos

Database systems– Oracle– MySQL

Desktop and Server Systems– Operating systems (Windows,

MacOS X and Linux)

Page 16: EMBL€Identity€&€Access€Management - e-IRGe-irg.eu/documents/10920/272661/6+embl_e-irg... · EMBL€Services More€than 2000€Facility€Usersper€year use€the€radiation€sources€for€structural€biology

16IT ServicesHEIDELBERG

User / Client populations

Named users– Staff:

~1500 across 5 different EMBL sites9yr contracts max.

– Visitors: >1000 / Year– Facility users: >2000 / Year– Contractors & Consultants

– e­Collaborators: >500– Alumni: >4000– Industry: collaborations & programme

Public access:– Scientific tool and content DB user populations (200.000+)

– High fluctuation– Even between populations

Page 17: EMBL€Identity€&€Access€Management - e-IRGe-irg.eu/documents/10920/272661/6+embl_e-irg... · EMBL€Services More€than 2000€Facility€Usersper€year use€the€radiation€sources€for€structural€biology

17IT ServicesHEIDELBERG

Access Roles(selection)

VPN Access

Unix / NIS AccountWindows / Active Directory Account

Email AccountAccess to IntranetAccess to shared workspacesAccess to resource booking system (Microscopes, Rooms, etc.)SAP: can use online shopping module (SRM)SAP Modules X, Y, Z: can manage dataAccess to scientific application X,Y,Z

Oracle DB user / access roles

Page 18: EMBL€Identity€&€Access€Management - e-IRGe-irg.eu/documents/10920/272661/6+embl_e-irg... · EMBL€Services More€than 2000€Facility€Usersper€year use€the€radiation€sources€for€structural€biology

18IT ServicesHEIDELBERG

IT Security domains

EMBL’s organization is distributed across 5 sites

Individual IT Services organizations– Responsible for local IT management

(Site in Rome, managed from Heidelberg)– Local IT security– Inter­site security as a joint effort

Split user domainsBlocks efficient collaboration

Page 19: EMBL€Identity€&€Access€Management - e-IRGe-irg.eu/documents/10920/272661/6+embl_e-irg... · EMBL€Services More€than 2000€Facility€Usersper€year use€the€radiation€sources€for€structural€biology

19IT ServicesHEIDELBERG

HR IT

User Management Until 2007

Oracle DBsEMBL Groups (Web), Visitors,

PhD, EIPOD, Alumni, Consultants

monthlyexport

EMBL Web Pages &Web Applications

Other IT resources

IT resourcesApplications & Operating Systems

HR SystemPayroll & Staff

replacereplace

replacereplace

Unix,Windows,Mail,VPNetc.

HR Data

notlinked

Page 20: EMBL€Identity€&€Access€Management - e-IRGe-irg.eu/documents/10920/272661/6+embl_e-irg... · EMBL€Services More€than 2000€Facility€Usersper€year use€the€radiation€sources€for€structural€biology

20IT ServicesHEIDELBERG

User Management Short comings

Many different identities in different systemsHuge efforts– to manage individual identities and access profiles– To achieve a reasonable level of consistency

No fine­grained assignment of access patternsBy default only access to IT infrastructure of users EMBL home siteMany existing (self developed) systems cannot be integrated withothers

Page 21: EMBL€Identity€&€Access€Management - e-IRGe-irg.eu/documents/10920/272661/6+embl_e-irg... · EMBL€Services More€than 2000€Facility€Usersper€year use€the€radiation€sources€for€structural€biology

21IT ServicesHEIDELBERG

Integrated User & Access Management 2008+

LDAP / Oracle IMEMBL User Directory &Identity Management

Unix,Windows,Mail,VPNWeb CMS,SAP,Oracle,etc.

Master Data(one central resource)

Payroll, StaffEMBL Groups (Web),EIPOD, PhD, Visitors,Alumni, Consultants

Access Management

SAP HR / OM

sync

IT resourcesApplications & Operating Systems

Template basedProvisioning

User & Identity Management

syncHR IT

Page 22: EMBL€Identity€&€Access€Management - e-IRGe-irg.eu/documents/10920/272661/6+embl_e-irg... · EMBL€Services More€than 2000€Facility€Usersper€year use€the€radiation€sources€for€structural€biology

22IT ServicesHEIDELBERG

Integrated User Management Benefits

One central user directory (LDAP)– for all people associated with EMBL– from all sites– not only staff

Automation of access rights management and provisioning to ITresourcesReal time information displayed on the EMBL webLDAP is a standard component– Easy Integration in future projects– Can also be used by any application developer within EMBL– Integration projects costs significantly lower

Page 23: EMBL€Identity€&€Access€Management - e-IRGe-irg.eu/documents/10920/272661/6+embl_e-irg... · EMBL€Services More€than 2000€Facility€Usersper€year use€the€radiation€sources€for€structural€biology

23IT ServicesHEIDELBERG

Integrated User Management Collaboration Benefits

EMBL­wide unified login (username & password)e.g. NIS, Windows, SAP, Storage systems,…

Ability to login while visiting another EMBL site

Access to remote (expensive) analysis tools e.g. via Terminal Server

Secure sharing of data with EMBL colleagues from remote sites

Resource booking and checking peoples availability across theorganization

Page 24: EMBL€Identity€&€Access€Management - e-IRGe-irg.eu/documents/10920/272661/6+embl_e-irg... · EMBL€Services More€than 2000€Facility€Usersper€year use€the€radiation€sources€for€structural€biology

24IT ServicesHEIDELBERG

Integrated User Management Technical Benefits

Provisioning templates allow fine­grained access management– i.e. a user population could get access to many resources– Others only could be assigned email­only access

Why a commercial solution– Vendors like Oracle provide out­of­the­box connectors to other access

infrastructures, e.g.Active DirectoryLDAP (various vendors)UNIX, NISSAP (various modules)

– Allows faster and cost effective integration of other infrastructures– Federations:

Supports Liberty alliance standardFederations across organizations also to industry partners

Page 25: EMBL€Identity€&€Access€Management - e-IRGe-irg.eu/documents/10920/272661/6+embl_e-irg... · EMBL€Services More€than 2000€Facility€Usersper€year use€the€radiation€sources€for€structural€biology

25IT ServicesHEIDELBERG

Summary

Systems biology at EMBL requires a collaborative, scalable andsecure IT environment to enable research and to protect IP

The introduced an identity management and provisioninginfrastructure is one of the key components to support thisrequirement

It allows automated fine­tuning of individual access scenarios

Allows fast and cost effective integration of other infrastructures