Prof. Richard O. Sinnott National e-Science Centre University of Glasgow, Scotland

AHM 2008, 11th September 2008. Supporting Security-Oriented Interdisciplinary Research: Crossing the Social, Clinical and Geospatial Domains. Prof. Richard O. Sinnott, National e-Science Centre, University of Glasgow, Scotland


Page 1

AHM 2008, 11th September 2008

Supporting Security-Oriented Interdisciplinary Research: Crossing the Social, Clinical and Geospatial Domains

Prof. Richard O. Sinnott
National e-Science Centre
University of Glasgow, Scotland

Page 2

The Context

Many Grids
– EGEE, NGS, D-Grid, Naregi, OSG, …
Many definitions and standards
– OGSA, OGSI, WSRF, WS-I, WS-ACRONYM-GOES-HERE…
Many solutions
– Semantic web/Grid, ...
– Web 2.0, wikis, mash-ups, collaboratories, clouds (fluffy/ill defined!)…
– Unicore, Globus, gLite, WS-…, OGSA-DAI, …

Tis all a bit (a lot!!!) of a mess… couple that with the data and knowledge explosion in many (all?) domains, and we have a recipe for chaos

“Grid” to me is a solution that supports
– (simple) seamless access to a heterogeneous variety of compute and data resources
» Often domain specific – especially data!
– (simple) single sign-on
– researchers and research, especially inter- and trans-disciplinary research
» often at the risk of being non-sexy!

Page 3

Interdisciplinary e-Health Example

[Figure labels: Nucleotide sequences, Nucleotide structures, Gene expressions, Protein structures, Protein functions, Protein-protein interaction (pathways), Cell, Cell signalling, Tissues, Organs, Physiology, Organisms, Populations]

Security-oriented Middleware

biologists, bioinformaticians, statisticians, clinicians, pharmacists, epidemiologists, physicists, chemists, ...

+ environmental, social, geographic …

VOTES, DAMES, SeeGEO

Page 4

Example of Inter-disciplinary Research

Typical Query
What is the correlation between living adult males over 50 years of age in Scotland who have had type-2 diabetes for 5 years or more and those employed in manual versus office jobs, i.e. does having the type-2 diabetes condition imply that those afflicted are more likely to be employed in manual or office jobs? Where in Scotland is this most prevalent?
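A minimal sketch of how a query of this shape might be expressed once clinical and occupational data can be joined on a common linkage key. The table names (`clinical`, `occupation`), the columns and every sample row are hypothetical, invented for illustration; they are not the VOTES or DAMES schemas.

```python
import sqlite3

# In-memory database with two made-up tables standing in for a clinical
# source and a social-science (occupational) source.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE clinical (person_id TEXT, sex TEXT, age INTEGER, alive INTEGER,
                       years_with_t2_diabetes INTEGER, region TEXT);
CREATE TABLE occupation (person_id TEXT, job_type TEXT);
""")
conn.executemany("INSERT INTO clinical VALUES (?,?,?,?,?,?)", [
    ("p1", "M", 56, 1, 7, "Glasgow"),
    ("p2", "M", 61, 1, 0, "Glasgow"),
    ("p3", "M", 53, 1, 6, "Lothian"),
    ("p4", "F", 58, 1, 9, "Lothian"),   # excluded by the query: not male
    ("p5", "M", 45, 1, 8, "Glasgow"),   # excluded by the query: not over 50
    ("p6", "M", 66, 1, 5, "Glasgow"),
])
conn.executemany("INSERT INTO occupation VALUES (?,?)", [
    ("p1", "manual"), ("p2", "office"), ("p3", "office"),
    ("p4", "manual"), ("p5", "manual"), ("p6", "manual"),
])

# Living males over 50, grouped by region and job type, counting how many
# in each group have had type-2 diabetes for 5 years or more.
QUERY = """
SELECT c.region, o.job_type,
       SUM(c.years_with_t2_diabetes >= 5) AS long_term_t2,
       COUNT(*) AS total
FROM clinical c JOIN occupation o USING (person_id)
WHERE c.sex = 'M' AND c.age > 50 AND c.alive = 1
GROUP BY c.region, o.job_type
ORDER BY c.region, o.job_type
"""
rows = conn.execute(QUERY).fetchall()
for region, job_type, long_term_t2, total in rows:
    print(region, job_type, f"{long_term_t2}/{total}")
```

In practice the two tables would live in different organisations under different access policies, which is the problem the rest of the talk addresses; the join here only shows the shape of the question.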

Why?
Health inequalities, impacts of policies, …

For example…
– Male life expectancy for the whole of Glasgow averages 70.7 years
– In East Glasgow, it goes right down to 53.9 years in the Calton ward
– UK National Average 77 years, Mongolia 65, Ghana 59, Gambia 54

Page 5

Data, Data, Data

No magic bullet for data management on the Grid

You can use data if you
– a/ know where it is,
– b/ are allowed to access it,
– c/ know its format,
– d/ trust it is authentic,
– e/ are sure of its quality,
– f/ have the right local widget to talk to the right remote widget
– z/ …

a/ there are MANY, MANY, MANY resources out there
– Tis scary just how big the internet is! (see later)

b/ there are MANY, MANY, MANY ways to define and enforce access policies
– Grid sexy security stuff vs real world of NHS, Range of data stakeholders, Ethics, Information governance, Sys-admin/user perspectives…

Page 6

Data Grids

There is no single solution

Why?
– Things change
– Science revolution
– Grid technology revolution

General principles/patterns are what we need

How do I set up a Virtual Organisation to do research into X?
– Connecting users/software/resources across sites
– Seamless access, End-end security, …

How do I connect multiple Virtual Organisations to do research into X, Y and Z?
– Clinical VOs vs other VOs

Page 7

Grid Security

AAAA

Users like usernames/passwords
– Provide them (once!)

Users don’t like/understand X.509 based PKI
– Forget training, education for most users!
– $> openssl pkcs12 -in cert.p12 -clcerts -nokeys -out usercert.pem!

The vast majority most certainly won’t jump through hoops to get on the Grid
– “me-Science” culture

Should all be transparent to end users and aligned with the way they want to work/access resources

Access Management Federation (Shibboleth) + authZ technologies

Page 8

Shibboleth Decentralised Approach

[Diagram labels: User; W.A.Y.F.; Federation; Identity Provider (Home Institution, AuthN, AuthZ, LDAP, uid); Service provider (Shib Frontend, Grid Portal, Grid Application)]

1. User points browser at Grid resource/portal
2. Shibboleth redirects user to W.A.Y.F. service
3. User selects their home institution
4. Home site authenticates user and pushes attributes to the service provider
5. Pass authentication info and attributes to authZ function
6. Make final AuthZ decision

Log-in once and roam
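The six steps above can be sketched as a toy, in-memory model. This is not the Shibboleth or SAML API: the class names, the `roles` attribute and the password check are all assumptions made for illustration, and a real deployment exchanges signed assertions between browser, identity provider and service provider.

```python
from dataclasses import dataclass


@dataclass
class IdentityProvider:
    """Home institution: authenticates its own users and releases attributes."""
    name: str
    passwords: dict    # username -> password (stand-in for the LDAP AuthN store)
    attributes: dict   # username -> attribute dict (stand-in for the LDAP AuthZ store)

    def authenticate(self, username, password):
        # Step 4: the home site authenticates the user and releases attributes.
        if self.passwords.get(username) != password:
            return None
        return {"uid": username, "idp": self.name, **self.attributes.get(username, {})}


@dataclass
class ServiceProvider:
    """Grid portal sitting behind a Shibboleth-style front end."""
    federation: dict    # institution name -> IdentityProvider (plays the W.A.Y.F. role)
    required_role: str  # hypothetical local authorisation policy

    def access(self, home_institution, username, password):
        # Steps 1-3: the user arrives, is sent to the W.A.Y.F. list and picks a home site.
        idp = self.federation.get(home_institution)
        if idp is None:
            return False
        attrs = idp.authenticate(username, password)
        if attrs is None:
            return False
        # Steps 5-6: attributes are passed to the authZ function, which decides locally.
        return self.required_role in attrs.get("roles", [])


glasgow = IdentityProvider(
    name="Glasgow",
    passwords={"alice": "s3cret"},
    attributes={"alice": {"roles": ["clinical-researcher"]}},
)
portal = ServiceProvider(federation={"Glasgow": glasgow}, required_role="clinical-researcher")

print(portal.access("Glasgow", "alice", "s3cret"))  # authenticated and authorised
print(portal.access("Glasgow", "alice", "wrong"))   # fails authentication at the home site
print(portal.access("Oxford", "alice", "s3cret"))   # home site not in this toy federation
```

The point of the split is that the portal never sees or stores the password policy of the home institution; it only receives attributes and applies its own authorisation rule.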

Page 9

Inter-disciplinary Data

Data, data everywhere… Or better yet, services, services everywhere…

Clinical Data: VOTES project
– Primary care data
– Secondary care data
– Disease registries, …

Social Science Data: DAMES project
– Occupational data
– Social classification
– Census data (educational, housing, family, ...)
– Survey data sets
– Ethnicity …

Geospatial data: SeeGEO project
– EDINA UK Borders
– DigiMap

Page 10

Security-oriented Socio-, Geo-, Clinical Data Infrastructures

Licenses, privileges

Others

Page 11

VOTES

Virtual Organisations for Trials and Epidemiological Studies

3 year (£2.8M) MRC funded project started October 2005

Plans to develop framework for producing Grid infrastructures to address key components of clinical trial/observational study
– Recruitment of potentially eligible participants
– Data collection during the study
– Study administration and coordination
– Involves Glasgow, Oxford, Leicester/Nottingham, Manchester, Imperial
» Strong links with UK Biobank

[Diagram labels: Clinical Virtual Organisation Framework; Used to realise; CVO-1 (e.g. for data collection); CVO-2 (e.g. for recruitment); GLA, OX, IMP, Lei-Nott; GPs; Disease registries; Hospital databases; Clinical trial data sets; Transfer Grid]

Page 12

VOTES Scottish Experiences

Scottish Data Space… up to now

Scottish Care Information (SCI) Store
– Hospital batch system rolled out across Scotland (lab data, patient records…)

Scottish Morbidity Records (SMR)
– Aggregated clinical records from last 40 years across Scotland
– We have been given pseudo-anonymised:
» SMR01A General acute inpatient and day case discharges (3,719,206 records)
» SMR04A Psychiatric and mental handicap hospitals and units: admissions, residents and discharges (241,599 records)
» SMR06A Scottish cancer registrations (171,167 records)
» SMR99A Deaths (173,615 records)

General Practitioners Administration System for Scotland (GPASS)
– Used by 85% of GPs across Scotland

Consent
– Opt-in/opt-out trial, study, disease area, …

Applied in range of areas/projects:
– UK Biobank, Congenital anomaly, Brain trauma, Diabetes, Knee pain/obesity, Prostate cancer, …

Community Health Index (CHI) number key to this!
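One common way to link pseudo-anonymised records across data sets is to replace the identifier with a keyed hash, so the same person maps to the same pseudonym in every source without the identifier itself being exposed. The sketch below assumes that approach purely for illustration: the slides do not say how the SMR extracts were pseudo-anonymised, and the key, the identifier values and the record fields are all made up.

```python
import hashlib
import hmac

# Assumption: a secret key held only by the data custodian.
SECRET_KEY = b"hypothetical-linkage-key"


def pseudonymise(identifier: str) -> str:
    """Map an identifier to a stable pseudonym using HMAC-SHA256."""
    return hmac.new(SECRET_KEY, identifier.encode("utf-8"), hashlib.sha256).hexdigest()


def strip_identifier(records):
    """Return copies of the records with 'chi' replaced by a pseudonym 'pid'."""
    return [
        {"pid": pseudonymise(r["chi"]), **{k: v for k, v in r.items() if k != "chi"}}
        for r in records
    ]


# Made-up records standing in for two separately held data sets.
inpatient = [{"chi": "0000000001", "diagnosis": "type-2 diabetes"}]
deaths = [{"chi": "0000000001", "year": 2007}, {"chi": "0000000002", "year": 2006}]

inpatient_p = strip_identifier(inpatient)
deaths_p = strip_identifier(deaths)

# Link on the pseudonym only; the original identifier never leaves the custodian.
deaths_by_pid = {r["pid"]: r for r in deaths_p}
linked = [{**r, **deaths_by_pid[r["pid"]]} for r in inpatient_p if r["pid"] in deaths_by_pid]
print(linked)
```

A keyed hash rather than a plain hash is used here because identifiers drawn from a small, structured space could otherwise be recovered by hashing every candidate value.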

Page 13

DAMES

NCeSS Data Management through e-Social Science node
– Led by Stirling
– NeSC Glasgow involvement started August 2008

Occupational data
Social classification
Census data (educational, housing, family, ...)
Survey data sets
Ethnicity
…

Page 14

DAMES::GEODE

Page 15

DAMES::GEODE

Page 16

DAMES::EuroOccupations

Page 17

DAMES::UK Data Archives

Page 18

DAMES::CESSDA

Page 19

DAMES::ONS

Page 20

SeeGEO::EDINA

Page 21

SeeGEO Project

Census data 1991 and 2001

Changes in UK Borders

Page 22

Demo Walk Through

1. User attempts to access clinical trials portal

Page 23

Page 24

Page 25

Page 26

Page 27

Page 28

2. User redirects browser to geospatial portal

Page 29

Page 30

Page 31

SeeGEO Project

Page 32

3. User redirects browser to DAMES portal

Page 33

Page 34

Page 35

Conclusions

Systems driven by Information Governance/Ethics
– MREC, LREC, PAC, PIAG, Caldicott Guardians, Joe Public

Once defined have tools/techniques to rapidly roll-out e-Infrastructures to support researchers
– Diabetes? Cancer? Obesity? Smoking? Health/Wealth? Genetics and Healthcare? Nature / Nurture?

Focus not on single VO but supporting many VOs that have their own access/usage policies

Understanding data models is ESSENTIAL to make any of this work!!!

Page 36

A Final Word

Page 37

DAMES::SCROL

Page 38

DAMES::SCROL

Page 39

DAMES::SCROL

Page 40

Questions …?