DwB Discovery Portal · DwB Discovery Portal A New CESSDA Portal for European Research Data...

21
DwB Discovery Portal A New CESSDA Portal for European Research Data Discovery John Shepherdson - UKDA Pascal Heus - Metadata Technology Ørnulf Risnes - NSD

Transcript of DwB Discovery Portal · DwB Discovery Portal A New CESSDA Portal for European Research Data...

Page 1: DwB Discovery Portal · DwB Discovery Portal A New CESSDA Portal for European Research Data Discovery John Shepherdson - UKDA Pascal Heus - Metadata Technology Ørnulf Risnes - NSD

DwB Discovery Portal A New CESSDA Portal for European Research Data Discovery

John Shepherdson - UKDA Pascal Heus - Metadata Technology Ørnulf Risnes - NSD

Page 2: DwB Discovery Portal · DwB Discovery Portal A New CESSDA Portal for European Research Data Discovery John Shepherdson - UKDA Pascal Heus - Metadata Technology Ørnulf Risnes - NSD

Overview  

•  Supports  DwB  goal  “equal  and  easy  access  to  official  microdata  for  the  European  Research  Area”  

Ø provides  “more  coherent  system  for  resource  discovery  of  official  sta@s@cs”  

Ø demonstrates  ability  to  ingest  metadata  from  mul@ple  sources,  via  mul@ple  protocols  

 

Page 3: DwB Discovery Portal · DwB Discovery Portal A New CESSDA Portal for European Research Data Discovery John Shepherdson - UKDA Pascal Heus - Metadata Technology Ørnulf Risnes - NSD

Scope  of  the  portal  work  in  WP12  

•  Content-­‐wise  Ø Metadata  from  NSIs  +  Archives  

•  Technical  Ø Build  prototype/beta    Ø Sound,  future  proof  methods,  architecture,  components  

Ø Standards-­‐based  Ø Extensible  Ø Easy  to  hand  over  to  ‘sustainability’  body  

Page 4: DwB Discovery Portal · DwB Discovery Portal A New CESSDA Portal for European Research Data Discovery John Shepherdson - UKDA Pascal Heus - Metadata Technology Ørnulf Risnes - NSD

Func9onal  aspects  of  the  portal  

•  Research  data  discovery  (obviously)  

•  Provider  portal,  QA  

•  PlaRorm  for  addi@onal  services  

Page 5: DwB Discovery Portal · DwB Discovery Portal A New CESSDA Portal for European Research Data Discovery John Shepherdson - UKDA Pascal Heus - Metadata Technology Ørnulf Risnes - NSD

Metadata  ingest  

•  Metadata  gets...  Ø harvested  Ø made  ready  for  QA  Ø transformed  into  canonical  model  

Ø indexed  Ø exposed  

Page 6: DwB Discovery Portal · DwB Discovery Portal A New CESSDA Portal for European Research Data Discovery John Shepherdson - UKDA Pascal Heus - Metadata Technology Ørnulf Risnes - NSD

Canonical  metadata  model  •  Harmonisa@on  

Ø DDI-­‐C,  DDI-­‐L,  MISSY,  CIMES,  etc.  

•  Builds  on  DISCO    Ø DDI  discovery  RDF  

Page 7: DwB Discovery Portal · DwB Discovery Portal A New CESSDA Portal for European Research Data Discovery John Shepherdson - UKDA Pascal Heus - Metadata Technology Ørnulf Risnes - NSD

Metadata  ingest  dependencies  Step          Source/standard  1)  Harves@ng        specific  2)  Produce  harves@ng  report      specific  3)  Conversion  to  Raw-­‐RDF      agnos@c  4)  Produce  conversion  report      agnos@c  5)  Harmoniza@on          agnos@c  6)  Produce  harmoniza@on  report      agnos@c  7)  Loading          agnos@c  8)  Produce  loading  report        agnos@c  9)  Indexing          agnos@c  10)  Produce  indexing  report        agnos@c  11+)  Discovery,  other  downstream  processes    agnos@c  

Page 8: DwB Discovery Portal · DwB Discovery Portal A New CESSDA Portal for European Research Data Discovery John Shepherdson - UKDA Pascal Heus - Metadata Technology Ørnulf Risnes - NSD

PlaAorm  for  services  

•  DwB  search  portal  is  just  a  front-­‐end  applica@on  

•  Machine-­‐ac@onable  interfaces  for  most  func@ons  (REST)  

Page 9: DwB Discovery Portal · DwB Discovery Portal A New CESSDA Portal for European Research Data Discovery John Shepherdson - UKDA Pascal Heus - Metadata Technology Ørnulf Risnes - NSD

Search  Portal  (alpha)  •  Powered  by  Solr    •  Facets  

Ø producer,  geography,  date,  data  type  …  

 

Page 10: DwB Discovery Portal · DwB Discovery Portal A New CESSDA Portal for European Research Data Discovery John Shepherdson - UKDA Pascal Heus - Metadata Technology Ørnulf Risnes - NSD

Search  Portal  (alpha)  •  Sugges@ons  /  autocomplete  

 

Page 11: DwB Discovery Portal · DwB Discovery Portal A New CESSDA Portal for European Research Data Discovery John Shepherdson - UKDA Pascal Heus - Metadata Technology Ørnulf Risnes - NSD

Search  Portal  (alpha)  •  ‘Did  you  mean?’  func@onality  

Page 12: DwB Discovery Portal · DwB Discovery Portal A New CESSDA Portal for European Research Data Discovery John Shepherdson - UKDA Pascal Heus - Metadata Technology Ørnulf Risnes - NSD

Sprint  in  Colchester,  May  2014  •  Use  Jenkins  CI  tool  to:  

Ø Harvest  Nesstar  metadata  §  any  public  instance  

Ø Load  DDI  XML  in  to  BaseX    Ø Convert  DDI  XML  to  raw  DwB-­‐RDF  Ø Harmonize  DwB-­‐RDF  

§  Simple    

Page 13: DwB Discovery Portal · DwB Discovery Portal A New CESSDA Portal for European Research Data Discovery John Shepherdson - UKDA Pascal Heus - Metadata Technology Ørnulf Risnes - NSD

Sprint  in  Colchester,  May  2014  •  Integrated  Jenkins  with  Git  to  

Ø Build  Nesstarvester  and  BasexSync  tools  automa@cally  Ø Update  harmoniza@on  scripts  automa@cally  

•  Iden@fied  mechanism  to  detect  metadata  language  Ø So  can  check  language  tag  is  correct  

•  Produced  Solr  schema  

Page 14: DwB Discovery Portal · DwB Discovery Portal A New CESSDA Portal for European Research Data Discovery John Shepherdson - UKDA Pascal Heus - Metadata Technology Ørnulf Risnes - NSD

Jenkins  Dashboard  

Page 15: DwB Discovery Portal · DwB Discovery Portal A New CESSDA Portal for European Research Data Discovery John Shepherdson - UKDA Pascal Heus - Metadata Technology Ørnulf Risnes - NSD

Jenkins  Job  Details  

Page 16: DwB Discovery Portal · DwB Discovery Portal A New CESSDA Portal for European Research Data Discovery John Shepherdson - UKDA Pascal Heus - Metadata Technology Ørnulf Risnes - NSD

Jenkins  Job  Details  

Page 17: DwB Discovery Portal · DwB Discovery Portal A New CESSDA Portal for European Research Data Discovery John Shepherdson - UKDA Pascal Heus - Metadata Technology Ørnulf Risnes - NSD

Metadata  harmoniza9on  •  Standard  level  -­‐  sources  based  on  the  various  metadata  

standards  •  Version  level  -­‐  within  a  standard,  the  use  of  different  versions  

(e.g.  DDI  1.2.2,  2.5,  3.x)  •  Template/flavour  level  -­‐  the  use  of  elements  of  the  standard  

for  different  purposes;  presence/absence  of  op@onal  elements  Ø driven  by  ins@tu@onal  prac@ces,  templates,  or  sopware  tooling  

Page 18: DwB Discovery Portal · DwB Discovery Portal A New CESSDA Portal for European Research Data Discovery John Shepherdson - UKDA Pascal Heus - Metadata Technology Ørnulf Risnes - NSD

Typical  Console  Output  (Captured)  

Page 19: DwB Discovery Portal · DwB Discovery Portal A New CESSDA Portal for European Research Data Discovery John Shepherdson - UKDA Pascal Heus - Metadata Technology Ørnulf Risnes - NSD

What  next?  •  Perform  provider/format  specific  transforma@ons  

•  Apply  DwB  specific  adjustments  (iden@fiers,  system  metadata,  etc.)  

•  Apply  DwB  harmonizers  (map  metadata  in  to  DwB  standard  facets/CV  etc.)  

•  Load  harmonized  DwB-­‐RDF  in  to  Virtuoso  RDF  database  

•  Index  DwB-­‐RDF  with  Solr    

Page 20: DwB Discovery Portal · DwB Discovery Portal A New CESSDA Portal for European Research Data Discovery John Shepherdson - UKDA Pascal Heus - Metadata Technology Ørnulf Risnes - NSD

What  next?  

•  Producing  various  inges@on  /  QA  reports  •  Propagate  deletes  for  survey  that  have  been  dropped  

•  Synchronize  various  metadata  files  to  repository  Ø For  ‘before  and  aper’  comparisons/provider  feedback  

 

Page 21: DwB Discovery Portal · DwB Discovery Portal A New CESSDA Portal for European Research Data Discovery John Shepherdson - UKDA Pascal Heus - Metadata Technology Ørnulf Risnes - NSD

Any  Ques9ons?