Goobi at the Wellcome Library: Current Work and New Developments
![Page 1: Goobi at the Wellcome Library: Current Work and New Developments](https://reader031.fdocuments.us/reader031/viewer/2022021922/5870044d1a28ab427f8b5b15/html5/thumbnails/1.jpg)
Rioghnach Ahern, Digital Ingest Coordinator, 18th of November, 2016.
UK Goobi User Meeting
Current Work and New Developments
![Page 2: Goobi at the Wellcome Library: Current Work and New Developments](https://reader031.fdocuments.us/reader031/viewer/2022021922/5870044d1a28ab427f8b5b15/html5/thumbnails/2.jpg)
The vision for the Wellcome Library’s digital engagement programme is to create the world’s largest free and unrestricted
digital library focused on the cultural contexts of health.
![Page 3: Goobi at the Wellcome Library: Current Work and New Developments](https://reader031.fdocuments.us/reader031/viewer/2022021922/5870044d1a28ab427f8b5b15/html5/thumbnails/3.jpg)
Digitisation at the Wellcome Library
• The Wellcome Library is developing a world-class online resource for the history of medicine by digitising a substantial proportion of its holdings & making the content freely available on the web.
• We will also strive to include important content from other institutions, which complements our own holdings, & to explore commercial partnerships for cost-effective digitisation of other parts of our collections.
• The Wellcome Library began its digitisation programme in 2010; its ambition is to make freely available over 50 million pages of historic medical books, archives, manuscripts & journals by 2020.
![Page 4: Goobi at the Wellcome Library: Current Work and New Developments](https://reader031.fdocuments.us/reader031/viewer/2022021922/5870044d1a28ab427f8b5b15/html5/thumbnails/4.jpg)
27.8 million images ingested so far!
![Page 5: Goobi at the Wellcome Library: Current Work and New Developments](https://reader031.fdocuments.us/reader031/viewer/2022021922/5870044d1a28ab427f8b5b15/html5/thumbnails/5.jpg)
Digitised Material Types
• Monographs – 93,719
• Archives – 40,010
• Reports – 5,818
• Artworks – 3,709
• Audio-visual – 1,096
• Manuscripts – 805
• Journals – 260
![Page 6: Goobi at the Wellcome Library: Current Work and New Developments](https://reader031.fdocuments.us/reader031/viewer/2022021922/5870044d1a28ab427f8b5b15/html5/thumbnails/6.jpg)
On-going Projects
• UK Medical Heritage Library
• UK Medical Officer of Health Reports
• Mental Health Care Archives
• Medieval Manuscripts
• Recipe Books
• Royal Army Medical Corps Archives
• Ancestry
• Visual Arts Discovery
![Page 7: Goobi at the Wellcome Library: Current Work and New Developments](https://reader031.fdocuments.us/reader031/viewer/2022021922/5870044d1a28ab427f8b5b15/html5/thumbnails/7.jpg)
UK Medical Heritage Library
• Wellcome Library
• Royal College of Physicians of London
• Royal College of Physicians of Edinburgh
• Royal College of Surgeons of England
• UCL (University College London)
• University of Leeds
• University of Glasgow
• London School of Hygiene & Tropical Medicine
• King's College London
• University of Bristol
![Page 8: Goobi at the Wellcome Library: Current Work and New Developments](https://reader031.fdocuments.us/reader031/viewer/2022021922/5870044d1a28ab427f8b5b15/html5/thumbnails/8.jpg)
![Page 9: Goobi at the Wellcome Library: Current Work and New Developments](https://reader031.fdocuments.us/reader031/viewer/2022021922/5870044d1a28ab427f8b5b15/html5/thumbnails/9.jpg)
![Page 10: Goobi at the Wellcome Library: Current Work and New Developments](https://reader031.fdocuments.us/reader031/viewer/2022021922/5870044d1a28ab427f8b5b15/html5/thumbnails/10.jpg)
![Page 11: Goobi at the Wellcome Library: Current Work and New Developments](https://reader031.fdocuments.us/reader031/viewer/2022021922/5870044d1a28ab427f8b5b15/html5/thumbnails/11.jpg)
FTP’ing of Third Party Content
• The FTP process is linked to Goobi so that processing of the content is automated.
• A dedicated workflow was created for Goobi which monitors the FTP server for the arrival of new data.
• The workflow automatically virus-checks & quarantines the content over a 24-hour period before uploading it into Goobi.
• If content fails the virus check, or the image folders don’t match the metadata in Goobi, it goes into a “suspicious” folder.
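The routing described on this slide can be sketched as a small decision function. This is only an illustration, not the Wellcome Library's or Goobi's actual implementation: the `Delivery` record, the `triage` function, and the idea that a folder "matches" when its name equals a known Goobi process identifier are all assumptions made for the example. Only the 24-hour quarantine and the "suspicious" outcome come from the slide.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from enum import Enum

# The 24-hour quarantine period mentioned on the slide.
QUARANTINE = timedelta(hours=24)


class Decision(Enum):
    WAIT = "wait"              # still inside the quarantine window
    UPLOAD = "upload"          # safe to hand over to Goobi
    SUSPICIOUS = "suspicious"  # move to the "suspicious" folder


@dataclass
class Delivery:
    """Hypothetical record for one image folder that arrived over FTP."""
    folder_name: str
    arrived_at: datetime
    virus_check_passed: bool


def triage(delivery: Delivery, known_process_ids: set, now: datetime) -> Decision:
    """Decide what to do with a delivered folder.

    Assumption: a folder matches the metadata in Goobi when its name
    equals an identifier in known_process_ids.
    """
    if not delivery.virus_check_passed:
        return Decision.SUSPICIOUS
    if delivery.folder_name not in known_process_ids:
        return Decision.SUSPICIOUS
    if now - delivery.arrived_at < QUARANTINE:
        return Decision.WAIT
    return Decision.UPLOAD
```

A scheduled job could call `triage` for each folder on the FTP server and move or upload the folder according to the returned `Decision`.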
![Page 12: Goobi at the Wellcome Library: Current Work and New Developments](https://reader031.fdocuments.us/reader031/viewer/2022021922/5870044d1a28ab427f8b5b15/html5/thumbnails/12.jpg)
![Page 13: Goobi at the Wellcome Library: Current Work and New Developments](https://reader031.fdocuments.us/reader031/viewer/2022021922/5870044d1a28ab427f8b5b15/html5/thumbnails/13.jpg)
![Page 14: Goobi at the Wellcome Library: Current Work and New Developments](https://reader031.fdocuments.us/reader031/viewer/2022021922/5870044d1a28ab427f8b5b15/html5/thumbnails/14.jpg)
![Page 15: Goobi at the Wellcome Library: Current Work and New Developments](https://reader031.fdocuments.us/reader031/viewer/2022021922/5870044d1a28ab427f8b5b15/html5/thumbnails/15.jpg)
Sources of digitised content
[Diagram. Labels: Goobi (METS/OCR); Preservica/DLCS; In-house; Institutions; Contractors; Harvesting; Grey literature; TIFF or JP2; TIFF or JP2, HD & FTP; Normalises TIFF to JP2; Manual; Automatic; Jpylyzer validates JP2; Auto harvesting of JP2 & DMD; Project Managers / Ingest Officer; Project Managers; Ingest Officer / Digital Curator; Snagging.]
![Page 16: Goobi at the Wellcome Library: Current Work and New Developments](https://reader031.fdocuments.us/reader031/viewer/2022021922/5870044d1a28ab427f8b5b15/html5/thumbnails/16.jpg)
![Page 17: Goobi at the Wellcome Library: Current Work and New Developments](https://reader031.fdocuments.us/reader031/viewer/2022021922/5870044d1a28ab427f8b5b15/html5/thumbnails/17.jpg)
![Page 18: Goobi at the Wellcome Library: Current Work and New Developments](https://reader031.fdocuments.us/reader031/viewer/2022021922/5870044d1a28ab427f8b5b15/html5/thumbnails/18.jpg)
Benefits of Automation
• Reduced quarantine time for FTP’d content speeds up projects considerably.
• Automation has upped our throughput.
• Fewer human clicks in general.
• Less ingest resource required.
![Page 19: Goobi at the Wellcome Library: Current Work and New Developments](https://reader031.fdocuments.us/reader031/viewer/2022021922/5870044d1a28ab427f8b5b15/html5/thumbnails/19.jpg)
Future Developments
• Improve the matching process for automated image upload.
• This would decrease the number of manual image upload tasks required.
• Automate the METS edition task for non-sensitive and open content.
• Create Rulesets for automating the METS edition of content according to publication year.
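One way to picture a rule keyed on publication year is a small lookup table. This is a rough sketch only: the slide gives no cutoff years or outcomes, so the years, the action names, and the `YearRule` and `action_for` helpers below are invented for illustration, and this is not Goobi's own ruleset file format.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class YearRule:
    """Hypothetical rule: applies to publication years in [first_year, last_year]."""
    first_year: Optional[int]  # None means no lower bound
    last_year: Optional[int]   # None means no upper bound
    action: str


# Invented example values; the real cutoffs are not given in the slides.
HYPOTHETICAL_RULES = [
    YearRule(None, 1899, "auto-complete"),
    YearRule(1900, None, "manual-review"),
]


def action_for(year: Optional[int], rules: List[YearRule],
               default: str = "manual-review") -> str:
    """Return the action of the first rule covering the year.

    Items with no known publication year fall back to the default,
    so nothing is automated without a date to check against.
    """
    if year is None:
        return default
    for rule in rules:
        after_start = rule.first_year is None or year >= rule.first_year
        before_end = rule.last_year is None or year <= rule.last_year
        if after_start and before_end:
            return rule.action
    return default
```

Keeping the rules as data rather than as branching code means a cutoff can change without touching the lookup logic.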
![Page 20: Goobi at the Wellcome Library: Current Work and New Developments](https://reader031.fdocuments.us/reader031/viewer/2022021922/5870044d1a28ab427f8b5b15/html5/thumbnails/20.jpg)
Medical Officer of Health Reports
• Parsing the harvest of Medical Officer of Health reports and the UK Medical Heritage Library project.
• Downloading rights metadata directly from the Internet Archive.
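As a rough sketch of what downloading rights metadata could look like, the Internet Archive exposes a JSON record per item at `https://archive.org/metadata/<identifier>`. The slides do not say which fields the project reads, so the field names in `RIGHTS_FIELDS` are an assumption (they are fields that commonly appear on Internet Archive items), and `fetch_record` and `extract_rights` are illustrative helper names.

```python
import json
from urllib.request import urlopen

# Assumed rights-related fields; which ones the project uses is not stated.
RIGHTS_FIELDS = ("rights", "licenseurl", "possible-copyright-status")


def fetch_record(identifier: str) -> dict:
    """Download the JSON metadata record for one Internet Archive item."""
    with urlopen(f"https://archive.org/metadata/{identifier}") as response:
        return json.load(response)


def extract_rights(record: dict) -> dict:
    """Keep only the rights-related fields that are present in the record."""
    metadata = record.get("metadata", {})
    return {name: metadata[name] for name in RIGHTS_FIELDS if name in metadata}
```

Splitting the download from the extraction lets the extraction step be checked against a saved record without network access.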
![Page 21: Goobi at the Wellcome Library: Current Work and New Developments](https://reader031.fdocuments.us/reader031/viewer/2022021922/5870044d1a28ab427f8b5b15/html5/thumbnails/21.jpg)
Thank you
@RioghnachAhern Linkedin.com/rioghnachahern