Dh2014 e mopcobre-complete

45
Distributed “Forms of Attention”: eMOP and the CobreTool Anton duPlessis, Laura Mandell, James Creel, and Alexy Maslov Texas A&M University DH2014 July 10, 2014

description

James Creel, Anton DuPlessis, and Laura Mandell presented the Cobre Tool at Digital Humanities 2014 in Lausanne Switzerland.

Transcript of Dh2014 e mopcobre-complete

Page 1: Dh2014 e mopcobre-complete

Distributed “Forms of Attention”:eMOP and the CobreToolAnton duPlessis, Laura Mandell, James Creel, and Alexy

MaslovTexas A&M University

DH2014 July 10, 2014

Page 2: Dh2014 e mopcobre-complete

Introduction

Distributed Reading

Page 3: Dh2014 e mopcobre-complete
Page 4: Dh2014 e mopcobre-complete
Page 5: Dh2014 e mopcobre-complete

Causer, T., J. Tonra, and V. Wallace. “Transcription Maximized; Expense Minimized?: Crowdsourcing and editing The Collected Works of Jeremy Bentham.” Literary and Linguistic Computing 27.2 (2012), pp. 119-137. Causer, T., and V. Wallace. “Building a Volunteer Community: Results and Findings from Transcribe Bentham.” Digital Humanities Quarterly 6.1 (2012). http://www.digitalhumanities.org/dhq/vol/6/2/000125/000125.html.

Gibbs, Frederick W. “New Textual Traditions from Community Transcription.” Digital Medievalist 7 (2011). http://www.digitalmedievalist.org/journal/7/gibbs/

Holley, Rose. “How Good Can It Get: Analysing and Improving OCR Accuracy in Large Scale Historic Newspaper Digitisation Programs.” D-Lib Magazine 1.3/4 (2009).

---. “Many Hands Make Light Work.” March 2009. National Library of Australia. ISBN 978‐0‐642‐27694‐0

Crowdsourcing

Page 6: Dh2014 e mopcobre-complete

Reading

Guillory, John. “Close Reading: Prologue and Epilogue,” ADE Bulletin 149 (2010): 8-14. Hayles, N. Katherine. “Hyper and Deep Attention: The Generational Divide in Cognitive Models,” Profession 2007: 187-199.

Commentary

vs.

Contribution

Bruno Latour {

Page 7: Dh2014 e mopcobre-complete

Cobre: Overview

• Developed for Los Primeros Libros Project• an international collaboration to digitize and provide access to 16th Century

New World imprints (1539 – 1600)• http://primeroslibros.org• http://libros.library.tamu.edu

• Create opportunities for academic investigation and instruction• Interface leverages scrolling filmstrip view of tiled thumbnails • Magnification and comparison tools facilitate detailed examination• View and compare multiple exemplars of the same work that would be

impossible with the physical books• Compare state, emission, edition, etc. of an exemplar• Examine variations in print, missing / obstructed text, missing / misnumbered /misbound /

damaged pages, fire marks, marginalia and other copy specific attributes• Synchronous examination of multiple books permits parallel comparison

Page 8: Dh2014 e mopcobre-complete

Cobre: Suite of Tools

• Reading Tools– Book View– Reading View– Detailed View– Repository View– Comparison View

• Quick Comparison View • Annotations

– Structural • table of contents

– Non-structural • copy specific features

– Transcription • capability to view and correct the

OCR output of a text

• Editing Tools– Basic– Canonical

• abstract construct that permits alignment of different exemplars of the same work by leveraging the structural metadata

– Frankenbook• application of the canonical

construct using images drawn from any exemplar(s) to replace the placeholders to create custom editions via a drag and drop method

Page 9: Dh2014 e mopcobre-complete

Cobre: Book View

Page 10: Dh2014 e mopcobre-complete

Cobre: Dspace View

Page 11: Dh2014 e mopcobre-complete

Cobre: Detailed View

Page 12: Dh2014 e mopcobre-complete

Cobre: Transcription Tool

Page 13: Dh2014 e mopcobre-complete

Cobre: Annotation Tool

Page 14: Dh2014 e mopcobre-complete

Cobre: Comparative View

Page 15: Dh2014 e mopcobre-complete

Cobre: Quick Comparative View

Page 16: Dh2014 e mopcobre-complete

Cobre: The Advantage

Page 17: Dh2014 e mopcobre-complete

New Features supporting transcription for eMOP

• A systematic workflow for getting EEBO and ECCO content and metadata into Cobre

• Accept existing OCR text as transcriptions in XML import

• Editors for human transcription/revision of pages

• Addition of transcriptions to XML export

Page 18: Dh2014 e mopcobre-complete

New Ingestion Workflow

Page 19: Dh2014 e mopcobre-complete

The Bitstream Metadata Bitstream (BMB)

• DSpace does not support bitstream (i.e. file) level metadata of the detail required for annotation and transcription.

• We include an additional bitstream that contains metadata about the page-image bitsreams – the Bitstream Metadata Bitstream

• The BMB is an XML file with “chunks” that describe one or more pages.

Page 20: Dh2014 e mopcobre-complete

Accepting transcriptions from the BMB XML – a view in the DSpace Source

A file attached to the item

Example snippet of its contents

Page 21: Dh2014 e mopcobre-complete

OCR text in the BMB accepted upon harvest into Cobre

Page 22: Dh2014 e mopcobre-complete

Transcription Editor

Invoked with a click

Page 23: Dh2014 e mopcobre-complete

Transcription Editor on Detail View

Page 24: Dh2014 e mopcobre-complete

Transcription Editor on Comparison View

Page 25: Dh2014 e mopcobre-complete

Vetting Transcriptions

Administrative users can indicate whether a transcription is vetted as acceptable

Page 26: Dh2014 e mopcobre-complete

Vetted Transcriptions will appear in BMB XML export

Click this

And export these

Page 27: Dh2014 e mopcobre-complete

Results of Usability Studies

Confusions

Page 28: Dh2014 e mopcobre-complete
Page 29: Dh2014 e mopcobre-complete
Page 30: Dh2014 e mopcobre-complete
Page 31: Dh2014 e mopcobre-complete
Page 32: Dh2014 e mopcobre-complete
Page 33: Dh2014 e mopcobre-complete
Page 34: Dh2014 e mopcobre-complete
Page 35: Dh2014 e mopcobre-complete
Page 36: Dh2014 e mopcobre-complete
Page 37: Dh2014 e mopcobre-complete
Page 38: Dh2014 e mopcobre-complete
Page 39: Dh2014 e mopcobre-complete
Page 40: Dh2014 e mopcobre-complete
Page 41: Dh2014 e mopcobre-complete
Page 42: Dh2014 e mopcobre-complete
Page 43: Dh2014 e mopcobre-complete
Page 44: Dh2014 e mopcobre-complete
Page 45: Dh2014 e mopcobre-complete