Historic Maryland Newspapers Project · 2015-09-01 · Issue • XML file that contains issue- and...
Historic Maryland Newspapers Project
Presentation for Digital Maryland Conference 2014
March 7, 2014
Elizabeth M. Caringola, Historic Maryland Newspapers Librarian
Digital Programs and Initiatives, Digital Systems and Stewardship
Introduction to the NDNP
• The National Digital Newspaper Program (NDNP) is a joint effort by the National Endowment for the Humanities (NEH) and the Library of Congress (LC) to digitize historic newspapers from every U.S. state and territory
• The goal is to create “an Internet-based, searchable database of U.S. newspapers with descriptive information and select digitization of historic pages”
• Each state/territory can be awarded an NDNP grant to digitize newspapers published between 1836 and 1922
• Newspapers are digitized from a second-generation duplicate of the camera master microfilm
• During each 2-year grant cycle, awardee institutions must deliver 100,000 digitized pages to LC for upload to Chronicling America
Selecting newspapers for digitization
Content criteria
• Research value
• Geographic representation
• Temporal coverage
• Orphan titles
• Diversity
• Online availability
Microfilm
• Technical quality
• Bibliographic completeness of the microfilm copy
The NDNP content selection guidelines ensure that relevant titles and suitable microfilm are chosen for digitization
Technical specifications: Images
• Technical targets
• Master image
– uncompressed TIFF 6.0
– 8-bit grayscale
– 300-400 dpi
• Use images
– JPEG2000
– PDF that supports full-text search
Image caption: The preservation target that we use, from Image Science Associates. See http://www.imagescienceassociates.com/mm5/merchant.mvc?Screen=PROD&Store_Code=ISA001&Product_Code=MPTC&Category_Code=TARGETS for more info.
NDNP Technical Guidelines for 2012 Awards
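As a rough illustration of the master image requirements listed on this slide, the sketch below checks a description of a scan against them. The `ImageInfo` class, its field names, and the `master_image_problems` function are hypothetical and not part of any NDNP tool; in practice the values would be read from the TIFF header by an imaging library.

```python
from dataclasses import dataclass


@dataclass
class ImageInfo:
    """Hypothetical summary of a scanned master image (names are assumptions)."""
    file_format: str       # e.g. "TIFF 6.0"
    compression: str       # e.g. "none"
    bits_per_sample: int   # e.g. 8
    color_space: str       # e.g. "grayscale"
    dpi: int               # scanning resolution


def master_image_problems(info: ImageInfo) -> list[str]:
    """Return a list of ways the image departs from the slide's master image spec."""
    problems = []
    if info.file_format != "TIFF 6.0":
        problems.append(f"format is {info.file_format}, expected TIFF 6.0")
    if info.compression != "none":
        problems.append(f"compression is {info.compression}, expected uncompressed")
    if info.bits_per_sample != 8 or info.color_space != "grayscale":
        problems.append("expected 8-bit grayscale")
    if not 300 <= info.dpi <= 400:
        problems.append(f"resolution is {info.dpi} dpi, expected 300-400 dpi")
    return problems


good = ImageInfo("TIFF 6.0", "none", 8, "grayscale", 400)
bad = ImageInfo("TIFF 6.0", "LZW", 8, "grayscale", 200)
print(master_image_problems(good))
print(master_image_problems(bad))
```

An empty list means the description satisfies all four points on the slide; each string in a non-empty list names one mismatch.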
Technical specifications: Metadata
• Full and up-to-date Cooperative Online Serials (CONSER) bibliographic record at the title level for the print newspaper
• Issue- and page-level metadata
• Reel metadata
• All metadata is delivered in METS object structure according to an XML template
NDNP Technical Guidelines for 2012 Awards
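To give a feel for what a METS object structure looks like, here is a minimal sketch that builds an issue-level skeleton with Python's standard library. It is a simplification and not the NDNP XML template: the `build_issue_skeleton` function, the `ID` values, and the sample date are assumptions made for illustration, and the output has not been validated against the METS schema.

```python
import xml.etree.ElementTree as ET

# The METS namespace published by the Library of Congress.
METS_NS = "http://www.loc.gov/METS/"
ET.register_namespace("mets", METS_NS)


def build_issue_skeleton(issue_date: str, page_count: int) -> ET.Element:
    """Build a bare METS outline: one issue section plus one section per page.

    The ID values below are hypothetical placeholders, not NDNP identifiers.
    """
    root = ET.Element(
        f"{{{METS_NS}}}mets",
        {"TYPE": "issue", "LABEL": f"Issue of {issue_date}"},
    )
    # Descriptive metadata section for the issue itself.
    ET.SubElement(root, f"{{{METS_NS}}}dmdSec", {"ID": "issueMetadata"})
    # One descriptive metadata section per page.
    for n in range(1, page_count + 1):
        ET.SubElement(root, f"{{{METS_NS}}}dmdSec", {"ID": f"pageMetadata{n}"})
    # File inventory and structural map, left empty in this sketch.
    ET.SubElement(root, f"{{{METS_NS}}}fileSec")
    ET.SubElement(root, f"{{{METS_NS}}}structMap")
    return root


issue = build_issue_skeleton("1858-02-22", page_count=4)  # sample date is made up
xml_text = ET.tostring(issue, encoding="unicode")
print(xml_text)
```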
Technical specifications: OCR
Optical character recognition (OCR) is captured for every page that is digitized
• ALTO XML schema captures the content and position of printed text
• Allows for full-text search and highlighting of search terms
NDNP Technical Guidelines for 2012 Awards
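The link between stored word positions and search-term highlighting can be sketched as follows. The sample XML is a hand-written, heavily simplified fragment in the style of ALTO (no namespace, invented coordinates), and `find_term_boxes` is a hypothetical helper, not part of Chronicling America. It relies only on the `CONTENT`, `HPOS`, `VPOS`, `WIDTH`, and `HEIGHT` attributes that ALTO defines on `String` elements.

```python
import xml.etree.ElementTree as ET

# Simplified, made-up fragment; real ALTO files carry a namespace and more detail.
SAMPLE_ALTO = """
<alto>
  <Layout>
    <Page ID="P1" WIDTH="6000" HEIGHT="8000">
      <PrintSpace>
        <TextBlock ID="TB1">
          <TextLine>
            <String CONTENT="Baltimore" HPOS="120" VPOS="340" WIDTH="410" HEIGHT="60"/>
            <String CONTENT="harbor" HPOS="560" VPOS="340" WIDTH="280" HEIGHT="60"/>
          </TextLine>
          <TextLine>
            <String CONTENT="BALTIMORE," HPOS="120" VPOS="420" WIDTH="430" HEIGHT="60"/>
          </TextLine>
        </TextBlock>
      </PrintSpace>
    </Page>
  </Layout>
</alto>
"""


def find_term_boxes(alto_xml: str, term: str) -> list[tuple[float, float, float, float]]:
    """Return (hpos, vpos, width, height) for each word matching the search term."""
    root = ET.fromstring(alto_xml.strip())
    wanted = term.lower()
    boxes = []
    for el in root.iter():
        # Compare on the local tag name so namespaced files also work.
        if el.tag.rsplit("}", 1)[-1] != "String":
            continue
        # Case-insensitive match, ignoring surrounding punctuation.
        word = el.get("CONTENT", "").strip(".,;:!?\"'()").lower()
        if word == wanted:
            boxes.append(tuple(float(el.get(k)) for k in ("HPOS", "VPOS", "WIDTH", "HEIGHT")))
    return boxes


baltimore_boxes = find_term_boxes(SAMPLE_ALTO, "baltimore")
print(baltimore_boxes)
```

Each returned rectangle is what a viewer would draw over the page image to highlight a hit.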
Summary of digitized content
Page
• Image files: TIFF, JPEG2000, PDF
• XML file that contains OCR
Issue
• XML file that contains issue- and page-level metadata
Reel
• Image files for preservation targets and microfilm targets
• XML file that contains reel technical metadata and metadata for targets
Batch
• A batch manifest lists all reels and issues included in the batch
NDNP Technical Guidelines for 2012 Awards
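The page, issue, reel, and batch levels summarized on this slide can be modeled as a small data structure. This is a sketch under stated assumptions: the class names, the file-naming pattern, and the manifest line format are all hypothetical and do not reproduce the actual NDNP batch layout.

```python
from dataclasses import dataclass, field


@dataclass
class Page:
    sequence: int

    def files(self) -> list[str]:
        """Three image files plus the OCR XML file (naming pattern is assumed)."""
        stem = f"{self.sequence:04d}"
        return [f"{stem}.tif", f"{stem}.jp2", f"{stem}.pdf", f"{stem}.xml"]


@dataclass
class Issue:
    date: str
    pages: list[Page] = field(default_factory=list)


@dataclass
class Reel:
    reel_id: str


@dataclass
class Batch:
    name: str
    reels: list[Reel] = field(default_factory=list)
    issues: list[Issue] = field(default_factory=list)

    def manifest(self) -> list[str]:
        """List every reel and issue in the batch, one line each."""
        lines = [f"reel {r.reel_id}" for r in self.reels]
        lines += [f"issue {i.date} ({len(i.pages)} pages)" for i in self.issues]
        return lines


# All identifiers and dates below are invented sample values.
batch = Batch(
    name="sample_batch",
    reels=[Reel("reel-0001")],
    issues=[Issue("1858-02-22", [Page(1), Page(2), Page(3), Page(4)])],
)
print(batch.manifest())
print(batch.issues[0].pages[0].files())
```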
Historic Maryland Newspapers Project
• UMD Libraries joined the NDNP during the 2012-2014 award period
• To date:
– 35,916 pages of Maryland newspapers are live on Chronicling America
– 46,763 pages at LC awaiting ingest
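For a sense of scale, the two figures above can be set against the 100,000-page requirement mentioned earlier in the presentation. The calculation assumes that both the live pages and the pages awaiting ingest count toward that deliverable, which the slides do not state explicitly.

```python
REQUIRED_PAGES = 100_000    # deliverable per 2-year grant cycle
live_pages = 35_916         # live on Chronicling America
awaiting_ingest = 46_763    # at LC awaiting ingest

# Assumption: both groups count as delivered to LC.
delivered = live_pages + awaiting_ingest
remaining = REQUIRED_PAGES - delivered
percent = delivered / REQUIRED_PAGES * 100

print(f"Delivered: {delivered:,} pages ({percent:.1f}%)")
print(f"Remaining: {remaining:,} pages")
```

Under that assumption, 82,679 pages had been delivered and 17,321 remained.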
Titles selected for digitization, 2012-2014
| Title | Publication location | Years to digitize |
| --- | --- | --- |
| American Republican and Baltimore daily clipper | Baltimore, Md. | 1844-1846 |
| Baltimore commercial journal, and Lyford's price-current | Baltimore, Md. | 1840-1849 |
| Baltimore daily commercial | Baltimore, Md. | 1865-1867 |
| Civilian & telegraph | Cumberland, Md. | 1859-1875 |
| The daily exchange | Baltimore, Md. | 1858-1861 |
| Der deutsche Correspondent | Baltimore, Md. | 1858-1918 |
| The pilot and transcript | Baltimore, Md. | 1840-1841 |
| Maryland free press | Hagerstown, Md. | 1862-1868 |
Titles selected for digitization, 2014-2016
| Title | Publication location | Years to digitize |
| --- | --- | --- |
| The aegis & intelligencer | Bel Air | 1864-1922 |
| The Baltimore daily news | Baltimore | 1885-1892(?) |
| Calvert gazette | Prince Frederick | 1885-1922 |
| Calvert journal | Prince Frederick | 1867-1922 |
| Catoctin clarion | Mechanicsville | 1871-1922 |
| The Cecil Democrat | Elkton | 1850-1922 |
| The citizen | Frederick | 1895-1922 |
| The Cumberland daily news | Cumberland | 1871-1890 |
| The daily banner | Cambridge | 1902-1922 |
| Democratic messenger | Snow Hill | 1869-1922 |
| Frederick herald | Frederick | 1832-1861 |
| Frostburg mining journal | Frostburg | 1871-1913 |
| Havre de Grace Republican | Havre de Grace | 1881-1922 |
| The leader | Laurel | 1897-1922 |
| Montgomery County sentinel | Rockville | 1856-1922 |
| The Republican citizen | Frederick | 1836-1890 |
| St. Mary's beacon | Leonard Town | 1845-1863 |
| St. Mary's gazette | Leonard Town | 1863-1867 |
| Saint Mary's beacon | Leonard Town | 1867-1922 |
Why the NDNP?
• Standard for newspaper digitization
• Chronicling America is a free, national database, and it openly shares its data
• LC preserves the master TIFF files and microfilm in perpetuity
• Awardees can use the digitized images and metadata for their own projects/repositories
Resources
• National Endowment for the Humanities
– National Digital Newspaper Program, http://www.neh.gov/grants/preservation/national-digital-newspaper-program
• Library of Congress
– Chronicling America, http://chroniclingamerica.loc.gov/
– National Digital Newspaper Program, http://www.loc.gov/ndnp/
– Content Selection Criteria, http://www.loc.gov/ndnp/guidelines/selection.html
– Technical Guidelines for 2012 Awards, http://www.loc.gov/ndnp/guidelines/archive/guidelines1213.html
• Historic Maryland Newspapers Project at UMD Libraries
– Project website, http://digital.lib.umd.edu/newspapers
– Blogs
  • DigiStew, Division of Digital Systems and Stewardship, http://dssumd.wordpress.com/
  • Special Collections, http://hornbakelibrary.wordpress.com/