Archiving and Preservation

MISSING IMAGE

Material Information

Title:
Archiving and Preservation
Series Title:
dLOC Advanced Topics Training Institute
Physical Description:
Training slides
Language:
English
Creator:
Sullivan, Mark V.
Publisher:
University of Florida Libraries / Digital Library of the Caribbean
Place of Publication:
Gainesville, Florida
Publication Date:

Subjects

Subjects / Keywords:
Digitization
Training
SobekCM
dLOC Training Presentation

Notes

Acquisition:
Resource was uploaded and editing during training for dLOC partners in Gainesville, FL on 7/29/2013. Submitted by Mark Sullivan.
General Note:
Slides for dLOC partner training

Record Information

Source Institution:
University of Florida
Holding Location:
University of Florida
Rights Management:

The author dedicated the work to the Commons by waiving all of his or her rights to the work worldwide under copyright law and all related or neighboring legal rights he or she had in the work, to the extent allowable by law.
System ID:
AA00016215:00004


This item is only available as the following downloads:


Full Text

PAGE 1

Mark Sullivan Digital Library of the Caribbean

PAGE 2

2 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 3

Digitization: One Puzzle Piece Digitization is one piece of a digital preservation plan, but only one piece Without the rest of the puzzle, digitization is ultimately useless for preserving and archiving our historical and cultural resources Include Digital Preservation and full lifecycle of the image and data in all digitization plans. dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 3

PAGE 4

Preservation and Long Term Access Through Networked Services Project Survey Surveyed over 200 organizations, mostly European archives and libraries Digital content volume to increase 25 fold over next decade 70% of respondents expect to hold over 100TB of content Only 47% of organizations had an allocated budget for digital preservation ( http://www.planets project.eu/publications ) Large Scale Digitization dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 4

PAGE 5

Digital preservation is: Active Constant process Exponentially multi tiered About ensuring continuing accessibility Relatively new and somewhat in its infancy dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 5

PAGE 6

Compact Disks 2002 2008 23,000 CD/DVD ( 48 TB ) Unmanageable Slowly deteriorating Two (or 3 or 4) copies dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 6

PAGE 7

Table of Contents What to preserve? How do I know it is preserved? Object level Preservation File level Preservation Organizational: Trustworthy Digital Repositories Recommendations Questions dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 7

PAGE 8

How do I know it is preserved? dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 8

PAGE 9

What to preserve? How do I know it is preserved? Text level preservation? Aujourd'hui premier janvier dix huit cent quatre, le Gnral en chef de l'Arme indigne, accompagn des gnraux, chefs de l'arme, convoqus l'effet de prendre les mesures qui doivent tendre au bonheur du pays: Aprs avoir fait connatre aux gnraux assembls ses vritables intentions d'assurer jamais aux indignes d'Hati un gouvernement stable, objet de sa plus vive sollicitude : ce qu'il a fait un discours qui tend faire connatre aux puissances trangres la rsolution de rendre le pays indpendant, et de jouir d'une libert consacre par le sang du peuple de cette le ; et, aprs avoir recueilli les avis, a demand que chacun des gnraux assembls pronont le serment de renoncer jamais la France, de mourir plutt que de vivre sous sa domination, et de combattre jusqu'au dernier soupir pour l'indpendance. dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 9

PAGE 10

What to preserve? How do I know it is preserved? Text level preservation? Today, January 1, 1804, the General in Chief of the Indigenous Army, accompanied by generals and army chiefs convoked in order to take measures tending to the happiness of the country: After having made known to the assembled generals his true intention of forever ensuring to the natives of Haiti a stable government the object of his greatest solicitude, which he did in a speech that made known to foreign powers the resolution to render the country independent, and to enjoy the liberty consecrated by the blood of the people of this island; and, after having gathered their opinions, asked each of the assembled generals to pronounce a vow to forever renounce France; to die rather than to live under its domination; and to fight for independence with their last breath. dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 10

PAGE 11

What to preserve? How do I know it is preserved? dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 11

PAGE 12

What to preserve? How do I know it is preserved? document, data set, or sound or video recording that should be preserved, not the Microsoft Word file, or the Excel spreadsheet, Information Standards Quarterly Spring 2010. Also the context and layout though. dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 12

PAGE 13

dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 13

PAGE 14

Object Level Preservation More than just the images and text (but also!) Descriptive Information Describes the intellectual entity ( title, author ) Same information used for discovery and display of object Structural Information Table of contents, page order File Information File size Checksums and digital signatures for fixity and authenticity checks Administrative Information Provenance and access conditions for the object dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 14

PAGE 15

Object Level Metadata METS M etadata E ncoding and T ransmission S tandard Descriptive Metadata Container MODS ( Metadata Object Description Standard ) Dublin Core VRACore ( Visual Resource Association ) MarcXML Structural Metadata Technical File Metadata Administrative Metadata dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 15

PAGE 16

PREMIS PREservation Metadata : Implementation Strategies Metadata needed to support preservation ( Dappert and Enders ) HUGE data dictionary must pick and choose Describes historical preservation actions/agents Provides information needed for any future preservation actions Can be encoded in METS file dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 16

PAGE 17

PREMIS dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 17

PAGE 18

dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 19

PAGE 19

Necessary Preservation File Information Example PREMIS Image Record Checksums and Size Preservation Level ( full, bitwise, etc.. ) Behaviors Format ( MIME, Format Registry, Validation Info ) Creating Applications Capture Information (MIX) Environments ( software, operating system, processor, hardware ) All migrations events dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 20

PAGE 20

File Format Migration / Emulation Any sound preservation strategy must deal with the ephemeral nature of file formats Migration v. Emulation Must either constantly migrate forward or at least have a migration strategy Migration strategy is also ephemeral and represents best current options dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 21

PAGE 21

Recommended File Formats Four basic criteria Specification must be freely available Must be no patents or licenses on the format Other digital repositories using or endorsed format Variety of tools for writing and rendering format dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 22

PAGE 22

Recommended File Formats Migration strategy is also ephemeral and represents best current options Archivematica ( UNESCO preservation system ) Presented with a .WMV ( Windows Media Video ) proprietary file Current strategy: Convert to MPEG 2 Future strategy: Convert to Motion JPEG2000 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 23

PAGE 23

dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 24

PAGE 24

Alexandria Library dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 25

PAGE 25

Alexandria Library dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 26

PAGE 26

Digital Preservation Community Process Models Open Archival Information System (OAIS) Preserving Digital Information published by Task Force on Archiving Digital Information in 1996. Draft OAIS reference model published in 1999 Primary benchmark and chief process model for the preservation of digital assets InterPARES International Researchon Permanent Authentic Records in Electronic Systems. Focuses on records maintenance Digital Curation Centre: Curation Lifecycle Model Defines three levels of preservation and best practices for each dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 27

PAGE 27

OAIS Reference Model Provides framework for understanding of archival concepts needed for long term digital information preservation and access Provides concepts needed by non archival organizations to be effective participants in the preservation process Provides framework for describing and comparing different strategies and techniques Provides basis for comparing data models of digital information preserved and how they change over time Guides the identification and production of OAIS related standards dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 28

PAGE 28

An OAIS must: Negotiate for and accept appropriate information from producers Obtain enough rights to ensure long term preservation Determine the designated community Ensure preserved objects are independently understandable to designated community Follow documented policies and procedures which ensure the information is preserved against all reasonable contingencies Ensure object can be disseminated as authenticated copies of the original or the original Make preservation information available to designated community dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 29

PAGE 29

Toward Certified Digital Repositories Where are we? Granular standards (PREMIS) Process Models (OAIS, InterPARES ) In 2000, RLG and OCLC published Trusted Digital Repositories: Attributes and Responsibilities This inches us closer, by defining a Trusted Digital Repository and expands the discussion to include the policies, standards, as well as technology infrastructure as a trusted system. dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 30

PAGE 30

Developing Metrics for Certification Nestor Catalogue of Criteria for Trusted Digital Repositories Network of Expertise in Long term Storage of Digital Resources Provides: Information and criteria for self assessment for digital repositories Training Tools Iterative process Documentation, Transparency, Adequacy, and Measurability dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 31

PAGE 31

Developing Metrics for Certification DRAMBORA Digital Repository Audit Model Based on Risk Assessment Digital Curation Center (DCC) and Digital Preservation Europe (DPE) Focuses on risk identification, risk prioritization, and risk mitigation of risks through six step process of self assessment. Developed from a TRAC based self assessment originally dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 32

PAGE 32

Developing Metrics for Certification TRAC Trusted Repositories Audit & Certification Criteria & Checklist Research Libraries Group (RLG) and U.S. National Archives and Record Administration (NARA) 2005 Grant from Andrew W. Mellon Foundation to develop procedures and activities to audit and certify digital archives. 2007 Final publication includes 84 criteria Organizational infrastructure Digital object management Technologies (IT and security) Two repositories ( Portico and HathiTrust ) initially audited Portico certified as a trustworthy digital repository for CRL community dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 33

PAGE 33

Toward Standards Conformity Assessment Requirements for Bodies Providing Audi and Certification of Management Systems ( ISO/IEC 17021 ) is current international standard applied Audit and Certification of Trustworthy Digital Respositories ( CCSDS 652.0 R1, ISO/DIS 16363 ) still under development as an ISO standard Requirements for Bodies Providing Audit and Certification of Candidate Trustworthy Digital Respositories also under development as an ISO standard dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 34

PAGE 34

dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 35

PAGE 35

Recommendations Be aware of the ongoing cost of preservation and plan according Be a part of something larger Subject based trusted repositories LLMC ( Law Library Microform Consortium ) Regional and International trusted repositories HathiTrust Florida Dark Archive / State Universities dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 36

PAGE 36

Review of Topics What to preserve? How do I know it is preserved? Object level Preservation File level Preservation Organizational : Trustworthy Digital Repositories Recommendations 37 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 37

dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 38