A METS based information package for long term accessibility of web archives - British Library Research Repository
Conference paper (published)

A METS based information package for long term accessibility of web archives

2010

Abstract

The British Library’s web archive comprises several terabytes of harvested websites. Like other content streams, this data should be ingested into the library’s central preservation repository. The repository requires a standardized Submission Information Package and Archival Information Package. Harvested websites are stored in Archival Information Packages (AIPs). Each AIP is described by a METS file. Operational metadata for resource discovery as well as archival metadata are normalized and embedded in the METS descriptor using common metadata profiles such as PREMIS and MODS. The British Library’s METS profile for web archiving considers dissemination and preservation use cases, ensuring the authenticity of data. The underlying complex content model disaggregates websites into web pages, associated objects and their actual digital manifestations. This additional abstraction layer ensures accessibility over the long term and the ability to carry out preservation actions such as migrations. The library-wide preservation policies and principles thus become applicable to web content as well.
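
The content model described in the abstract (website, web pages, associated objects, digital manifestations) can be illustrated with a minimal sketch of a METS descriptor. This is not the British Library's actual METS profile: the function name `build_mets`, the `TYPE` and `USE` values, the identifiers and the file locations are all hypothetical, and the PREMIS preservation metadata mentioned in the abstract is omitted for brevity. Only standard METS, MODS and XLink element names are used.

```python
import xml.etree.ElementTree as ET

METS = "http://www.loc.gov/METS/"
MODS = "http://www.loc.gov/mods/v3"
XLINK = "http://www.w3.org/1999/xlink"

for prefix, uri in (("mets", METS), ("mods", MODS), ("xlink", XLINK)):
    ET.register_namespace(prefix, uri)


def q(ns, tag):
    """Return a namespace-qualified tag name in ElementTree notation."""
    return f"{{{ns}}}{tag}"


def build_mets(site_title, pages):
    """Build a METS tree for one harvested website (hypothetical layout).

    pages: list of {"label": str, "objects": [(label, file_id, href), ...]}
    """
    root = ET.Element(q(METS, "mets"))

    # Descriptive metadata: a MODS title wrapped in a dmdSec.
    dmd = ET.SubElement(root, q(METS, "dmdSec"), ID="DMD1")
    wrap = ET.SubElement(dmd, q(METS, "mdWrap"), MDTYPE="MODS")
    xml_data = ET.SubElement(wrap, q(METS, "xmlData"))
    mods = ET.SubElement(xml_data, q(MODS, "mods"))
    title_info = ET.SubElement(mods, q(MODS, "titleInfo"))
    ET.SubElement(title_info, q(MODS, "title")).text = site_title

    # File section: the actual digital manifestations (assumed USE value).
    file_sec = ET.SubElement(root, q(METS, "fileSec"))
    file_grp = ET.SubElement(file_sec, q(METS, "fileGrp"), USE="manifestation")

    # Structural map: website -> web page -> associated object (assumed TYPEs).
    struct_map = ET.SubElement(root, q(METS, "structMap"), TYPE="logical")
    site_div = ET.SubElement(
        struct_map, q(METS, "div"), TYPE="website", LABEL=site_title, DMDID="DMD1"
    )
    for page in pages:
        page_div = ET.SubElement(
            site_div, q(METS, "div"), TYPE="webpage", LABEL=page["label"]
        )
        for label, file_id, href in page["objects"]:
            # Register the manifestation in the file section.
            f = ET.SubElement(file_grp, q(METS, "file"), ID=file_id)
            ET.SubElement(
                f, q(METS, "FLocat"), {"LOCTYPE": "URL", q(XLINK, "href"): href}
            )
            # Point the logical object at its manifestation.
            obj_div = ET.SubElement(
                page_div, q(METS, "div"), TYPE="object", LABEL=label
            )
            ET.SubElement(obj_div, q(METS, "fptr"), FILEID=file_id)
    return root


if __name__ == "__main__":
    example = build_mets(
        "Example site",
        [{"label": "Home", "objects": [("index.html", "F1", "file:///data/index.html")]}],
    )
    print(ET.tostring(example, encoding="unicode"))
```

Keeping the logical objects in the structural map separate from the files they point to means a migration would only need to add or replace `file` entries, leaving the description of the website itself untouched.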

Files

There is 1 file associated with this work, which is available for download.

Metadata

  • Resource type
    • Conference paper (published)

  • Institution
    • British Library

  • Organisational unit
    • Digital Preservation

  • Event title
    • iPRES 2010 - 7th International Conference on Preservation of Digital Objects

  • Event location
    • Vienna, Austria

  • Book title
    • iPRES 2010 - Proceedings of the 7th International Conference on Preservation of Digital Objects

  • Editor
    • Rauber, Andreas
    • Kaiser, Max
    • Guenther, Rebecca
    • Constantopoulos, Panos

  • Pagination
    • 9

  • Publisher
    • Österreichische Computer Gesellschaft

  • Place of publication
    • Vienna, Austria

  • Official URL
  • Licence
  • Rights statement
  • Rights holder
    • Österreichische Computer Gesellschaft 2010