Search Constraints
Search Results
-
Conference paper (published)
Prospects for a Big Data History of Music
This position paper sets out the possibility of a musicology based on the analysis of musical-bibliographical metadata as Big Data. It outlines the work underway, as part of the AHRC-funded project A Big Data History of Music, to align seven major datasets of musical-bibliographical metadata. After discussing some of the...Rose, Stephen ; Tuppen, Sandra
-
Conference paper (published)
A Latinised Arabic for All? Issues of representation, purpose and audience
This paper reviews two major issues which account for much of the variation in representing Arabic using Latin characters. Since the Latinisation of Arabic entails encoding additional phonetic information(by adding short vowels), how we choose to represent Arabic for Latinisation becomes a central issue. This representation may either reflect the...Aboelezz, Mariam
-
Conference paper (published)
Arabic dialect identification in the context of bivalency and code-switching
In this paper we use a novel approach towards Arabic dialect identification using language bivalency and written code-switching. Bivalency between languages or dialects is where a word or element is treated by language users as having a fundamentally similar semantic content in more than one language or dialect. Arabic dialect...El-Haj, Mahmoud ; Rayson, Paul ; Aboelezz, Mariam
Arabic, machine learning, dialects, language identification, NLP, and bivalency
-
Conference paper (published)
Sustainability assessments at the British Library: Formats, frameworks and findings
File format assessments have been the subject of much debate in and outside of the preservation community in the past decade. Recognizing the unique structural, operational, and collecting context of the British Library, the Library’s digital preservation team recently initiated new format assessment work to deliver recommendations on which file...Pennock, Maureen ; Wheatley, Paul ; May, Peter
file formats, British Library, assessments, transparency, sustainability, and preservation master
-
Conference paper (published)
Identifying digital preservation requirements: Digital Preservation Strategy and collection profiling at the British Library
The British Library is increasingly a digital library. Over past decades, it has built up significant collections of digital content covering a very wide range of content types. In addition to the increasing amounts of digital content acquired by purchase or donation, the Library and its partners have also invested...Day, Michael ; McDonald, Ann ; Kimura, Akiko ; Pennock, Maureen
preservation planning, institutional contexts of preservation, collection content profiling, and digital preservation
-
Conference paper (published)
The Flashback Project: Rescuing disk-based content from the 1980s to the present day
This paper introduces the British Library's Flashback project, a proof-of-concept that explored the practical challenges of preserving digital content stored on physical media (magnetic and optical disks) using a sample of content from hybrid collection items dating from between 1980 and 2010. It describes some of the activities undertaken by...Pennock, Maureen ; May, Peter ; Day, Michael ; Davies, Kevin ; Whibley, Simon …
-
Conference paper (published)
Adventures with ePub3: when rendering goes wrong
The role of standards in digital preservation is widely acknowledged. The current version of the ePub standard, used for publishing and disseminating eBooks, is ePub3, specifically 3.1 (January 2017). A marked difference from ePub2 is support for fixed layout files and, whilst several different ePub readers are available, not all...Pennock, Maureen ; Day, Michael
-
Conference paper (published)
Preservation planning for emerging formats at the British Library
The British Library and the other UK Legal Deposit Libraries have been collecting various forms of born-digital digital publications since 2013 as part of what is known as Non-Print Legal Deposit (NPLD). In 2017, the UK Legal Deposit Libraries established an Emerging Formats project to look at selected types of...Day, Michael ; Pennock, Maureen ; Smith, Caylin ; Jenkins, Jeremy ; Cooke, Ian
-
Conference paper (published)
Not just a British library: enabling a global discovery experience
Within the walls of the British Library lies one of the greatest collections in the world. However, the value of the British Library lies not only in the preservation of heritage items, but also in its determination to keep pace with the many changes in the global information environment. As...Flanagan, Dimity
open access; repositories; discovery; persistent identifiers; text and data mining; digitisation
-
Conference paper (published)
Considerations on the acquisition and preservation of ebook mobile apps
In 2018 and 2019, as part of the UK Legal Deposit Libraries’ sponsored ‘Emerging Formats’ project, the British Library’s digital preservation team undertook a program of research into the preservation of new forms of content. One of these content types was eBooks published as Mobile Apps. Research considered a relatively...Pennock, Maureen ; May, Peter ; Day, Michael
access, mobile apps, acquisition, digital preservation, and preservation
-
Conference paper (published)
Practical analysis of TIFF file size reductions achievable through compression
This paper presents results of a practical analysis into the effects of three main lossless TIFF compression algorithms – LZW, ZIP and Group 4 – on the storage requirements for a small set of digitized materials. In particular we are interested in understanding which algorithm achieves a greater reduction in...May, Peter ; Davies, Kevin
LZW, Group 4, LibTiff, TIFF, ZIP, compression, and ImageMagick
-
Conference paper (published)
Deal with conflict, capture the relationship: the case of digital object properties
Properties of digital objects play a central role in digital preservation. All key preservation services are linked via a common understanding of the properties which describe the digital objects in a repository's care. Unfortunately, different services deal with properties on sometimes different levels of description. While, for example, a preservation...Dappert, Angela
-
Conference paper (published)
A METS based information package for long term accessibility of web archives
The British Library’s web archive comprises several terabyte of harvested websites. Like other content streams this data should be ingested into the library’s central preservation repository. The repository requires a standardized Submission- and Archival Information Package. Harvested Websites are stored in Archival Information Packages (AIP). Each AIP is described by...Enders, Markus
-
Conference paper (published)
LIFE3: A predictive costing tool for digital collections
Predicting the costs of long-term digital preservation is a crucial yet complex task for even the largest repositories and institutions. For smaller projects and individual researchers faced with preservation requirements, the problem is even more overwhelming, as they lack the accumulated experience of the former. Yet being able to estimate...Hole, Brian ; Lin, Li ; McCann, Patrick ; Wheatley, Paul
-
Conference paper (published)
Capturing and replaying streaming media in a web archive – a British Library case study
A prerequisite for digital preservation is to be able to capture and retain the content which is considered worth preserving. This has been a significant challenge or web archiving, especially for websites with embedded streaming media content, which cannot be copied via a simple HTTP request to a URL. This...Hockx-Yu, Helen ; Crawford, Lewis ; Coram, Roger ; Johnson, Stephen
-
Conference paper (published)
Developing a robust migration workflow for preserving and curating hand-held media
Many memory institutions hold large collections of hand-held media, which can comprise hundreds of terabytes of data spread over many thousands of data-carriers. Many of these carriers are at risk of significant physical degradation over time, depending on their composition. Unfortunately, handling them manually is enormously time consuming and so...Dappert, Angela ; Jackson, Andrew ; Kimura, Akiko
disk-copying robot, iPRES, data-carrier stabilization, auto loader, and digital preservation
-
Conference paper (published)
An analysis of contemporary JPEG2000 codecs for image format migration
This paper presents results of an analysis of different implementations of the JPEG2000 standard, specifically part 1: JP2, an image format that is currently popular within the digital preservation community. In particular we are interested in the effect different JPEG2000 codecs (encoders and decoders) have on image quality in response...Palmer, William ; May, Peter ; Cliff, Peter
TIFF, image quality, generational loss, JPEG2000, migration, codec, and PSNR
-
Conference paper (published)
Quality assured image file format migration in large digital object repositories
This article gives an overview on how different components developed by the SCAPE project are intended to be used in composite file format migration workflows; it will explain how the SCAPE platform can be employed to make sure that the workflows can be used to migrate very large image collections...Schlarb, Sven ; Cliff, Peter ; May, Peter ; Palmer, William ; Hahn, Matthias …
-
Conference paper (published)
The Integrated Preservation Suite: Scaled and automated preservation planning for highly diverse digital collections (long paper)
The Integrated Preservation Suite is an internally funded project at the British Library to develop and enhance the Library's preservation planning capabilities, largely focussed on automation and addressing the Library's heterogeneous collections. Through agile development practices, the project is iteratively designing and implementing the technical infrastructure for the suite as...May, Peter ; Pennock, Maureen ; Russo, David
software preservation, knowledge base, preservation watch, and preservation planning
-
Conference paper (published)
Costing the Digital Preservation Lifecycle More Effectively
Having confidence in the permanence of a digital resource requires a deep understanding of the preservation activities that will need to be performed throughout its lifetime and an ability to plan and resource for those activities. The LIFE (Lifecycle Information For E-Literature) and LIFE2 Projects have advanced understanding of the...Wheatley, Paul