Search Constraints
Search Results
-
Conference paper (unpublished)
Defoe: A Spark-Based Toolbox for Analysing Digital Historical Textual Data
This work presents defoe, a new scalable and portable digital eScience toolbox that enables historical research. It allows for running text mining queries across large datasets, such as historical newspapers and books in parallel via Apache Spark. It handles queries against collections that comprise several XML schemas and physical representations.... -
Conference paper (unpublished)
ICDAR2019 Competition on Recognition of Early Indian Printed Documents – REID2019
This paper presents an objective comparative evaluation of page analysis and recognition methods for historical documents with text mainly in Bengali language and script. It describes the competition rules, dataset, and evaluation methodology. Results are presented for five methods - three submit-ted, one re-run, and one open source state-of-the-art system....Clausner, Christian ; Antonacopoulos, Apostolos ; Derrick, Tom ; Pletschacher, Stefan
-
Conference paper (published)
Multi-spectral Imaging at the British Library
The British Library is the national library of the United Kingdom and holds over 150 million items with an additional three million new items added each year. The 625 km of shelving contains manuscripts, maps, newspapers, magazines, prints, drawings, music scores and patents. The fundamental purpose of the Library is...Duffy, Christina
multi-spectral imaging, text-recovery, iron gall ink, digital, digitization, reagent, data, library, fire-damage, imaging, and erasure
-
Conference paper (published)
Implementing Digital Preservation Strategy: Developing content collection profiles at the British Library
The British Library is increasingly a digital library. Through both digitization and acquisition, it has built up significant collections of digital content covering a very wide range of content types. Most recently, the extension of legal deposit provisions to non-print works in 2013 has meant that it - working in...Day, Michael ; McDonald, Ann ; Pennock, Maureen ; Kimura, Akiko
content management, digital libraries, content collection profiles, Internet, and digital preservation