Search Constraints
Search Results
-
Conference paper (published)
MapReader: a computer vision pipeline for the semantic exploration of maps at scale
We present MapReader, a free, open-source software library written in Python for analyzing large map collections. MapReader allows users with little computer vision expertise to i) retrieve maps via web-servers; ii) preprocess and divide them into patches; iii) annotate patches; iv) train, fine-tune, and evaluate deep neural network models; and...Hosseini, Kasra ; Wilson, Daniel C. S. ; Beelen, Kaspar ; McDonough, Katherine
maps and ordnance survey
-
Conference paper (published)
“Webcomics Archive? Now I'm Interested”: Comics Readers Seeking Information in Web Archives
There is a longstanding tradition of understanding information needs and interaction behavior across different user groups to inform the design of digital products and services. There is a gap in such research of comics readers, specifically how they seek and interact with the information and interfaces of web-based archives provided...Berube, Linda ; Makri, Stephann ; Cooke, Ian ; Priego, Ernesto ; Wisdom, Stella
-
Conference paper (published)
Towards a Collections Model for Preservation Planning at the British Library
The development of a framework for preservation planning at the British Library has highlighted the need for a more-structured understanding of its digital collections, in particular with regard to identifying the specific sets of objects that would be the focus of preservation plans. Work has recently commenced on developing a...Day, Michael ; Pennock, Maureen
-
Conference paper (published)
Design Patterns in Digital Preservation: Understanding Information Flows
This paper proposes a framework to help understand the different ways digital preservation goals can achieved, and the contextual factors these choices depend on. This is done through a worked example: three different design patterns representing the three possible modes of archival information flow, each illustrated with realistic examples and...Jackson, Andrew N
OAIS, design patterns, community, risk management, and innovation
-
Conference paper (published)
Exploring Software, Tools and Methods used in Web Archive Research
This paper is one part of a larger research project, titled, Web Archives - Researcher Skills and Tools (WARST). In this poster we focus on the data from the WARST study which examines the software, tools and methods used in the web archive research lifecycle.Schmid, Katharina ; Healy, Sharon ; Byrne, Helena
web archiving, web archive research, web archive users, and web archive creators
-
Conference paper (published)
When Time Makes Sense: A Historically-Aware Approach to Targeted Sense Disambiguation
As languages evolve historically, making computational approaches sensitive to time can improve performance on specific tasks. In this work, we assess whether applying historical language models and time-aware methods help with determining the correct sense of polysemous words. We outline the task of time-sensitive Targeted Sense Disambiguation (TSD), which aims...Beelen, Kaspar ; Nanni, Federico ; Coll Ardanuy, Mariona ; Hosseini, Kasra ; Tolfo, Giorgia …
-
Conference paper (published)
Living Machines: A study of atypical animacy
This paper proposes a new approach to animacy detection, the task of determining whether an entity is represented as animate in a text. In particular, this work is focused on atypical animacy and examines the scenario in which typically inanimate objects, specifically machines, are given animate attributes. To address it,...Coll Ardanuy, Mariona ; Nanni, Federico ; Beelen, Kaspar ; Hosseini, Kasra ; Ahnert, Ruth …
nineteenth-century English, living machines, BERT, and animacy
-
Conference paper (published)
Towards a Foundation for Collaborative Digital Archiving with Local Concert-Giving Organisations
The centenaries of former chapters of the British Music Society (BMS), established in 1918, have prompted their governing bodies to take stock of their histories and build on the cataloguing, documentation and preservation of their archival collections. The InterMusE project aims to support this shared instinct to archive by capturing... -
Conference paper (published)
Developing an Open-Source Corpus of Yoruba Speech
This paper introduces an open-source speech dataset for Yoruba — one of the largest low-resource West African languages spoken by at least 22 million people. Yoruba is one of the official languages of Nigeria, Benin and Togo, and is spoken in other neighboring African countries and beyond. The corpus consists...Gutkin, Alexander ; Demirşahin, Işın ; Kjartansson, Oddur ; Rivera, Clara ; Túbọ̀sún, Kọ́lá
-
Conference paper (published)
ICIDS2020 Panel: Building the Discipline of Interactive Digital Narratives
Building our discipline has been an ongoing discussion since the early days of ICIDS. From earlier international joint efforts to integrate research from multiple fields of study to today’s endeavours by researchers to provide scholarly works of reference, the discussion on how to continue building Interactive Digital Narratives as a...Bernstein, Mark ; Palosaari Eladhari, Mirjam ; Koenitz, Hartmut ; Louchart, Sandy ; Nack, Frank …
-
Conference paper (published)
DeezyMatch: A Flexible Deep Learning Approach to Fuzzy String Matching
We present DeezyMatch, a free, open-source software library written in Python for fuzzy string matching and candidate ranking. Its pair classifier supports various deep neural network architectures for training new classifiers and for fine-tuning a pretrained model, which paves the way for transfer learning in fuzzy string matching. This approach...Hosseini, Kasra ; Nanni, Federico ; Coll Ardanuy, Mariona
Natural Language Processing, string matching, toponym matching, machine learning, and digital humanities
-
Conference paper (published)
Archiving Interactive Narratives at the British Library
This paper describes the creation of the Interactive Narratives collection in the UK Web Archive, as part of the UK Legal Deposit Libraries Emerging Formats Project. The aim of the project is to identify, collect and preserve complex digital publications that are in scope for collection under UK Non-Print Legal...Clark, Lynda ; Rossi, Giulia Carla ; Wisdom, Stella
Emerging Formats, digital storytelling, new media collection management, Interactive Narratives collection, digital preservation, and web archiving
-
Conference paper (published)
Preserving eBooks: Past, Present and Future - A Series of National Library Perspectives
This panel will present and discuss different eBook workflows and challenges from four national libraries, considering a range of issues from technical complexities to evolution of the content type and changes in the publishing/collecting landscape.Owens, Trevor ; Pennock, Maureen ; Smyth, Tom ; Steinke, Tobias
access, ingest, ebooks, digital preservation, formats, and scale
-
Conference paper (published)
Dawn of Digital Repositories Certification under ISO 16363. Exploring the Horizon and beyond
The dawn of Trustworthy Digital Repository Certification under the ISO 16363:2012 standard is on the horizon. Across the digital preservation community, institutions are eager to learn more about the processes of preparing for and undergoing an ISO 16363 audit from an accredited third-party organization. As the first ISO 16363 audits...Giaretta, David ; LaPlant, Lisa ; Shiers, Jamie ; Tieman, Jessica ; Pennock, Maureen …
repository, certification, trustworthy, audit, and standards
-
Conference paper (published)
Resolving places, past and present: toponym resolution in historical British newspapers using multiple resources
Newspapers and their metadata are richly geographical, not only in their distribution but also their content. Attending to these spatial features is a prerequisite in newspaper research. Following other projects to have geoparsed place names in newspapers, we describe our approach to linking historical geospatial information in text to real-world...Coll Ardanuy, Mariona ; McDonough, Katherine ; Krause, Amrey ; Wilson, Daniel C.S. ; Hosseini, Kasra …
-
Conference paper (published)
CRISP: Crowdsourcing Representation Information to Support Preservation
In this paper, we describe a new collaborative approach to the collection of representation information to ensure long term access to digital content. Representation information is essential for successful rendering of digital content in the future. Manual collection and maintenance of RI has so far proven to be highly resource...Pennock, Maureen ; Jackson, Andrew N. ; Wheatley, Paul
-
Conference paper (published)
Cross-disciplinary Collaborations to Enrich Access to Non-Western Language Material in the Cultural Heritage Sector
The British Library is home to millions of items representing every age of written civilisation, including books, manuscripts and newspapers in all written languages. Large digitisation programmes currently underway are opening up access to this rich and unique historical content on an ever increasing scale. However, particularly for historical material...Derrick, Tom ; McGregor, Nora
HTR, page analysis, layout analysis, recognition, Bangla script, Arabic script, OCR, and datasets
-
Conference paper (published)
Multi-spectral Imaging at the British Library
The British Library is the national library of the United Kingdom and holds over 150 million items with an additional three million new items added each year. The 625 km of shelving contains manuscripts, maps, newspapers, magazines, prints, drawings, music scores and patents. The fundamental purpose of the Library is...Duffy, Christina
multi-spectral imaging, text-recovery, iron gall ink, digital, digitization, reagent, data, library, fire-damage, imaging, and erasure
-
Conference paper (published)
Using METS, PREMIS and MODS for Archiving eJournals: Paper - iPRES 2008 - London
As institutions turn towards developing archival digital repositories, many decisions on the use of metadata have to be made. In addition to deciding on the more traditional descriptive and administrative metadata, particular care needs to be given to the choice of structural and preservation metadata, as well as to integrating...Dappert, Angela ; Enders, Marcus
-
Conference paper (published)
Risk Assessment; using a risk based approach to prioritise handheld digital information
The British Library (BL) Digital Library Programme (DLP) has a broad set of objectives to achieve over the next few years, from web-archiving to the ingest of e-journals through to mass digitisation of newspapers and books. These projects are decided by the DLP programme board and are managed by the...McLeod, Rory
- « Previous
- Next »
- 1
- 2
- 3