Search Constraints
Search Results
-
Conference paper (unpublished)
Developing Identifiers for Heritage Collections
'Developing Identifiers for Heritage Collections ' is a tool aimed at heritage professionals to support decision making for persistent identifier (PID) use. The project’s 2020 survey found the value of PIDs, for example in creating trustworthy links, was understood but needs to be more clearly addressed directly to decision makers...Madden, Frances
persistent identifiers, Towards a National Collection, and PIDs
-
Journal article
Six Poll-Tax Receipts from Arsinoe
Micucci, Federica
-
Book chapter
“Existentialist Hu-ha”?: Censoring the Existentialists in the British Theater
This chapter will look at the censorship of playwrights associated with existentialist thinking in the British theater, from the opening up of the London stage to French writers after the Second World War to the end of theater censorship in Britain with the passing of the Theatres Act 1968. Consideration...Andrews, Jamie
-
Journal article
Covid-19 and the Future of the Digital Shift amongst Research Libraries: An RLUK Perspective in Context
Research Libraries UK is a consortium of 37 of the UK and Ireland’s largest research libraries with the purpose of convening its members around the key issues that affect them, to represent their collective voice, to support them as they face shared challenges, and to be an effective advocate on...Baxter, Guy ; Beard, Lorraine ; Beattie, Gavin ; Blake, Michelle ; Greenhall, Matthew …
library services, library space, academic libraries, Covid-19 pandemic, and digital shift
-
Abstract
PIDs for Repositories: DOIs and URNs and Handles, oh my….
This presentation, given at a workshop on PID requirements in the UKRI Open Access Policy, gives an overview of implementation options to meet technical requirement 5a: "PIDs for research outputs must be implemented according to international recognised standards, examples of international standards include DOI, URN or Handle".Cope, Jez
PIDs, Open Access, and UKRI
-
Report
Archiving Social Media Workshop Two (June 2019)
The workshop was organised by the British Library Web Archiving Team with support from Maja Maricevic, Head of Higher Education at the British Library on Monday 10th June 2019. The participants at the workshop were a mix of academic researchers, postgraduate research students and information professionals from the British Library,...UK Web Archive
-
Report
Archiving Social Media Workshop One (July 2018)
The workshop was organised by the British Library Web Archiving Team with support from Maja Maricevic, Head of Higher Education at the British Library on Monday 16th July, 2019. The participants at the workshop were a mix of academic researchers and information professionals from the British Library, The National Archives...UK Web Archive
-
Presentation
Publisher Submission Portal User Survey - Autumn 2021
A presentation given to the Collection Development and Acquisitions sub-group (CDAS) of the Legal Deposit Implementation Group in December 2021. CDAS brings together representatives from the six UK legal deposit libraries, to report on operational matters and plan policy for co-operation on managing the legal deposit collections. The Publisher Submission...Hazell, Lottie
-
Research report
Publisher Submission Portal User Survey Report – December 2021
The Publisher Submission Portal is a tool that can be used to deposit born digital publications to the British Library, and other legal deposit libraries, to fufil obligations under legal deposit law. The Portal is designed for use by small publishers, defined as producing 50 or fewer publications per year....Hazell, Lottie
-
Research report
Quality Assurance in the New Media Writing Prize Collection
The UK Web Archive is an online repository for UK based websites. Managed by the six UK Legal Deposit Libraries—the British Library, the National Library of Scotland, the National Library of Wales, the Bodleian Libraries at Oxford University, Cambridge University Libraries, and the Trinity College, Dublin—the UK Web Archive's chief...Pyke, Tegan
Emerging Formats, digital preservation, British Library PhD placement, web archiving, digital storytelling, and New Media Writing Prize
-
Dataset
Persistent Identifiers as IRO Infrastructure: Survey 2 Data
The survey ran from 4 October to 8 November 2021 and was open to everyone working in Galleries, Libraries, Archives and Museums internationally but the survey had a clear UK focus. Some responses have been removed or recoded to protect the identity of respondents.Kotarski, Rachael ; Madden, Frances
-
Journal article
MapReader: A Computer Vision Pipeline for the Semantic Exploration of Maps at Scale
We present MapReader, a free, open-source software library written in Python for analyzing large map collections (scanned or born-digital). This library transforms the way historians can use maps by turning extensive, homogeneous map sets into searchable primary sources. MapReader allows users with little or no computer vision expertise to i)...Hosseini, Kasra ; Wilson, Daniel C.S. ; Beelen, Kaspar ; McDonough, Katherine
-
Research report
Data Study Group Final Report: The National Archives, UK: Discovering Topics and Trends in the UK Government Web Archive
The challenge we address in this report is to make steps towards improving search and discovery of resources within this vast archive for future archive users, and how the UKGWA collection could begin to be unlocked for research and experimentation by approaching it as data (i.e. as a dataset at...Beavan, David ; Nanni, Federico
-
Journal article
Design Choices for Productive, Secure, Data-Intensive Research at Scale in the Cloud
We present a policy and process framework for secure environments for productive data science research projects at scale, by combining prevailing data security threat and risk profiles into five sensitivity tiers, and, at each tier, specifying recommended policies for data classification, data ingress, software ingress, data egress, user access, user...Arenas, Diego ; Atkins, Jon ; Austin, Claire ; Beavan, David ; Cabrejas Egea, Alvaro …
-
Journal article
Neural Language Models for Nineteenth-Century English
We present four types of neural language models trained on a large historical dataset of books in English, published between 1760-1900 and comprised of ~5.1 billion tokens. The language model architectures include static (word2vec and fastText) and contextualized models (BERT and Flair). For each architecture, we trained a model instance...Hosseini, Kasra ; Beelen, Kaspar ; Colavizza, Giovanni ; Coll Ardanuy, Mariona
-
Conference paper (published)
When Time Makes Sense: A Historically-Aware Approach to Targeted Sense Disambiguation
As languages evolve historically, making computational approaches sensitive to time can improve performance on specific tasks. In this work, we assess whether applying historical language models and time-aware methods help with determining the correct sense of polysemous words. We outline the task of time-sensitive Targeted Sense Disambiguation (TSD), which aims...Beelen, Kaspar ; Nanni, Federico ; Coll Ardanuy, Mariona ; Hosseini, Kasra ; Tolfo, Giorgia …
-
Journal article
Maps of a Nation? The Digitized Ordnance Survey for New Historical Research
Although the Ordnance Survey has itself been the subject of historical research, scholars have not systematically used its maps as primary sources of information. This is partly for disciplinary reasons and partly for the technical reason that high-quality maps have not until recently been available digitally, geo-referenced, and in color....Hosseini, Kasra ; McDonough, Katherine ; van Strien, Daniel ; Vane, Olivia ; Wilson, Daniel C.S.
-
Conference paper (published)
Living Machines: A study of atypical animacy
This paper proposes a new approach to animacy detection, the task of determining whether an entity is represented as animate in a text. In particular, this work is focused on atypical animacy and examines the scenario in which typically inanimate objects, specifically machines, are given animate attributes. To address it,...Coll Ardanuy, Mariona ; Nanni, Federico ; Beelen, Kaspar ; Hosseini, Kasra ; Ahnert, Ruth …
nineteenth-century English, living machines, BERT, and animacy
-
Conference paper (unpublished)
A Deep Learning Approach to Geographical Candidate Selection through Toponym Matching
Recognizing toponyms and resolving them to their real-world referents is required for providing advanced semantic access to textual data. This process is often hindered by the high degree of variation in toponyms. Candidate selection is the task of identifying the potential entities that can be referred to by a toponym... -
Book chapter
How we got here
Wilson, Daniel C.S.
-
Book chapter
Text Meets Space: Geographic Content Extraction, Resolution and Information Retrieval
In this half-day tutorial, we will review the basic concepts of, methods for, and applications of geographic information retrieval, also showing some possible applications in fields such as the digital humanities. The tutorial is organized in four parts. First we introduce some basic ideas about geography, and demonstrate why text...Leidner, Jochen ; Martins, Bruno ; McDonough, Katherine ; Purves, Ross
-
Research report
Data Study Group Final Report: Smart monitoring for conservation areas
WWF (World Wide Fund for Nature) monitors over 250,000 protected areas (e.g. national parks and nature reserves) and thousands of other sites and critical habitats. These sites are the foundation of global natural assets and are central to the preservation of biodiversity and human well-being. Unfortunately, they face increasing pressures... -
Abstract
Using smart annotations to map the geography of newspapers
Geographic information is a key component in the description of collection objects, and yet its format is often unsuited for use with methods of geographic analysis. Catalogue entries are often inconsistent, in plain text, and without geographic coordinates (much less coordinates linked to authority records). Georesolution of the relevant fields...Ryan, Yann ; Coll Ardanuy, Mariona ; van Strien, Daniel ; Hosseini, Kasra ; Beelen, Kaspar …
-
Conference paper (unpublished)
Contextualizing Victorian Newspapers
Beelen, Kaspar ; Ahnert, Ruth ; Beavan, David ; Coll Ardanuy, Mariona ; Hosseini, Kasra …
-
-
Conference paper (unpublished)
Defoe: A Spark-Based Toolbox for Analysing Digital Historical Textual Data
This work presents defoe, a new scalable and portable digital eScience toolbox that enables historical research. It allows for running text mining queries across large datasets, such as historical newspapers and books in parallel via Apache Spark. It handles queries against collections that comprise several XML schemas and physical representations.... -
Journal article
Babbage among the insurers: Big 19th-century data and the public interest
This article examines life assurance and the politics of ‘big data’ in mid-19th-century Britain. The datasets generated by life assurance companies were vast archives of information about human longevity. Actuaries distilled these archives into mortality tables – immensely valuable tools for predicting mortality and so pricing risk. The status of...Wilson, Daniel C.S.
big data, Thomas Rowe Edmonds, Charles Babbage, public interest, and insurance
-
Policy report
British Library Persistent Identifier Policy
The purpose of this policy is to outline the approach to the use of identifiers and persistent identifiers (PIDs) in internal and external facing British Library systems, to define a consistent set of requirements for PIDs to use when managing and developing services and to articulate the responsibilities in applying...British Library
-
Dataset
Digitised Books. c. 1510 - c. 1900. JSONL (OCR derived text + metadata)
The dataset comprises metadata and OCR generated text from 49,455 digitised books published between c. 1510 - c. 1900. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in JSON Lines (JSONL) text format.British Library Labs ; British Library
OCR and monographs
-
Book
A catalogue of manuscript and printed reports, field books, memoirs, maps, etc., of the Indian Surveys, deposited in the map room of the India office
A catalogue of manuscript and printed reports, field books, memoirs, maps, etc., of the Indian Surveys, deposited in the map room of the India office is the published listing of the holdings of the map collection of the India Office, the working archive used for the administration of India from...Markham, Clements
-
Dataset
Mapping Irish Football
The Mapping Irish Football project called on the crowd to share any newspaper references they may have come across of women and any code of football prior to and including 1973. It is hoped that this project will start a conversation amongst researchers interested in Irish sports to do more...Byrne, Helena ; Bolton, Steve ; Carrier, John ; Farrell, Gerald ; Faller, Helge …
women's football, newspaper data, crowdsourcing, and Irish football history
-
Dataset
Dataset for Toponym Resolution in Nineteenth-Century English Newspapers
We present a new dataset (version 2) for the task of toponym resolution in digitised historical newspapers in English. It consists of 455 annotated articles from newspapers based in four different locations in England (Manchester, Ashton-under-Lyne, Poole and Dorchester), published between 1780 and 1870. The articles have been manually annotated...Coll Ardanuy, Mariona ; Beavan, David ; Beelen, Kaspar ; Hosseini, Kasra ; Lawrence, Jon …
nineteenth-century English, dataset, newspapers, toponym resolution, and geographic information retrieval
-
Presentation
Mapping Irish Women’s Football
The Mapping Irish Football project is calling on the crowd to share any newspaper references they may have come across of women and any code of football prior to and including 1973. It is hoped that this project will start a conversation amongst researchers interested in Irish sports to do...Byrne, Helena ; Gibbs, Stuart
-
Research report
Living with Machines Delivery Plan version 1, 2019
Living with Machines is a five-year collaborative project. It aims to generate new perspectives on the effects of the mechanisation of labour on the lives of ordinary people in Britain during the 'long nineteenth century' (c.1780-1918), by developing computational and historical techniques and research questions for working with historical sources....Ahnert, Ruth ; Beavan, David ; Colavizza, Giovanni ; Farquhar, Adam ; Griffin, Emma …
-
Conference paper (published)
Station to Station: Linking and Enriching Historical British Railway Data
The transformative impact of the railway on nineteenth-century British society has been widely recognized, but understanding that process at scale remains challenging because the Victorian rail network was both vast and in a state of constant flux. Michael Quick’s reference work Railway Passenger Stations in Great Britain: a Chronology offers...Coll Ardanuy, Mariona ; Beelen, Kaspar ; Lawrence, Jon ; McDonough, Katherine ; Nanni, Federico …
-
Conference paper (published)
Towards a Foundation for Collaborative Digital Archiving with Local Concert-Giving Organisations
The centenaries of former chapters of the British Music Society (BMS), established in 1918, have prompted their governing bodies to take stock of their histories and build on the cataloguing, documentation and preservation of their archival collections. The InterMusE project aims to support this shared instinct to archive by capturing... -
Journal article
Dunhuang scrolls: Innovative storage solutions at the British Library
The British Library’s Stein collection contains about 14,000 scrolls, fragments and booklets in Chinese from a cave in the Buddhist Mogao Caves complex near Dunhuang in north-west China. This article describes storage and access solutions for the collection in the context of a busy research library and the currently ongoing...Kralka, Paulina ; Muzart, Marya
conservation, storage, paper, Central Asia, Dunhuang, and scroll
-
Conference paper (unpublished)
What does the future hold for "open" and cultural heritage institutions?
GLAMs’ [Galleries, Libraries, Archives and Museums] public interest mission is squarely aligned with the open access ethos. Indeed, making their collections as openly accessible, shareable, and reusable as possible is the best way for GLAMs to achieve their mission as they digitize and offer their collections online. But only a...Vézina, Brigitte
-
Conference paper (unpublished)
The Ethics of Open Access in the Endangered Archives Programme
The Endangered Archives Programme (also known as EAP) gives funding to people running projects to digitise and preserve archival materials at risk of destruction. These can date from any time before the middle of the twentieth century, and from most parts of the world except Europe and North America. The...Schaik, Sam van