Search Constraints
Search Results
-
-
Dataset
Datasets for toponym recognition and disambiguation for nineteenth-century English newspapers
We present two datasets, one for the task of toponym recognition and one for the task of toponym disambiguation. The datasets are derived from the "Dataset for Toponym Resolution in Nineteenth-Century English Newspapers" (DOI: https://doi.org/10.23636/r7d4-kw08). The toponym recognition dataset consists of two JSON files (ner_fine_train.json and ner_fine_dev.json), whereas the toponym...Coll Ardanuy, Mariona ; Nanni, Federico
toponym disambiguation, nineteenth-century newspapers, named entity recognition, entity linking, toponym resolution, toponym recognition, and dataset
-
Dataset
DeezyMatch training set for OCR
Optical character recognition (OCR) is the process of automatically transcribing text from images. The presence of OCR-induced errors in digitised text is a common problem in the digital humanities. OCR errors are usually due to the misrecognition of characters, such as "h" recognised as "b", or "c" recognised as "o".... -
Conference paper (unpublished)
(Re)investing in a national repository infrastructure for cultural heritage
Since 2018, the British Library (BL) has invested considerable resource in establishing the necessary infrastructure for a national repository service for cultural heritage organisations, using Samvera Hyku. This has entailed working closely with all known Hyku suppliers and developers, as well as collaborating with the University of Virginia on an...Basford, Jenny ; Holt, Ilkay ; Jevon, Graham ; Ramsey, Nora
open access, OR2023, and repository
-
Dataset
Diachronic word embeddings from 19th-century newspapers digitised by the British Library (1800-1919)
Word vectors related to the paper "Machines in the media: semantic change in the lexicon of mechanization in 19th-century British newspapers" by Nilo Pedrazzini and Barbara McGillivray (2022). The embeddings were trained on a 4.2-billion-word corpus of 19th-century British newspapers using Word2Vec and specific parameters. The embeddings are divided into...Pedrazzini, Nilo ; McGillivray, Barbara
historical semantics, word-vectors, late-modern-english, newspapers, diachronic-embeddings, and word2vec
-
Dataset
Decade-level Word2Vec models from automatically transcribed 19th-century newspapers digitised by the British Library (1800-1919)
Word embeddings trained on a 4.2-billion-word corpus of 19th-century British newspapers using Word2Vec and specific parameters. The embeddings are divided into periods of ten years each. Unlike those in this repository, these were not aligned and OCR errors skimmed from the vocabulary. See related GitHub repository for the full documentation:...Pedrazzini, Nilo
historical semantics, British newspapers, word embeddings, word vectors, word2vec, and Late Modern English
-
Book chapter
The sociability of scientific knowledge exchange in British Farming, 1950-90
This is a single chapter from an edited collection that has the following abstract: In the late nineteenth and early twentieth centuries, agricultural practices and rural livelihoods were challenged by changes such as commercialization, intensified global trade, and rapid urbanization. Planting Seeds of Knowledge studies the relationship between these agricultural...Horrocks, Sally ; Martin, John ; Merchant, Paul
agrciculture, food, and farming
-
Dataset
Incunabula Printed Catalogue Dataset: Volumes 1-10 copy of github repository
This dataset includes the github repository used to derive catalogue entries from volumes 1-10 of the "Catalogue of books printed in the 15th century now at the British Museum" (know as BMC). The BMC was published between 1908-2007 and comprises detailed descriptions of the incunabula collection at the British Library....British Library
book history, metadata, catalogues, datasets, incunabula, early printed books, and early printing
-
Dataset
Incunabula Printed Catalogue Dataset: Volumes 1-10
This dataset includes the catalogue entries derived from volumes 1-10 of the "Catalogue of books printed in the 15th century now at the British Museum" (know as BMC). The BMC was published between 1908-2007 and comprises detailed descriptions of the incunabula collection at the British Library. The dataset was created...British Library
datasets, catalogues, early printing, book history, early printed books, metadata, and incunabula
-
Dataset
Incunabula Printed Catalogue Dataset Metadata: Volumes 1-10
This dataset includes the combined catalogue entries derived from volumes 1-10 of the "Catalogue of books printed in the 15th century now at the British Museum" (know as BMC). The BMC was published between 1908-2007 and comprises detailed descriptions of the incunabula collection at the British Library. The dataset was...British Library
datasets, catalogues, early printing, incunabula, early printed books, metadata, and book history
-
Poster (published)
Datafication and reuse of the descriptions of the incunabula collections at the British Library
The poster discusses the AHRC-RLUK Professional Practice Fellowship project which investigates the legacies of curatorial voice in the descriptions of incunabula collections at the British Library and their future reuse.Atanassova, Rossitza
historical catalogues, computational analysis, incunabula, catalogue data, and practitioner research
-
Book chapter
Blue Heritage Among Fishermen of Mafia Island, Tanzania
Off the South-East coast of Tanzania, at the mouth of the Rufiji River, lies the Mafia archipelago. This chapter explores the concept of blue heritage by showing how the fishermen of Mafia have altered their language, perception of time and sense of community to include the sea and its animals....de Haan, Mariam
ethnography, Rufiji River, Tanzania, whale sharks, fishing communities, fishing, Mafia archipelago, and fish
-
Journal article
Challenging legacies at the British Library
The British Library established a corporate Anti-Racism Project (2020) designed to encourage participation via six subgroups, with staff recommendations incorporated into “Enacting Change”, the Library's Race Equality Action Plan (2022). The research and recommendations of the Cataloguing and Metadata subgroup fed into a pilot project proposed as a proof of...Danskin, Alan
Caribbean, anti-racism, South Asia, Cataloguing, and Metadata
-
Journal article
FAST the Inside Track: Where We Are, Where Do We Want to Be, and How Do We Get There?
This is an overview of the development of FAST (Faceted Application of Subject Terminology) from its inception in the late 1990s, through its development and implementation to the work being undertaken by OCLC and the FAST Policy and Outreach Committee (FPOC) to develop and promote FAST. FPOC members explain how... -
Dataset
Diachronic and diatopic word embeddings from newspapers digitised by the British Library (1830-1889): North and South England
Diachronic word embeddings (decade-level) trained with Word2Vec (via Gensim) on different geographic subcorpora of the Heritage Made Digital British and the Living with Machines historical newspaper collections: - North England (north.zip) - South England (south.zip) At the moment, for each subcorpus, Word2Vec models are available for each decade in the...Pedrazzini, Nilo ; McGillivray, Barbara
historical semantics, diachronic embeddings, late modern English, word embeddings, word vectors, word2vec, and diatopic embeddings
-
Journal article
Writing Milan and Turin in the Light of (Failed) Utopia: Luciano Bianciardi and Paolo Volponi
This article examines a series of novels by Italian writers, Luciano Bianciardi and Paolo Volponi, that capture the transformations brought about by the post-World War II economic growth in the urban-industrial society of Northern Italy. The analysis draws on utopia as, in Ruth Levitas’s words, a ‘desire for a better...Brecciaroli, Giulia
Paolo Volponi, Utopia/dystopia, Turin, Luciano Bianciardi, Literary Urban Studies, Milan, and post-war Italian literature
-
Dataset
Living with Machines Zooniverse Participant Survey
Summary results from a survey of contributors to Living with Machines Zooniverse crowdsourcing projects. Responses were received between 24 May and 13 June 2022. We designed the survey so that we could align our reporting with two other audience / participant research groups. Firstly, we used the demographic categories that...British Library
online volunteering, digital participation, citizen science, citizen history, questionnaire, crowdsourcing, survey, and audience research
-
Poster (unpublished)
Changing Nature of the Visual Depiction of the Levant in the Early Printed Maps
This poster is part of a wider research project which demonstrates the transformation of topography, toponyms, and the hydrographic network of Cyprus and the Levant region represented in the maps, from the earliest printed map of the region (printed in 1477) up to the mid-19th century.Peszko, Magdalena
maps, toponyms, Middle East, cartography, early printed maps, and Levant
-
Journal article
Kaitiakitanga: Utilising Māori Holistic Conservation in Heritage Institutions
It is imperative that heritage institutions deal with the legacies of colonialism within their collections, the way this material is retained, preserved, displayed and interpreted, and the impact that this will have on local and global audiences. Failing to do so risks such organisations being perceived as the beneficiaries of...Nolan, Scott Ratima
empowerment, Māori, collections, custodianship, inclusion, and Kaitiakitanga
-
Journal article
Vehicularizing the vernacular: using the periodical press to popularize vernacular languages in Soviet Turkic communities
The study of language and script change among the Turkic communities of the Soviet Union often focuses on the switch from Arabic to Latin scripts. Less attention is paid to adaptations of the Arabic script to Turkic vernaculars, and to attempts aimed at convincing the literate masses of their usefulness....Erdman, Michael J.
-
Book chapter
Hunting for Treasure: Living with Machines and the British Library Newspaper Collection
This chapter discusses the open access digitisation programme undertaken by Living with Machines, exploring the range of constraints that inform digitisation strategies and selection priorities. Because the landscape of digitised newspaper collections is so complex, and research and digitisation processes operate on different timelines, we have focused on opportunities to...Tolfo, Giorgia ; Vane, Olivia ; Beelen, Kaspar ; Hosseini, Kasra ; Lawrence, Jon …
interdisciplinarity, digitised newspaper collections, digital corpus, research workflows, and digitisation strategy
-
Dataset
UK Doctoral Thesis Metadata from EThOS
This dataset has been superseded by a more recent version: https://doi.org/10.23636/rcm4-zk44. If you require access to an earlier version, please email openaccess@bl.uk, including the dataset title, date, and DOI in your request. The data in this collection comprises the bibliographic metadata for all UK doctoral theses listed in EThOS, the...British Library ; Rosie, Heather
higher education, dissertations, PhD, doctoral, and EThOS
-
Book
Making Miracles in Medieval England
The cult of the saints was central to medieval Christianity largely due to the miraculous. Saints were members of the elect of heaven and could intercede with God on the behalf of supplicants. Whilst people visited shrines and prayed to the saints for many reasons it was the hope of...Lynch, Tom
-
Journal article
The British Library – rethinking physical storage
Purpose The British Library (BL) faces a significant challenge with storage space predicted to run out within the next three years. However, alongside a plan to create additional capacity, the BL also intends to take the opportunity to rethink the integration of storage and workflows in order to implement a...Appleyard, Andrew H.
storage, library, automated, sustainability, and repository
-
Book chapter
Library partnerships in an age of openness
Librarians are strong collaborators. As we move toward the “next normal,” partnerships and collaboration will play an even larger role. In our post-pandemic world, what does “open” mean in terms of access to libraries, their staff, and their services? National libraries, for example, are still, to some extent, fixed in...Jolly, Liz
public libraries, collaboration, hybrid digital, open libraries, COVID-19 and libraries, library partnerships, national libraries, British Library, library mission, and access
-
Dataset
Review of Information Studies Courses in Higher Education for Web Archive Provision
This project reviewed the curriculum of information studies postgraduate courses in a number of countries across Europe. The curriculum for Library Studies, Archival Studies, Record Management and Digital Curation/Preservation and Digital Humanities were reviewed to see if there was any reference to web archiving. As these web pages were reviewed...