Search Constraints
Search Results
-
Dataset
OCR and crowdsourced annotations, Language of Mechanisation, JSON files
Datasets created through crowdsourcing tasks created on the Zooniverse crowdsourcing platform by the Living with Machines ‘language of mechanisation’ project team. Building on earlier work classifying machines by function, we asked volunteers on Zooniverse 'how did the word x change over time and place?' and presented them with options for... -
Dataset
Language of Mechanisation: annotated historical newspaper articles
Datasets created through crowdsourcing tasks created on the Zooniverse crowdsourcing platform by the Living with Machines ‘language of mechanisation’ project team. Building on earlier work classifying machines by function, we asked volunteers on Zooniverse 'how did the word x change over time and place?' and presented them with options for... -
Dataset
UK Doctoral Thesis Metadata from EThOS
The data in this collection comprises the bibliographic metadata for all UK doctoral theses listed in EThOS, the UK's national thesis service. We estimate the data covers around 98% of all PhDs ever awarded by UK Higher Education institutions, dating back to 1787. Thesis metadata from every PhD-awarding university in...British Library ; Rosie, Heather
higher education, student, UK, dissertations, PhD, theses, doctoral, ethos, thesis, and research
-
Software
Hybrid Correspondence Network Processing Script
The Python code was developed to to interrogate the ways in which digital and analogue correspondence files (letters and e-mails) function within the Archive of Harold Pinter; reflecting upon what these patterns might mean for archivists, curators and researchers working with hybrid correspondence collections. This code is collection agnostic and...Mckean, Callum
Harold Pinter, data science, hybrid archives, and visualisations
-
Dataset
EAP031 Catalogue Metadata
This Excel spreadsheet contains the metadata that describes the archival collection digitised in Bulgaria by the EAP031 "The Treasures of Danzan Ravjaa" project team. The metadata was originally created by the EAP031 project team that digitised the archive in 2005. The project team was led by Professor Caroline Humphrey. This...EAP031 Project Team
metadata, manuscripts, and Tibetan
-
Dataset
EAP696 Catalogue Metadata
This Excel spreadsheet contains the metadata that describes the archival collection digitised in Bulgaria by the EAP696 "Minority press in Ottoman Turkish in Bulgaria" project team. The metadata was originally created by the EAP696 project team that digitised the archive in 2014. The project team was led by Mr Stoyan...EAP696 Project Team
-
Geographical dataset
Sarah FitzGerald's PhD placement project folder
This dataset is a zip file that contains the complete folder structure that Sarah used to manage this project. The content includes her planning, work, and outcomes, in the form of reports, presentations and blog posts. In addition to the data visualisations on the projects relating to Africa, Sarah also...FitzGerald, Sarah
West Africa, research collaboration, projects, Africa, humanities, digital scholarship, and data visualisation
-
Dataset
Datasets for toponym recognition and disambiguation for nineteenth-century English newspapers
We present two datasets, one for the task of toponym recognition and one for the task of toponym disambiguation. The datasets are derived from the "Dataset for Toponym Resolution in Nineteenth-Century English Newspapers" (DOI: https://doi.org/10.23636/r7d4-kw08). The toponym recognition dataset consists of two JSON files (ner_fine_train.json and ner_fine_dev.json), whereas the toponym...Coll Ardanuy, Mariona ; Nanni, Federico
toponym disambiguation, nineteenth-century newspapers, named entity recognition, entity linking, toponym resolution, toponym recognition, and dataset
-
Dataset
DeezyMatch training set for OCR
Optical character recognition (OCR) is the process of automatically transcribing text from images. The presence of OCR-induced errors in digitised text is a common problem in the digital humanities. OCR errors are usually due to the misrecognition of characters, such as "h" recognised as "b", or "c" recognised as "o".... -
Dataset
Incunabula Printed Catalogue Dataset: Volumes 1-10 copy of github repository
This dataset includes the github repository used to derive catalogue entries from volumes 1-10 of the "Catalogue of books printed in the 15th century now at the British Museum" (know as BMC). The BMC was published between 1908-2007 and comprises detailed descriptions of the incunabula collection at the British Library....British Library
book history, metadata, catalogues, datasets, incunabula, early printed books, and early printing
-
Dataset
Incunabula Printed Catalogue Dataset: Volumes 1-10
This dataset includes the catalogue entries derived from volumes 1-10 of the "Catalogue of books printed in the 15th century now at the British Museum" (know as BMC). The BMC was published between 1908-2007 and comprises detailed descriptions of the incunabula collection at the British Library. The dataset was created...British Library
datasets, catalogues, early printing, book history, early printed books, metadata, and incunabula
-
Dataset
Incunabula Printed Catalogue Dataset Metadata: Volumes 1-10
This dataset includes the combined catalogue entries derived from volumes 1-10 of the "Catalogue of books printed in the 15th century now at the British Museum" (know as BMC). The BMC was published between 1908-2007 and comprises detailed descriptions of the incunabula collection at the British Library. The dataset was...British Library
datasets, catalogues, early printing, incunabula, early printed books, metadata, and book history
-
Dataset
Living with Machines Zooniverse Participant Survey
Summary results from a survey of contributors to Living with Machines Zooniverse crowdsourcing projects. Responses were received between 24 May and 13 June 2022. We designed the survey so that we could align our reporting with two other audience / participant research groups. Firstly, we used the demographic categories that...British Library
online volunteering, digital participation, citizen science, citizen history, questionnaire, crowdsourcing, survey, and audience research
-
Dataset
Review of Information Studies Courses in Higher Education for Web Archive Provision
This project reviewed the curriculum of information studies postgraduate courses in a number of countries across Europe. The curriculum for Library Studies, Archival Studies, Record Management and Digital Curation/Preservation and Digital Humanities were reviewed to see if there was any reference to web archiving. As these web pages were reviewed... -
Dataset
Dataset mapping the movement of Salkey's correspondents across the globe
Microsoft CSV file dataset created for Kepler to map the movement of Salkey's correspondents across the globeBritish Library
-
Dataset
Gephi Dataset for "Mapping Caribbean Diasporic Networks through Correspondence"
Microsoft CSV file dataset created in Gephi that can be uploaded in Gephi to create the visualisation of the network.British Library
-
Dataset
Spatial network dataset for "Mapping the Caribbean Diaspora through Andrew Salkey"
Microsoft csv. file dataset created for Kepler mapping the geographical movement of correspondentsBritish Library
-
Dataset
Kepler Dataset for "Mapping the Caribbean Diaspora through Andrew Salkey's Correspondence"
Dataset created in Kepler to map the movement of the Caribbean diasporic network present in Andrew Salkey's correspondence files.British Library
-
Dataset
All Data for "Mapping the Caribbean Diaspora through Andrew Salkey's Correspondence"
Microsoft excel of all of the metadata created by the project.British Library
-
Dataset
The Newspaper Press Directory (1846-1920) - enriched and structured version
Mitchell's Newspaper Press Directories contained an almost complete list of newspapers published in England, Wales, Scotland and Ireland. It was published regularly from 1846 onwards and provided a detailed description of the newspaper landscape over time. This version contains a structured, tabular representation of the directories (as CSV or Excel...C. Mitchell and Co. ; British Library