Search Results
-
Research report
Circulations and Entangled History in 19th Century Chile
The Living with Machines Digital Residencies have offered our research team a remarkable opportunity to experiment with methods for extracting data from historical newspapers dating back 100 to 150 years. This interdisciplinary project aims to expand the scope of digital humanities and historical research by developing automated techniques for data...
Hayward, Jennifer ; Valenzuela, Gillian ; Shakib, Khandokar
digital humanities, historical newspapers, Living with Machines, and Victorian
-
Research report
UK Railway Archive (AR-UK)
The UK Railway Archive (AR-UK) is a comprehensive digital platform designed to enhance the online archives of the UK rail network. This initiative, developed in collaboration with Living with Machines, is primarily focused on research, historical preservation, and providing public access to railway history. As a centralized resource, it caters...
Sheppard, Joanne
digital humanities, Victorian, rail, data visualisation, and Living with Machines
-
Research report
The Devils After the Fall
This is a report by Nicola Baldwin of a Digital Residency at Living With Machines, on the dataset: Crowdsourced accidents data from Newspapers, working in line with the LWM aims for Radical Collaboration, New Perspectives, Analysing at Scale. The following document is a personal account of work done, thoughts arising...
Baldwin, Nicola
Living with Machines, accidents, Victorian, film, crowdsourcing, newspaper, and digital humanities
-
Research report
Visualising press politics in the United Kingdom
This project's main goal, as outlined in the initial proposal, was to develop an interactive, open-source web app that visualizes data from the Press Directories dataset alongside historical general election results. In this report, I will delve into the challenges encountered, the interesting findings uncovered, and the potential avenues for...
Bonato, Nicolò
historical newspapers, digital humanities, Victorian, politics, data visualisation, and Living with Machines
-
Research report
An Etiquette For Minor Time Travel
A report documenting the Living with Machines Digital Residency project called "An Etiquette for Minor Time Travel" by Robert Sherman. Via an open call and running from to July 2023, six Digital Residencies were funded by the Living with Machines project to support researchers and practitioners devising creative approaches to...
Sherman, Robert
art, Living with Machines, Victorian, rail, data visualisation, digital humanities, and poetry
-
Research report
Publishing updated version of ‘R for Newspaper Data’
My Living with Machines Digital Residency, which I carried out between May and July 2023, allowed me to update and publish an online book on accessing and analysing newspaper data. The goal of the book is to make available an end-to-end set of instructions and tutorials which would allow researchers,...
Ryan, Yann Ciarán
digital humanities, Victorian, historical newspapers, and Living with Machines
-
Living with Machines
User Collection
-
Dataset
Living Machines atypical animacy dataset
Atypical animacy detection dataset, based on nineteenth-century sentences in English extracted from an open dataset of nineteenth-century books digitized by the British Library (available via https://doi.org/10.21250/db14, British Library Labs, 2014). This dataset contains 598 sentences containing mentions of machines. Each sentence has been annotated according to the animacy and humanness...
-
Dataset
Living with Machines alpha and beta Zooniverse 'accident' task data
Data created through crowdsourcing tasks hosted on the Zooniverse platform. Members of the public were asked to look at a selection of articles from 19th century newspapers that mentioned machines and decide if they described an industrial accident. A further task asked participants to transcribe personal, organisational and place names...
Zooniverse volunteers
crowdsourcing, digital history, citizen history, Living with Machines, newspapers, and digital humanities
-
Conference paper (published)
DeezyMatch: A Flexible Deep Learning Approach to Fuzzy String Matching
We present DeezyMatch, a free, open-source software library written in Python for fuzzy string matching and candidate ranking. Its pair classifier supports various deep neural network architectures for training new classifiers and for fine-tuning a pretrained model, which paves the way for transfer learning in fuzzy string matching. This approach...
Hosseini, Kasra ; Nanni, Federico ; Coll Ardanuy, Mariona
Natural Language Processing, string matching, toponym matching, machine learning, and digital humanities
-
Conference paper (unpublished)
A Deep Learning Approach to Geographical Candidate Selection through Toponym Matching
Recognizing toponyms and resolving them to their real-world referents is required for providing advanced semantic access to textual data. This process is often hindered by the high degree of variation in toponyms. Candidate selection is the task of identifying the potential entities that can be referred to by a toponym...
-
Conference paper (unpublished)
Assessing the Impact of OCR Quality on Downstream NLP Tasks
A growing volume of heritage data is being digitized and made available as text via optical character recognition (OCR). Scholars and libraries are increasingly using OCR-generated text for retrieval and analysis. However, the process of creating text through OCR introduces varying degrees of error to the text. The impact of...
-
Abstract
Historic machines from 'prams' to 'Parliament': new avenues for collaborative linguistic research
Research in computational linguistics has made successful attempts at modelling word meaning at scale, but much remains to be done to put these computational models to the test of historical scholarship (see e.g. Beelen et al. 2021). More importantly, a lot of computational research looks at texts in a historical...
Ridge, Mia ; Tolfo, Giorgia ; Westerling, Kalle ; Pedrazzini, Nilo ; McGillivray, Barbara
crowdsourcing, computational linguistics, and digital humanities
-
Dataset
DeezyMatch training set for OCR
Optical character recognition (OCR) is the process of automatically transcribing text from images. The presence of OCR-induced errors in digitised text is a common problem in the digital humanities. OCR errors are usually due to the misrecognition of characters, such as "h" recognised as "b", or "c" recognised as "o"....
-
Book
Collaborative Historical Research in the Age of Big Data
Living with Machines is the largest digital humanities project ever funded in the UK. The project brought together a team of twenty-three researchers to leverage more than twenty years' worth of digitisation projects in order to deepen our understanding of the impact of mechanisation on nineteenth-century Britain. In contrast to many...
Ahnert, Ruth ; Griffin, Emma ; Ridge, Mia ; Tolfo, Giorgia
digital humanities, British history, multidisciplinarity, digital history, and nineteenth century
-
Conference paper (published)
Entities, Dates, and Languages: Zero-Shot on Historical Texts with T0
In this work, we explore whether the recently demonstrated zero-shot abilities of the T0 model extend to Named Entity Recognition for out-of-distribution languages and time periods. Using a historical newspaper corpus in 3 languages as test-bed, we use prompts to extract possible named entities. Our results show that a naive...
De Toni, Francesco ; Akiki, Christopher ; De La Rosa, Javier ; Fourrier, Clémentine ; Manjavacas, Enrique …
-
Journal article
Under the Impression: Multispectral Imaging of Lord Frederick Campbell Charter XXI 5
Lord Frederick Campbell Charter 5 is the only surviving English document that still has an authentic, legible, pre-Conquest seal attached to it. The text purports to be a writ of Edward the Confessor (1003x5–1066) granting a slew of rights to Christ Church Cathedral, Canterbury. We examined the writ using multispectral...
Hudson, Alison ; Duffy, Christina
multispectral imaging, digital humanities, conservation, seals, early medieval history, writs, and Norman Conquest
-
Dataset
StopsGB: Structured Timeline of Passenger Stations in Great Britain
Michael Quick's book _Railway Passenger Stations in Great Britain: a Chronology_ offers a uniquely rich and detailed account of Britain's changing railway infrastructure. Its listing of over 12,000 stations allows us to reconstruct the coming of rail at both micro- and macro-scales. However, being published originally as a book (and...
-
Journal article
Can I believe what I see? Data visualisation and trust in the humanities
Questions of trust are increasingly important in relation to data and its use. The authors focus on humanities data and its visualisation, through analysis of their own recent projects with museums, archives and libraries internationally. Their account connects the specifics of hands-on digital humanities work to larger epistemological questions. They...
Boyd Davis, Stephen ; Vane, Olivia ; Kräutli, Florian
scepticism, critical design, interdisciplinarity, ethics, digital humanities, interrogability, data visualisation, and GLAM
-
Journal article
Europeana Newspapers: searching digitized historical newspapers from 23 European countries
Europeana Newspapers is a European Commission-funded project which is refining, aggregating and giving researchers online access to historical newspaper content from 23 European libraries. It also offers free, open source tools which individual libraries can use to assess refinement quality and metadata standards in relation to their own digital newspaper...
Willems, Marieke ; Atanassova, Rossitza
digitised historic newspapers, 19th-century newspapers, Europeana Newspapers, and digital humanities