Search Constraints
Search Results
-
Conference paper (published)
MapReader: a computer vision pipeline for the semantic exploration of maps at scale
We present MapReader, a free, open-source software library written in Python for analyzing large map collections. MapReader allows users with little computer vision expertise to i) retrieve maps via web-servers; ii) preprocess and divide them into patches; iii) annotate patches; iv) train, fine-tune, and evaluate deep neural network models; and...Hosseini, Kasra ; Wilson, Daniel C. S. ; Beelen, Kaspar ; McDonough, Katherine
maps and ordnance survey
-
Research report
Publishing updated version of ‘R for Newspaper Data’
My Living with Machines Digital Residency, which I carried out between May and July 2023, allowed me to update and publish an online book on accessing and analysing newspaper data. The goal of the book is to make available an end-to-end set of instructions and tutorials which would allow researchers,...Ryan, Yann Ciarán
digital humanities, Victorian, historical newspapers, and Living with Machines
-
Research report
Circulations and Entangled History in 19th Century Chile
The Living with Machines Digital Residences have offered our research team a remarkable opportunity to experiment with methods for extracting data from historical newspapers dating back 100 to 150 years. This interdisciplinary project aims to expand the scope of digital humanities and historical research by developing automated techniques for data...Hayward, Jennifer ; Valenzuela, Gillian ; Shakib, Khandokar
digital humanities, historical newspapers, Living with Machines, and Victorian
-
Research report
An Etiquette For Minor Time Travel
A report documenting the Living with Machines Digital Residency project called "An Etiquette for Minor Time Travel" by Robert Sherman. Via an open call and running from to July 2023, six Digital Residencies were funded by the Living with Machines project to support researchers and practitioners devising creative approaches to...Sherman, Robert
art, Living with Machines, Victorian, rail, data visualisation, digital humanities, and poetry
-
Research report
Visualising press politics in the United Kingdom
This project's main goal, as outlined in the initial proposal, was to develop an interactive, open-source web app that visualizes data from the Press Directories dataset alongside historical general election results. In this report, I will delve into the challenges encountered, the interesting findings uncovered, and the potential avenues for...Bonato, Nicolò
historcial newspapers, digital humanities, Victorian, politics, data visualisation, and Living with Machines
-
Research report
The Devils After the Fall
This is a report by Nicola Baldwin of a Digital Residency at Living With Machines, on the dataset: Crowdsourced accidents data from Newspapers, working in line with the LWM aims for Radical Collaboration, New Perspectives, Analysing at Scale. The following document is a personal account of work done, thoughts arising...Baldwin, Nicola
Living with Machines, accidents, Victorian, film, crowdsourcing, newspaper, and digital humanities
-
Research report
UK Railway Archive (AR-UK)
Archive The Railway UK (AR-UK) is a comprehensive digital platform designed to enhance the online archives of the UK rail network. This initiative, developed in collaboration with Living with Machines, is primarily focused on research, historical preservation, and providing public access to railway history. As a centralized resource, it caters...Sheppard, Joanne
digital humanities, Victorian, rail, data visualisation, and Living with Machines
-
Journal article
Working at scale: what do computational methods mean for research using cases, models and collections?
Open access, peer-reviewed article published in Science Museum Group Journal, as part of a double-length special issue for the AHRC TaNC discovery project, 'Congruence Engine'. The article gives a critical overview of how 'scale' operates as a keyword within computational humanities as well as reviewing a number of cognate fields,...Wilson, Daniel C S
machine learning, AI for GLAM, STS, scale, computational humanities, history, and congruence engine
-
Dataset
OCR and crowdsourced annotations, Language of Mechanisation, JSON files
Datasets created through crowdsourcing tasks created on the Zooniverse crowdsourcing platform by the Living with Machines ‘language of mechanisation’ project team. Building on earlier work classifying machines by function, we asked volunteers on Zooniverse 'how did the word x change over time and place?' and presented them with options for... -
Dataset
Language of Mechanisation: annotated historical newspaper articles
Datasets created through crowdsourcing tasks created on the Zooniverse crowdsourcing platform by the Living with Machines ‘language of mechanisation’ project team. Building on earlier work classifying machines by function, we asked volunteers on Zooniverse 'how did the word x change over time and place?' and presented them with options for... -
Software
Living-with-machines/MapReader: End of LwM
This release marks the end of the current funding for MapReader during the Living with Machines (LwM) project. @kasra-hosseini @andrewphilipsmith @rwood-97 @kmcdono2 @dcsw2 @kallewesterling @kasparvonbeelenHosseini, Kasra ; Wood, Rosie ; Smith, Andy ; McDonough, Katie ; Wilson, Daniel C. S. …
computer vision and maps
-
Other
Models for MapReader ACM SIGSPATIAL 2023 Geohumanities Workshop paper
Collection of fine-tuned models created during research published in Kasra Hosseini, Daniel C. S. Wilson, Kaspar Beelen, and Katherine McDonough. 2022. MapReader: a computer vision pipeline for the semantic exploration of maps at scale. In Proceedings of the 6th ACM SIGSPATIAL International Workshop on Geospatial Humanities (GeoHumanities '22). Association for...Hosseini, Kasra ; Beelen, Kaspar ; McDonough, Katherine ; Wilson, Daniel C. S.
computational humanities, computer vision, maps, models, and image classification
-
Learning object
Computer Vision for the Humanities: An Introduction to Deep Learning for Image Classification (Part 2)
This is the second of a two-part lesson introducing deep learning based computer vision methods for humanities research. This lesson digs deeper into the details of training a deep learning based computer vision model. It covers some challenges one may face due to the training data used and the importance...Strien, Daniel van ; Beelen, Kaspar ; Wevers, Melvin ; Smits, Thomas ; McDonough, Katherine
-
Learning object
Computer Vision for the Humanities: An Introduction to Deep Learning for Image Classification (Part 1)
This is the first of a two-part lesson introducing deep learning based computer vision methods for humanities research. Using a dataset of historical newspaper advertisements and the fastai Python library, the lesson walks through the pipeline of training a computer vision model to perform image classification.Strien, Daniel van ; Beelen, Kaspar ; Wevers, Melvin ; Smits, Thomas ; McDonough, Katherine
-
Dataset
Diachronic word embeddings from 19th-century newspapers digitised by the British Library (1800-1919)
Word vectors related to the paper "Machines in the media: semantic change in the lexicon of mechanization in 19th-century British newspapers" by Nilo Pedrazzini and Barbara McGillivray (2022). The embeddings were trained on a 4.2-billion-word corpus of 19th-century British newspapers using Word2Vec and specific parameters. The embeddings are divided into...Pedrazzini, Nilo ; McGillivray, Barbara
historical semantics, word-vectors, late-modern-english, newspapers, diachronic-embeddings, and word2vec
-
Dataset
Decade-level Word2Vec models from automatically transcribed 19th-century newspapers digitised by the British Library (1800-1919)
Word embeddings trained on a 4.2-billion-word corpus of 19th-century British newspapers using Word2Vec and specific parameters. The embeddings are divided into periods of ten years each. Unlike those in this repository, these were not aligned and OCR errors skimmed from the vocabulary. See related GitHub repository for the full documentation:...Pedrazzini, Nilo
historical semantics, British newspapers, word embeddings, word vectors, word2vec, and Late Modern English
-
Dataset
Diachronic and diatopic word embeddings from newspapers digitised by the British Library (1830-1889): North and South England
Diachronic word embeddings (decade-level) trained with Word2Vec (via Gensim) on different geographic subcorpora of the Heritage Made Digital British and the Living with Machines historical newspaper collections: - North England (north.zip) - South England (south.zip) At the moment, for each subcorpus, Word2Vec models are available for each decade in the...Pedrazzini, Nilo ; McGillivray, Barbara
historical semantics, diachronic embeddings, late modern English, word embeddings, word vectors, word2vec, and diatopic embeddings
-
Dataset
Living with Machines Zooniverse Participant Survey
Summary results from a survey of contributors to Living with Machines Zooniverse crowdsourcing projects. Responses were received between 24 May and 13 June 2022. We designed the survey so that we could align our reporting with two other audience / participant research groups. Firstly, we used the demographic categories that...British Library
online volunteering, digital participation, citizen science, citizen history, questionnaire, crowdsourcing, survey, and audience research
-
Book chapter
Hunting for Treasure: Living with Machines and the British Library Newspaper Collection
This chapter discusses the open access digitisation programme undertaken by Living with Machines, exploring the range of constraints that inform digitisation strategies and selection priorities. Because the landscape of digitised newspaper collections is so complex, and research and digitisation processes operate on different timelines, we have focused on opportunities to...Tolfo, Giorgia ; Vane, Olivia ; Beelen, Kaspar ; Hosseini, Kasra ; Lawrence, Jon …
interdisciplinarity, digitised newspaper collections, digital corpus, research workflows, and digitisation strategy