Index Catalog // British Library

2023

Software

Living-with-machines/MapReader: End of LwM

This release marks the end of the current funding for MapReader during the Living with Machines (LwM) project. @kasra-hosseini @andrewphilipsmith @rwood-97 @kmcdono2 @dcsw2 @kallewesterling @kasparvonbeelen

Hosseini, Kasra ; Wood, Rosie ; Smith, Andy ; McDonough, Katie ; Wilson, Daniel C. S. …

computer vision and maps

2023

Other

Models for MapReader ACM SIGSPATIAL 2023 Geohumanities Workshop paper

Collection of fine-tuned models created during research published in Kasra Hosseini, Daniel C. S. Wilson, Kaspar Beelen, and Katherine McDonough. 2022. MapReader: a computer vision pipeline for the semantic exploration of maps at scale. In Proceedings of the 6th ACM SIGSPATIAL International Workshop on Geospatial Humanities (GeoHumanities '22). Association for...

Hosseini, Kasra ; Beelen, Kaspar ; McDonough, Katherine ; Wilson, Daniel C. S.

computational humanities, computer vision, maps, models, and image classification

2023

Dataset

Diachronic and diatopic word embeddings from newspapers digitised by the British Library (1830-1889): North and South England

Diachronic word embeddings (decade-level) trained with Word2Vec (via Gensim) on different geographic subcorpora of the Heritage Made Digital British and the Living with Machines historical newspaper collections: North England (north.zip) and South England (south.zip). At the moment, for each subcorpus, Word2Vec models are available for each decade in the...

Pedrazzini, Nilo ; McGillivray, Barbara

historical semantics, diachronic embeddings, late modern English, word embeddings, word vectors, word2vec, and diatopic embeddings

2023

Dataset

Decade-level Word2Vec models from automatically transcribed 19th-century newspapers digitised by the British Library (1800-1919)

Word embeddings trained on a 4.2-billion-word corpus of 19th-century British newspapers using Word2Vec and specific parameters. The embeddings are divided into periods of ten years each. Unlike those in this repository, these were not aligned, and OCR errors were not skimmed from the vocabulary. See related GitHub repository for the full documentation:...

Pedrazzini, Nilo

historical semantics, British newspapers, word embeddings, word vectors, word2vec, and Late Modern English

2022

Dataset

MapReader_Data_SIGSPATIAL_2022

Hosseini, Kasra ; Wilson, Daniel C.S. ; Beelen, Kaspar ; McDonough, Katherine

Deep learning, Supervised learning, Computer vision, Historical maps, Digital libraries and archives, and Classification

2022

Learning object

Computer Vision for the Humanities: An Introduction to Deep Learning for Image Classification (Part 2)

This is the second of a two-part lesson introducing deep learning based computer vision methods for humanities research. This lesson digs deeper into the details of training a deep learning based computer vision model. It covers some challenges one may face due to the training data used and the importance...

Strien, Daniel van ; Beelen, Kaspar ; Wevers, Melvin ; Smits, Thomas ; McDonough, Katherine

computer vision

2022

Learning object

Computer Vision for the Humanities: An Introduction to Deep Learning for Image Classification (Part 1)

This is the first of a two-part lesson introducing deep learning based computer vision methods for humanities research. Using a dataset of historical newspaper advertisements and the fastai Python library, the lesson walks through the pipeline of training a computer vision model to perform image classification.

Strien, Daniel van ; Beelen, Kaspar ; Wevers, Melvin ; Smits, Thomas ; McDonough, Katherine

computer vision

2022

Dataset

Diachronic word embeddings from 19th-century newspapers digitised by the British Library (1800-1919)

Word vectors related to the paper "Machines in the media: semantic change in the lexicon of mechanization in 19th-century British newspapers" by Nilo Pedrazzini and Barbara McGillivray (2022). The embeddings were trained on a 4.2-billion-word corpus of 19th-century British newspapers using Word2Vec and specific parameters. The embeddings are divided into...

Pedrazzini, Nilo ; McGillivray, Barbara

historical semantics, word-vectors, late-modern-english, newspapers, diachronic-embeddings, and word2vec

2021

Research report

Data Study Group Final Report: The National Archives, UK: Discovering Topics and Trends in the UK Government Web Archive

The challenge we address in this report is to make steps towards improving search and discovery of resources within this vast archive for future archive users, and how the UKGWA collection could begin to be unlocked for research and experimentation by approaching it as data (i.e. as a dataset at...

Beavan, David ; Nanni, Federico

2021

Conference paper (published)

When Time Makes Sense: A Historically-Aware Approach to Targeted Sense Disambiguation

As languages evolve historically, making computational approaches sensitive to time can improve performance on specific tasks. In this work, we assess whether applying historical language models and time-aware methods helps with determining the correct sense of polysemous words. We outline the task of time-sensitive Targeted Sense Disambiguation (TSD), which aims...

Beelen, Kaspar ; Nanni, Federico ; Coll Ardanuy, Mariona ; Hosseini, Kasra ; Tolfo, Giorgia …
