Search Constraints
Search Results
-
Journal article
A Gold Girdle Book and its Connection with Anne Boleyn
The miniature prayer book with the shelfmark Stowe MS. 956 has long attracted attention because of a story associating it with Anne Boleyn. According to an oft-repeated account, this tiny girdle book with a gold metalwork binding was handed by Anne to one of her maids of honour on the...Jackson, Eleanor
-
Journal article
Italian Futurist Books (1909-1944) at the British Library
The Futurist book was instrumental in the circulation of Futurist ideas and represents a very experimental phase in book production, paving the way for the book object, the artist’s book, advertising and design. The purpose of this article is to produce a survey of the Italian Futurist collections held at...Mirabella, Valentina
-
Journal article
The London Stage 1660-1800: A Short History, Retrospective Anatomy, and Projected Future
The London Stage, 1660-1800, a day-by-day performance calendar spanning 140 years, was for its time a magnificent achievement published in eleven volumes (1960-1968 [recte 1970]) running to 1058 pages of introductory matter and 7182 pages of text, plus 672 pages of volume indexes. A one-volume cumulative index compiled from scratch...Hume, Robert D.
-
Journal article
The New Media Writing Prize Special Collection
This article introduces the New Media Writing Prize (NMWP) special collection (https://www.webarchive.org.uk/en/ukwa/collection/2912) created on behalf of the six UK Legal Deposit Libraries and hosted by the UK Web Archive. It is divided into two sections, presenting the perspectives of the archivists and the organizers of the prize respectively. The first...Rossi, Giulia Carla ; Pyke, Tegan ; Pope, James ; Skains, R. Lyle ; Wisdom, Stella
-
Dataset
OCR and crowdsourced annotations, Language of Mechanisation, JSON files
Datasets created through crowdsourcing tasks created on the Zooniverse crowdsourcing platform by the Living with Machines ‘language of mechanisation’ project team. Building on earlier work classifying machines by function, we asked volunteers on Zooniverse 'how did the word x change over time and place?' and presented them with options for... -
-
-
Book chapter
Thomas Grenville (1755-1846) and his books
Taylor, Barry
-
-
-
Conference paper (published)
Los libros españoles del Dr. William Bates (1625-1699) en la Dr. Williams’s Library de Londres
Taylor, Barry
-
Journal article
Libros religiosos coloniales de la British Library: libros impresos en México, Perú, Chile, Cuba, Ecuador y Guatemala, 1543/4-1800
El propósito del presente artículo no es otro que poner en manos del lector una lista clasificada de libros sobre religión impresos en Hispanoamérica en los siglos XVI, XVII y XVIII y actualmente conservados en la British Library. Se excluyen : pleitos en los que participan religiosos ; documentos sobre...Taylor, Barry ; West, Geoffrey
-
Book
Libraries within the Library: The Origins of the British Library’s Printed Collections
Dispersed along the shelves of the British Library today are many volumes that once stood side by side in private libraries. These essays explore some of the most important printed collections which were brought together to form the British Museum Library and cast new light on the individuals whose personal...Mandelbrote, Giles ; Taylor, Barry
-
Book
Foreign-Language Printing in London 1500-1900
The fourteen essays in this volume represent the first systematic attempt to document and to analyse the tradition of foreign language printing in London during the period 1500 to 1900. The surveys and case studies use a variety of approaches to document and describe this particular aspect of London printing...Taylor, Barry
-
Journal article
An old Spanish translation from the 'Flores Sancti Bernardi' in British Library ADD. MS. 14040, ff. 111V-112V
ALTHOUGH written in Castilian throughout, MS. Add. 14040 has a number of connections with the Catalan-speaking Kingdom of Aragon. The first text (ff. 1-85V) is a translation of Ramon Lull's 'Libre del gentil e los tres savis' made in Valencia by 'Goncalo Sanches de Useda'; a colophon gives the date...Taylor, Barry
-
Journal article
Ramon Miquel y Planas and his Biblioteca catalana: medievalism, publishing and bibliophilia in early twentieth-century Barcelona
RAMON Miquel y Planas (1874-1950) was the complete bookman. He published on the history of texts, binding, printing and bookplates; folklore and criticism of contemporary literature; preceptive works on publishing; and on the normalization of Catalan. His most enduring works, however, are his editions of medieval and Renaissance texts in...Taylor, Barry
-
Journal article
An old Spanish tale from Add. Ms. 14040, flf. 113r-114v: 'Exenplo que acaesçio en tierra de Damasco a la buena duenna climeçia que avia veynte annos e la mecia en cuna'
THE main body of Add. MS. 14040 contains three translations into Castilian: Ramon Lull's 'Libre del gentil e los tres savis' (ff. 1-85V) in a version by Gonçalo Sanches de Useda and his 'Coment del dictat' (ff. 86r-iiir) from the Catalan, and an extract from the Flores Sancti Bernardi, probably...Taylor, Barry
-
Presentation
Improving the Discoverability of Practice Research Processes, Outputs, and Data
CHOSN event: Open access policies and practice research in GLAMs. 5 March 2024 Tuesday 2pm GMT Holly Ranger, Research Data Management Officer at the University of Westminster. Our guest speaker addresses topics of open access policy environment for Galleries-Libraries-Archives-Museums (GLAMs) and diverse research activities & outputs. Holly Ranger presents on...Ranger, Holly
-
Presentation
Available, Accessible, Open
CHOSN event: Open access policies and practice research in GLAMs. 5 March 2024 Tuesday 2pm GMT Josie Fraser, Head of Digital Policy at the National Lottery Heritage Fund Our guest speaker addresses topics of open access policy environment for Galleries-Libraries-Archives-Museums (GLAMs) and diverse research activities & outputs. Josie Fraser provides...Fraser, Josie
-
Cultural Heritage Open Scholarship Network (CHOSN)
User Collection -
Journal article
Collecting Revolution: George Thomason and the ‘Thomason Tracts’
The approximately 24,000 pamphlets, manuscripts and newspapers collected by the London bookseller George Thomason are an invaluable source for the study of the political events of 1640 to 1663. This introduction surveys the articles, based on a conference held at the British Library, which are brought together in eBLJ 2023.Peacey, Jason
-
Other
WARCnet survey: COVID-19 Web Collections
This survey aimed to explore the methodologies and strategies employed by heritage institutions in collecting and documenting COVID-19-related developments on the Web. With a primary focus on European efforts, the survey sought to understand the scope and collection tactics of COVID-19 Web archives. Conducted under the auspices of the WARCnet...Bingham, Nicola ; de Wild, Karin ; Nyvang, Caroline ; Geeraert, Friedel
-
Journal article
‘Honest George’: George Thomason and London during the Civil War and Revolution
Part of the fascination with Thomason is that he was more than merely a prominent bookseller who collected a vast collection of civil war pamphlets and newspapers. He was also an active participant in public life, in terms of the workings of the Stationers’ Company and in terms of political...Lindley, Keith ; Peacey, Jason
-
Journal article
George Thomason and London in the 1650s
Thomason’s involvement in public politics, which had been extensive during the 1640s, brought him considerable personal trouble following the execution of Charles I, an event that he clearly opposed. Like many others who had been active Presbyterians before 1649, he became an opponent of the republican regime, and this chapter...Vernon, Elliot
-
Journal article
John Hammond and the Explosion of Print in 1641: Commercial and Political Opportunities
One of the great values of Thomason’s collection of civil war tracts and newsbooks is the opportunity that it affords for analysing the nature of the print trade during a key phase of the so-called ‘print revolution’. Given the so-called ‘explosion’ of cheap print that accompanied the descent into civil...Braddick, Michael J.
-
Journal article
Search and Seize: Partisan Publishers and Press Controls in Thomason’s London
The Thomason collection is recognised as being vital for exploring the dramatic developments in print culture that accompanied the English Revolution, not least those that were made possible by the collapse of press censorship in 1641. Less widely appreciated is that it also sheds valuable light upon the attempts that...Como, David R.
-
Journal article
The Thomason Tracts and Presbyterian Mobilization
This chapter uses the Thomason Tracts as a collection, as well as the partisan attitudes of Thomason himself, to assess the use of print in the bitter conflicts that divided parliamentarians in the 1640s. It compares the stress on division revealed in printed accounts of two particularly fraught episodes in...Hughes, Anne
-
Journal article
The Politics and Meaning of Thomason’s Tracts
This article takes as its starting point the reputation of Thomason's collection as royalist in orientation. It examines whether Thomason’s collection reflects 1640s and 1650s press output more broadly - offering a quantitative account of this - and whether the items he chose to purchase, and those he did not,...Raymond, Joad
-
Journal article
Milton’s Sonnet XIV and the poetry of George Thomason
It has long been recognised that Thomason was well connected, and that his friends included men like John Milton. This essay uses the sonnet that Milton wrote in honour of Thomason’s wife as the springboard for a discussion of a neglected aspect of the Thomason tracts: its poetry. It thus...Nevitt, Marcus
Thomason Tracts, George Thomason, Catharine Thomason, and John Milton
-
Journal article
Scattered about the Streets: George Thomason’s Annotations and Ephemeral Print during the English Revolution
Thomason is rightly famous for his tendency to annotate individual pamphlets, and his notes have long been exploited by scholars in order to trace his connections with various authors, to contextualise individual items, and to enhance our appreciation of writers and the debates in which they participated. This chapter subjects...Peacey, Jason
-
Journal article
From The Queen’s College to Montagu House: The History of the Thomason Tracts after the Restoration
Although the Thomason collection is rightly regarded as one of the treasures of the British Library, its survival was by no means inevitable. This chapter revisits the convoluted history of its fortunes after Thomason ceased collecting in 1661, shedding new light upon his own hopes and expectations regarding its fate,...Stoker, David
-
Journal article
Deconstruction and ‘Re-Volumization’: The Thomason Collection in the Past, Present, and Future
The Thomason Tracts that arrived at the British Museum as the gift of George III were in a rigorous chronological order, which was mirrored by Thomason’s own twelve-volume manuscript catalogue. Though Thomason boasted that by means of the catalogue even a single sheet could be found ‘instantly’, even more important...Mendle, Michael
-
Journal article
Chinese plates fit for an Acehnese queen
All over Southeast Asia is found a particular type of coarse Chinese export porcelain traditionally known as ‘Swatow’ ware but now more accurately identified as originating from Zhangzhou, dating from the late Ming period, from the end of the 16th to the early 17th centuries. One characteristic type of large...Gallop, Annabel Teh
Acehnese ninefold seal, Zhangzhou, Cap Sikureueng, Chinese porcelain, Chinese plates, Aceh, and Swatow
-
Book chapter
The Satow Collection of Japanese Books in the British Library: its history and significance
The aim of this article is to outline the history and importance of the collection of Japanese books which were acquired by the British Museum from the diplomat and scholar Sir Ernest Mason Satow (1843-1929) and which passed to the stewardship of the British Library on its creation in 1973....Todd, Hamish
-
Research report
Final report of the Doctoral Fellowship titled Creative Networks and Authors’ Houses: Links Between The British Library and National Trust
This report outlines the scope, findings and experience of the British Library and National Trust Doctoral Fellowship called Creative Networks and Authors’ Houses. Josip Martincic undertook a three-month placement to explore the networks and homes of George Bernard Shaw, Virginia Woolf and Rudyard Kipling. The overarching purpose of the fellowship...Martinčić, Josip
George Bernard Shaw, collaborative doctoral fellowship, British Library, National Trust, Virginia Woolf, and Rudyard Kipling
-
Living with Machines
User Collection -
Dataset
Frederick May's London Press Dictionary and Advertiser's Handbook (1883-1911)
Newspaper directories produced and published annually in contemporary 19th Britain by advertising agent Frederick May and successors, containing information on newspapers, magazines and periodicals and arranged in alphabetical and sometimes tabular order. Information for each title included price publisher office political and religious leaningFrederick May & Son ; British Library
-
Dataset
May's British and Irish Press Guide and Advertiser's Handbook & Dictionary etc. (1871-1880)
Newspaper directories produced and published annually in contemporary 19th Britain by advertising agent Frederick May and successors, containing information on newspapers, magazines and periodicals and arranged in alphabetical and sometimes tabular order. Information for each title included price, publisher, office, political and religious leaning.Frederick May & Son ; British Library
-
Journal article
Neural Language Models for Nineteenth-Century English
We present four types of neural language models trained on a large historical dataset of books in English, published between 1760-1900 and comprised of ~5.1 billion tokens. The language model architectures include static (word2vec and fastText) and contextualized models (BERT and Flair). For each architecture, we trained a model instance...Hosseini, Kasra ; Beelen, Kaspar ; Colavizza, Giovanni ; Coll Ardanuy, Mariona
-
Journal article
MapReader: A Computer Vision Pipeline for the Semantic Exploration of Maps at Scale
We present MapReader, a free, open-source software library written in Python for analyzing large map collections (scanned or born-digital). This library transforms the way historians can use maps by turning extensive, homogeneous map sets into searchable primary sources. MapReader allows users with little or no computer vision expertise to i)...Hosseini, Kasra ; Wilson, Daniel C.S. ; Beelen, Kaspar ; McDonough, Katherine
-
Journal article
Maps of a Nation? The Digitized Ordnance Survey for New Historical Research
Although the Ordnance Survey has itself been the subject of historical research, scholars have not systematically used its maps as primary sources of information. This is partly for disciplinary reasons and partly for the technical reason that high-quality maps have not until recently been available digitally, geo-referenced, and in color....Hosseini, Kasra ; McDonough, Katherine ; van Strien, Daniel ; Vane, Olivia ; Wilson, Daniel C.S.
-
Journal article
Babbage among the insurers: Big 19th-century data and the public interest
This article examines life assurance and the politics of ‘big data’ in mid-19th-century Britain. The datasets generated by life assurance companies were vast archives of information about human longevity. Actuaries distilled these archives into mortality tables – immensely valuable tools for predicting mortality and so pricing risk. The status of...Wilson, Daniel C.S.
big data, Thomas Rowe Edmonds, Charles Babbage, public interest, and insurance
-
Dataset
Language of Mechanisation: annotated historical newspaper articles
Datasets created through crowdsourcing tasks created on the Zooniverse crowdsourcing platform by the Living with Machines ‘language of mechanisation’ project team. Building on earlier work classifying machines by function, we asked volunteers on Zooniverse 'how did the word x change over time and place?' and presented them with options for... -
Journal article
Design Choices for Productive, Secure, Data-Intensive Research at Scale in the Cloud
We present a policy and process framework for secure environments for productive data science research projects at scale, by combining prevailing data security threat and risk profiles into five sensitivity tiers, and, at each tier, specifying recommended policies for data classification, data ingress, software ingress, data egress, user access, user...Arenas, Diego ; Atkins, Jon ; Austin, Claire ; Beavan, David ; Cabrejas Egea, Alvaro …
-
Journal article
A Dataset for Toponym Resolution in Nineteenth-Century English Newspapers
We present a new dataset for the task of toponym resolution in digitized historical newspapers in English. It consists of 343 annotated articles from newspapers based in four different locations in England (Manchester, Ashton-under-Lyne, Poole and Dorchester), published between 1780 and 1870. The articles have been manually annotated with mentions... -
Dataset
Living Machines atypical animacy dataset
Atypical animacy detection dataset, based on nineteenth-century sentences in English extracted from an open dataset of nineteenth-century books digitized by the British Library (available via https://doi.org/10.21250/db14, British Library Labs, 2014). This dataset contains 598 sentences containing mentions of machines. Each sentence has been annotated according to the animacy and humanness... -
Dataset
Stretford and Urmston Examiner
Stretford and Urmston Examiner. (1879 - 1880) was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
Living with Machines alpha and beta Zooniverse 'accident' task data
Data created through crowdsourcing tasks hosted on the Zooniverse platform. Members of the public were asked to look at a selection of articles from 19th century newspapers that mentioned machines and decide if they described an industrial accident. A further task asked participants to transcribe personal, organisational and place names...Zooniverse volunteers
crowdsourcing, digital history, citizen history, Living with Machines, newspapers, and digital humanities
-
Dataset
The Newspaper Press Directory (1881-1920)
Newspaper directories produced and published annually in contemporary 19th Britain by advertising agent Charles Mitchell. Newspapers listed primarily listed in alphabetical order of the town the newspaper where the title was published. Information for each title included: features connected with the district such as population and trade; principal towns in...C. Mitchell and Co. ; British Library
-
Dataset
The Newspaper Press Directory (1846-1880)
Newspaper directories produced and published annually in contemporary 19th Britain by advertising agent Charles Mitchell. Newspapers listed primarily listed in alphabetical order of the town the newspaper where the title was published. Information for each title included: features connected with the district such as population and trade; principal towns in...C. Mitchell and Co. ; British Library
-
Dataset
The Newspaper Press Directory (1846-1920) - enriched and structured version
Mitchell's Newspaper Press Directories contained an almost complete list of newspapers published in England, Wales, Scotland and Ireland. It was published regularly from 1846 onwards and provided a detailed description of the newspaper landscape over time. This version contains a structured, tabular representation of the directories (as CSV or Excel...C. Mitchell and Co. ; British Library
-
Software
Living-with-machines/MapReader: End of LwM
This release marks the end of the current funding for MapReader during the Living with Machines (LwM) project. @kasra-hosseini @andrewphilipsmith @rwood-97 @kmcdono2 @dcsw2 @kallewesterling @kasparvonbeelenHosseini, Kasra ; Wood, Rosie ; Smith, Andy ; McDonough, Katie ; Wilson, Daniel C. S. …
computer vision and maps
-
Dataset
Diachronic word embeddings from 19th-century newspapers digitised by the British Library (1800-1919)
Word vectors related to the paper "Machines in the media: semantic change in the lexicon of mechanization in 19th-century British newspapers" by Nilo Pedrazzini and Barbara McGillivray (2022). The embeddings were trained on a 4.2-billion-word corpus of 19th-century British newspapers using Word2Vec and specific parameters. The embeddings are divided into...Pedrazzini, Nilo ; McGillivray, Barbara
historical semantics, word-vectors, late-modern-english, newspapers, diachronic-embeddings, and word2vec
-
Dataset
Dataset for Toponym Resolution in Nineteenth-Century English Newspapers
We present a new dataset (version 2) for the task of toponym resolution in digitised historical newspapers in English. It consists of 455 annotated articles from newspapers based in four different locations in England (Manchester, Ashton-under-Lyne, Poole and Dorchester), published between 1780 and 1870. The articles have been manually annotated...Coll Ardanuy, Mariona ; Beavan, David ; Beelen, Kaspar ; Hosseini, Kasra ; Lawrence, Jon …
nineteenth-century English, dataset, newspapers, toponym resolution, and geographic information retrieval
-
Dataset
Living with Machines Zooniverse Participant Survey
Summary results from a survey of contributors to Living with Machines Zooniverse crowdsourcing projects. Responses were received between 24 May and 13 June 2022. We designed the survey so that we could align our reporting with two other audience / participant research groups. Firstly, we used the demographic categories that...British Library
online volunteering, digital participation, citizen science, citizen history, questionnaire, crowdsourcing, survey, and audience research
-
Dataset
Decade-level Word2Vec models from automatically transcribed 19th-century newspapers digitised by the British Library (1800-1919)
Word embeddings trained on a 4.2-billion-word corpus of 19th-century British newspapers using Word2Vec and specific parameters. The embeddings are divided into periods of ten years each. Unlike those in this repository, these were not aligned and OCR errors skimmed from the vocabulary. See related GitHub repository for the full documentation:...Pedrazzini, Nilo
historical semantics, British newspapers, word embeddings, word vectors, word2vec, and Late Modern English
-
Dataset
Diachronic and diatopic word embeddings from newspapers digitised by the British Library (1830-1889): North and South England
Diachronic word embeddings (decade-level) trained with Word2Vec (via Gensim) on different geographic subcorpora of the Heritage Made Digital British and the Living with Machines historical newspaper collections: - North England (north.zip) - South England (south.zip) At the moment, for each subcorpus, Word2Vec models are available for each decade in the...Pedrazzini, Nilo ; McGillivray, Barbara
historical semantics, diachronic embeddings, late modern English, word embeddings, word vectors, word2vec, and diatopic embeddings
-
Conference paper (published)
Resolving places, past and present: toponym resolution in historical British newspapers using multiple resources
Newspapers and their metadata are richly geographical, not only in their distribution but also their content. Attending to these spatial features is a prerequisite in newspaper research. Following other projects to have geoparsed place names in newspapers, we describe our approach to linking historical geospatial information in text to real-world...Coll Ardanuy, Mariona ; McDonough, Katherine ; Krause, Amrey ; Wilson, Daniel C.S. ; Hosseini, Kasra …
-
Dataset
Ordnance Survey Old / First series England and Wales 1:63360 (georeferenced sheet images)
Map sheet images for the Ordnance Survey Old Series / First Series England and Wales 1:63360, georeferenced and cropped at the neatlike (can be viewed together as a seamless composite). Geotiff format. The original (ungeoreferenced) sheet images can be found at: https://commons.wikimedia.org/wiki/Category:Ordnance_Survey_Old/First_series_England_and_Wales_1:63360_(full_sheets). The sheets were georeferenced by relating the sheet...Vane, Olivia
England, First Series, Old Series, maps, Ordnance Survey, and Wales
-
Dataset
Widnes Examiner
Widnes Examiner (1876-1920) was a weekly newspaper which has been digitised by the British Library for the Living with Machines project.British Library
-
Conference paper (published)
DeezyMatch: A Flexible Deep Learning Approach to Fuzzy String Matching
We present DeezyMatch, a free, open-source software library written in Python for fuzzy string matching and candidate ranking. Its pair classifier supports various deep neural network architectures for training new classifiers and for fine-tuning a pretrained model, which paves the way for transfer learning in fuzzy string matching. This approach...Hosseini, Kasra ; Nanni, Federico ; Coll Ardanuy, Mariona
Natural Language Processing, string matching, toponym matching, machine learning, and digital humanities
-
Dataset
Weymouth Telegram
Weymouth Telegram (1860 - 1901) was a weekly newspaper which has been digitised by the British Library for the Living with Machines project.British Library
-
Dataset
The Runcorn Examiner
The Runcorn Examiner (1870-1954) was a weekly newspaper and years 1870-1920 have been digitised by the British Library for the Living with Machines project.British Library
-
Dataset
Barrow Herald and Furness Advertiser
Barrow Herald and Furness Advertiser. (1863 - 1914) was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
Cradley Heath & Stourbridge Observer
Cradley Heath & Stourbridge Observer. (1864 - 1888) was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
Denton and Haughton Examiner
Denton and Haughton Examiner was a weekly newspaper which has been digitised by the British Library for the Living with Machines project. Variant titles are 1873-74 The Denton, Haughton, & District Weekly News. 1874-75 Denton & Haughton Weekly News, and Audenshaw, Hooley Hill, and Dukinfield Advertiser, 1875-78 Denton Examiner, Audenshaw,...British Library
-
Dataset
Dorset County Express and Agricultural Gazette
Dorset County Express and Agricultural Gazette was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
Central Glamorgan Gazette
Central Glamorgan Gazette was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
Stockton Herald, South Durham and Cleveland Advertiser
Stockton Herald, South Durham and Cleveland Advertiser. (1858 - 1918) was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
Liverpool Weekly Courier
Liverpool Weekly Courier was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
Swansea and Glamorgan Herald, and South Wales Free Press
Swansea and Glamorgan Herald, and South Wales Free Press. (1847 - 1890) was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Conference paper (unpublished)
Contextualizing Victorian Newspapers
Beelen, Kaspar ; Ahnert, Ruth ; Beavan, David ; Coll Ardanuy, Mariona ; Hosseini, Kasra …
-
Conference paper (unpublished)
A Deep Learning Approach to Geographical Candidate Selection through Toponym Matching
Recognizing toponyms and resolving them to their real-world referents is required for providing advanced semantic access to textual data. This process is often hindered by the high degree of variation in toponyms. Candidate selection is the task of identifying the potential entities that can be referred to by a toponym... -
Conference paper (published)
Living Machines: A study of atypical animacy
This paper proposes a new approach to animacy detection, the task of determining whether an entity is represented as animate in a text. In particular, this work is focused on atypical animacy and examines the scenario in which typically inanimate objects, specifically machines, are given animate attributes. To address it,...Coll Ardanuy, Mariona ; Nanni, Federico ; Beelen, Kaspar ; Hosseini, Kasra ; Ahnert, Ruth …
nineteenth-century English, living machines, BERT, and animacy
-
-
Conference paper (unpublished)
Defoe: A Spark-Based Toolbox for Analysing Digital Historical Textual Data
This work presents defoe, a new scalable and portable digital eScience toolbox that enables historical research. It allows for running text mining queries across large datasets, such as historical newspapers and books in parallel via Apache Spark. It handles queries against collections that comprise several XML schemas and physical representations.... -
Dataset
Northern Weekly Gazette
Northern Weekly Gazette was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Conference paper (unpublished)
Assessing the Impact of OCR Quality on Downstream NLP Tasks
A growing volume of heritage data is being digitized and made available as text via optical character recognition (OCR). Scholars and libraries are increasingly using OCR-generated text for retrieval and analysis. However, the process of creating text through OCR introduces varying degrees of error to the text. The impact of... -
Dataset
Warrington Examiner
Warrington Examiner (1869-1901) was a weekly newspaper which has been digitised by the British Library for the Living with Machines project.British Library
-
Abstract
Using smart annotations to map the geography of newspapers
Geographic information is a key component in the description of collection objects, and yet its format is often unsuited for use with methods of geographic analysis. Catalogue entries are often inconsistent, in plain text, and without geographic coordinates (much less coordinates linked to authority records). Georesolution of the relevant fields...Ryan, Yann ; Coll Ardanuy, Mariona ; van Strien, Daniel ; Hosseini, Kasra ; Beelen, Kaspar …
-
Other
Models for MapReader ACM SIGSPATIAL 2023 Geohumanities Workshop paper
Collection of fine-tuned models created during research published in Kasra Hosseini, Daniel C. S. Wilson, Kaspar Beelen, and Katherine McDonough. 2022. MapReader: a computer vision pipeline for the semantic exploration of maps at scale. In Proceedings of the 6th ACM SIGSPATIAL International Workshop on Geospatial Humanities (GeoHumanities '22). Association for...Hosseini, Kasra ; Beelen, Kaspar ; McDonough, Katherine ; Wilson, Daniel C. S.
computational humanities, computer vision, maps, models, and image classification
-
Learning object
Computer Vision for the Humanities: An Introduction to Deep Learning for Image Classification (Part 1)
This is the first of a two-part lesson introducing deep learning based computer vision methods for humanities research. Using a dataset of historical newspaper advertisements and the fastai Python library, the lesson walks through the pipeline of training a computer vision model to perform image classification.Strien, Daniel van ; Beelen, Kaspar ; Wevers, Melvin ; Smits, Thomas ; McDonough, Katherine