Buscar
Resultados de la búsqueda
-
Conference paper (unpublished)
Contextualizing Victorian Newspapers
Beelen, Kaspar ; Ahnert, Ruth ; Beavan, David ; Coll Ardanuy, Mariona ; Hosseini, Kasra …
-
Dataset
Living Machines atypical animacy dataset
Atypical animacy detection dataset, based on nineteenth-century sentences in English extracted from an open dataset of nineteenth-century books digitized by the British Library (available via https://doi.org/10.21250/db14, British Library Labs, 2014). This dataset contains 598 sentences containing mentions of machines. Each sentence has been annotated according to the animacy and humanness... -
Research report
Data Study Group Final Report: The National Archives, UK: Discovering Topics and Trends in the UK Government Web Archive
The challenge we address in this report is to make steps towards improving search and discovery of resources within this vast archive for future archive users, and how the UKGWA collection could begin to be unlocked for research and experimentation by approaching it as data (i.e. as a dataset at...Beavan, David ; Nanni, Federico
-
Dataset
Ordnance Survey Old / First series England and Wales 1:63360 (georeferenced sheet images)
Map sheet images for the Ordnance Survey Old Series / First Series England and Wales 1:63360, georeferenced and cropped at the neatlike (can be viewed together as a seamless composite). Geotiff format. The original (ungeoreferenced) sheet images can be found at: https://commons.wikimedia.org/wiki/Category:Ordnance_Survey_Old/First_series_England_and_Wales_1:63360_(full_sheets). The sheets were georeferenced by relating the sheet...Vane, Olivia
England, First Series, Old Series, maps, Ordnance Survey, and Wales
-
Dataset
May's British and Irish Press Guide and Advertiser's Handbook & Dictionary etc. (1871-1880)
Newspaper directories produced and published annually in contemporary 19th Britain by advertising agent Frederick May and successors, containing information on newspapers, magazines and periodicals and arranged in alphabetical and sometimes tabular order. Information for each title included price, publisher, office, political and religious leaning.Frederick May & Son ; British Library
-
Dataset
The Newspaper Press Directory (1846-1880)
Newspaper directories produced and published annually in contemporary 19th Britain by advertising agent Charles Mitchell. Newspapers listed primarily listed in alphabetical order of the town the newspaper where the title was published. Information for each title included: features connected with the district such as population and trade; principal towns in...C. Mitchell and Co. ; British Library
-
Journal article
Neural Language Models for Nineteenth-Century English
We present four types of neural language models trained on a large historical dataset of books in English, published between 1760-1900 and comprised of ~5.1 billion tokens. The language model architectures include static (word2vec and fastText) and contextualized models (BERT and Flair). For each architecture, we trained a model instance...Hosseini, Kasra ; Beelen, Kaspar ; Colavizza, Giovanni ; Coll Ardanuy, Mariona
-
Dataset
Dataset for Toponym Resolution in Nineteenth-Century English Newspapers
We present a new dataset for the task of toponym resolution in digitised historical newspapers in English. It consists of 343 annotated articles from newspapers based in four different locations in England (Manchester, Ashton-under-Lyne, Poole and Dorchester), published between 1780 and 1870. The articles have been manually annotated with mentions...Coll Ardanuy, Mariona ; Beavan, David ; Beelen, Kaspar ; Hosseini, Kasra ; Lawrence, Jon …
nineteenth-century English, geographic information retrieval, newspapers, toponym resolution, and dataset
-
Conference paper (published)
When Time Makes Sense: A Historically-Aware Approach to Targeted Sense Disambiguation
As languages evolve historically, making computational approaches sensitive to time can improve performance on specific tasks. In this work, we assess whether applying historical language models and time-aware methods help with determining the correct sense of polysemous words. We outline the task of time-sensitive Targeted Sense Disambiguation (TSD), which aims...Beelen, Kaspar ; Nanni, Federico ; Coll Ardanuy, Mariona ; Hosseini, Kasra ; Tolfo, Giorgia …
-
Research report
Living with Machines Delivery Plan version 1, 2019
Living with Machines is a five-year collaborative project. It aims to generate new perspectives on the effects of the mechanisation of labour on the lives of ordinary people in Britain during the 'long nineteenth century' (c.1780-1918), by developing computational and historical techniques and research questions for working with historical sources....Ahnert, Ruth ; Beavan, David ; Colavizza, Giovanni ; Farquhar, Adam ; Griffin, Emma …
-
Dataset
Dataset for Toponym Resolution in Nineteenth-Century English Newspapers
We present a new dataset (version 2) for the task of toponym resolution in digitised historical newspapers in English. It consists of 455 annotated articles from newspapers based in four different locations in England (Manchester, Ashton-under-Lyne, Poole and Dorchester), published between 1780 and 1870. The articles have been manually annotated...Coll Ardanuy, Mariona ; Beavan, David ; Beelen, Kaspar ; Hosseini, Kasra ; Lawrence, Jon …
nineteenth-century English, dataset, newspapers, toponym resolution, and geographic information retrieval
-
Journal article
MapReader: A Computer Vision Pipeline for the Semantic Exploration of Maps at Scale
We present MapReader, a free, open-source software library written in Python for analyzing large map collections (scanned or born-digital). This library transforms the way historians can use maps by turning extensive, homogeneous map sets into searchable primary sources. MapReader allows users with little or no computer vision expertise to i)...Hosseini, Kasra ; Wilson, Daniel C.S. ; Beelen, Kaspar ; McDonough, Katherine
-
Journal article
Maps of a Nation? The Digitized Ordnance Survey for New Historical Research
Although the Ordnance Survey has itself been the subject of historical research, scholars have not systematically used its maps as primary sources of information. This is partly for disciplinary reasons and partly for the technical reason that high-quality maps have not until recently been available digitally, geo-referenced, and in color....Hosseini, Kasra ; McDonough, Katherine ; van Strien, Daniel ; Vane, Olivia ; Wilson, Daniel C.S.
-
Dataset
Glasgow Courier
Glasgow Courier was a thrice weekly/bi-weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
Kenilworth Advertiser
Kenilworth Advertiser was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
Blandford Weekly News
Blandord Weekly News was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
Bridlington and Quay Gazette
Bridlington and Quay Gazette was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
British Miner and General Newsman
British Miner and General Newsman was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
Swansea Journal and South Wales Liberal
Swansea Journal and South Wales Liberal was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
Birkenhead News
Birkenhead News was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
- « Anterior
- Siguiente »
- 1
- 2
- 3
- 4
- 5
- 6