Buscar
Resultados de la búsqueda
-
Dataset
Cradley Heath & Stourbridge Observer
Cradley Heath & Stourbridge Observer. (1864 - 1888) was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
Barrow Herald and Furness Advertiser
Barrow Herald and Furness Advertiser. (1863 - 1914) was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
The Runcorn Examiner
The Runcorn Examiner (1870-1954) was a weekly newspaper and years 1870-1920 have been digitised by the British Library for the Living with Machines project.British Library
-
Dataset
Weymouth Telegram
Weymouth Telegram (1860 - 1901) was a weekly newspaper which has been digitised by the British Library for the Living with Machines project.British Library
-
Conference paper (published)
DeezyMatch: A Flexible Deep Learning Approach to Fuzzy String Matching
We present DeezyMatch, a free, open-source software library written in Python for fuzzy string matching and candidate ranking. Its pair classifier supports various deep neural network architectures for training new classifiers and for fine-tuning a pretrained model, which paves the way for transfer learning in fuzzy string matching. This approach...Hosseini, Kasra ; Nanni, Federico ; Coll Ardanuy, Mariona
Natural Language Processing, string matching, toponym matching, machine learning, and digital humanities
-
Conference paper (published)
Resolving places, past and present: toponym resolution in historical British newspapers using multiple resources
Newspapers and their metadata are richly geographical, not only in their distribution but also their content. Attending to these spatial features is a prerequisite in newspaper research. Following other projects to have geoparsed place names in newspapers, we describe our approach to linking historical geospatial information in text to real-world...Coll Ardanuy, Mariona ; McDonough, Katherine ; Krause, Amrey ; Wilson, Daniel C.S. ; Hosseini, Kasra …
-
Dataset
Ordnance Survey Old / First series England and Wales 1:63360 (georeferenced sheet images)
Map sheet images for the Ordnance Survey Old Series / First Series England and Wales 1:63360, georeferenced and cropped at the neatlike (can be viewed together as a seamless composite). Geotiff format. The original (ungeoreferenced) sheet images can be found at: https://commons.wikimedia.org/wiki/Category:Ordnance_Survey_Old/First_series_England_and_Wales_1:63360_(full_sheets). The sheets were georeferenced by relating the sheet...Vane, Olivia
England, First Series, Old Series, maps, Ordnance Survey, and Wales
-
Dataset
Widnes Examiner
Widnes Examiner (1876-1920) was a weekly newspaper which has been digitised by the British Library for the Living with Machines project.British Library
-
Dataset
Diachronic and diatopic word embeddings from newspapers digitised by the British Library (1830-1889): North and South England
Diachronic word embeddings (decade-level) trained with Word2Vec (via Gensim) on different geographic subcorpora of the Heritage Made Digital British and the Living with Machines historical newspaper collections: - North England (north.zip) - South England (south.zip) At the moment, for each subcorpus, Word2Vec models are available for each decade in the...Pedrazzini, Nilo ; McGillivray, Barbara
historical semantics, diachronic embeddings, late modern English, word embeddings, word vectors, word2vec, and diatopic embeddings