DatasetAbstract: This dataset comprises an Excel spreadsheet of text extracted from almost 2,000 digital images of maps and documents held in the War Office Archive, covering a large part of eastern Africa between c.1880 and 1940. The items were catalogued and digitised with generous funding from Indigo Trust. The harvested text includes names of historical settlements and ethnic regions in eastern Africa, descriptions of historical land use, topography and vegetation, and notes of ethnographic, military or administrative context. Auto-extraction of the text was carried out using the Google Vision API. The spreadsheet provides text found, confidence scores from Google Vision relating to transcription accuracy, locations of text on each image, and links to a geographical search interface that enables access to the relevant images and catalogue records hosted on the BL website. A large number of erroneous text results were cleaned from the full set of Google Vision responses, but some errors remain - for example, where individual characters have been incorrectly transcribed within words, though the words themselves should still be identifiable. In addition, not all words appearing on the maps were captured.
DatasetAbstract: This dataset comprises 581 images of maps of the former British East Africa created between 1890 and 1940 and a spreadsheet of related catalogue records. All Open Government Licence v1.0 (OGL). A user-friendly geographical search index of the maps is available on Google Maps. These JPEG files were converted from TIFF files using Irfan view. They relate to maps of the former British East Africa held in the British War Office (1890-1940).