Search
Search results
-
Dataset
Living with Machines alpha and beta Zooniverse 'accident' task data
Data created through crowdsourcing tasks hosted on the Zooniverse platform. Members of the public were asked to look at a selection of articles from 19th century newspapers that mentioned machines and decide if they described an industrial accident. A further task asked participants to transcribe personal, organisational and place names...
Zooniverse volunteers
crowdsourcing, digital history, citizen history, Living with Machines, newspapers, and digital humanities
-
Dataset
Indexes to the Dispatches of the East India Company Court of Directors to Indian Governments
The IOR/E/4 Correspondence with India comprises 1112 volumes dating from 1703-1858. The material is arranged into eight series: four series of letters received by the Court of Directors from the administration in India; and four series of dispatches sent by the Court to the same administrations. Subject, name and place...
India Office Library and Records ; British Library ; Hailey, Alex
-
Dataset
UK Doctoral Thesis Metadata from EThOS
This dataset has been superseded by a more recent version (5): https://doi.org/10.23636/1344 If you require access to an earlier version, please email openaccess@bl.uk, including the dataset title, date, and DOI in your request. The data in this collection comprises the bibliographic metadata for all UK doctoral theses listed in EThOS,...
British Library ; Rosie, Heather
higher education, ethos, dissertations, HE, research, PhD, doctoral, student, UK, theses, and thesis
-
Dataset
Faber Music and Music Sales Publications 2013 to 2018
The ‘Faber Music and Music Sales Publications 2013 to 2018’ dataset is an .xlsx (Excel Workbook) file containing metadata describing 57,202 digital and printed music publications published by Faber Music and Music Sales between 2013 and 2018 and deposited at the British Library under legal deposit legislation. The data was...
Roper, Amelie ; British Library
music sales, legal deposit, digital music publications, and Faber Music
-
Dataset
Text extracted from digitised maps of eastern Africa circa 1880-1940
This dataset comprises an Excel spreadsheet of text extracted from almost 2,000 digital images of maps and documents held in the War Office Archive, covering a large part of eastern Africa between c.1880 and 1940. The items were catalogued and digitised with generous funding from Indigo Trust. The harvested text...
Dykes, Nick
War Office Archive, place names, text extraction, military maps, East Africa, computer vision, land use, colonial history, and ethnography
-
Dataset
India Office Lists
The India Office Lists are annual reference works giving details of departments and post holders in: the India Office, London, 1858-1947; the Burma Office, 1935-47; the Government of India, Calcutta, later Delhi, 1858-1947; the main provincial administrations of Bengal, Bombay and Madras and minor administrations for the same dates. From...
British Library
government, India Office, Bengal, Bombay, Colonial India, and Madras
-
Dataset
Government of India, Annual Administration Reports
Annual administration reports of the territories of British India for the following areas: Government of India, 1870-1871; Government of Bengal, 1871-1936; Government of Burma, 1872-1899; Chin Hills, 1909-1923; Shan and Karenni States, 1889. For researchers seeking an overview of events and developments in the territories of British India...
British Library
government, India Office, Bengal, Burma, Shan and Karenni States, administration reports, and Chin Hills
-
Dataset
Sir Hans Sloane's Catalogues of his Library and Manuscripts
The files in this dataset are derived from microfilm copies of the original library catalogue of Sir Hans Sloane, now presented across 9 volumes, Sloane MS 3972 C 1-8, and the name index to the Sloane library catalogue, Sloane MS 3972 D. The catalogues are crucial for understanding the development...
British Library
library, catalogues, Sloane, and metadata
-
Dataset
Freemason manuscripts in the Modern Archive collections
Dataset of brief biographical entries for 365 prominent Freemasons, with links to relevant material held with the British Library’s Modern Archives and Manuscript Collections, from the 18th to the 20th century. The dataset was supported with funding from the American Friends of the British Library, and was created by Tabitha...
British Library
Freemasons and archives
-
Dataset
Ground Truth transcriptions for training OCR of historical Bengali printed texts – Recognition of Early Indian Printed Documents competition - updated with improved XML coordinates
This dataset comprises 81 digitised images (TIFF files) drawn from a selection of early printed Bengali books (1713-1914) digitised through the Two Centuries of Indian Print project (https://www.bl.uk/projects/two-centuries-of-indian-print). Also contained are ground truth transcriptions (XML) for each page that can be used for training optical character recognition software on historical...
British Library ; Derrick, Tom
OCR, Indian, and transcription
-
Dataset
The Sun
The Sun was a daily evening newspaper founded in 1792 with the support of then Prime Minister, William Pitt, and his Tory government. By the mid-1830s the politics of the newspaper had shifted, and it was advocating liberal and free trade principles. Ran 1792-1871, with dataset covering 1801-1871.
British Library
-
Dataset
The Liverpool Standard etc
The Liverpool Standard and General Commercial Advertiser (1832-1856, with two changes of title) was a Conservative newspaper established by local politicians to counter the rise of Radicalism and promote “Church and State” ideology.
British Library
-
Dataset
Colored News
Colored News (1855) was an illustrated general interest weekly newspaper. It was the first British newspaper to publish illustrations in colour.
British Library
-
Dataset
The Northern Daily Times etc
The Liverpool-based Northern Daily Times (1853-1861, with two changes of title) was the first provincial daily newspaper in England to enjoy a sustained run. It was also one of the very first one penny dailies.
British Library
-
Dataset
EThOS metadata files augmented with identifiers
A selection of files of the EThOS metadata augmented by the organisational identifiers listed below. These files were created to inform a deliverable which is part of the FREYA project which aims to gather enhanced provenance information in the EThOS metadata. The ReadMe file contains full details of the files, their...
British Library
persistent identifiers, ISNI, ROR, and GRID
-
Dataset
Books related to theatre derived from the Digitised 19th Century Books dataset
A dataset derived from the Digitised 19th Century Books dataset which contains books pertaining to theatre written in English. The dataset of 841 items was created by filtering by keywords which are related to different genres of play including Drama, Act, Scene, Play, Comedy, Farce, Pantomime, Tragedy and Shakespeare and...
British Library ; British Library Labs
act, genre, books, metadata, bibliographic, theatre, and play
-
Dataset
Books related to India from the Digitised 19th Century Books dataset
A dataset which is derived from the Digitised 19th Century books dataset focusing on books related to India. The dataset was created by refining the book title field using keywords related to names used for India during the period, places within India, cultural terms such as 'Hindu' and another term...
British Library ; British Library Labs
books, metadata, bibliographic, and India
-
Dataset
Books related to 19th Century British Colonies derived from the Digitised 19th Century books dataset
A dataset derived from the Digitised 19th Century Books dataset which contains books related to 19th Century British Colonies. The dataset of 1288 items was created using filtering by keywords of locations and then manually checked for accuracy. The data was augmented with additional columns including 'City', 'Colony Name' and...
British Library ; British Library Labs
Africa, colonialism, Canada, Ceylon, metadata, bibliographic, India, Australia, books, British Colony, and British Colonies
-
Dataset
Books related to War derived from the Digitised 19th Century Books Dataset
A dataset which is derived from the Digitised 19th Century Books dataset comprising all non-fiction English language books related to armed conflicts. The dataset of 1127 items was developed by refining based on keywords such as 'war', 'battle', 'uprising', 'revolt', 'rebellion', 'invasion' and 'mutiny'. This dataset was curated by students...
British Library ; British Library Labs
non-fiction, War, books, metadata, and bibliographic
-
Dataset
Books related to the Industrial Revolution derived from the Digitised 19th Century books dataset
A dataset which is a subset of the Digitised 19th Century Books dataset comprising books related to the Industrial Revolution in Britain. The subset of 354 items was refined by using keywords associated with placenames and the topic of industrialism. This dataset was curated by the Aepyi student group at...
British Library ; British Library Labs
books, industrialism, metadata, bibliographic, and Industrial Revolution
-
Dataset
Latin American books in Digitised 19th century books
A dataset which is derived from the 19th Century Books dataset comprising c.1,100 books which are related to Latin America, written in Spanish, English, German, French, Italian, Swedish and Dutch.
British Library ; British Library Labs
books, Latin America, metadata, and bibliographic
-
Dataset
Books divided by Genre from the Digitised 19th century books dataset
A dataset derived from the Digitised 19th Century Books dataset which classifies the books by genre (Drama, Poetry, Prose, Music and unidentified). For Drama, Music and Prose several types were identified. For Drama: comedy, play, recitation and tragedy. For Prose: novel, parody, romance, satire, story, history subset of story and...
British Library ; British Library Labs
Music, Genre, Prose, books, Poetry, metadata, bibliographic, and Drama
-
Dataset
Books containing images about Finland
A dataset derived from the Digitised 19th Century books dataset comprising books with images about Finland, approximately 40 titles. This dataset was compiled by Ruby Dixon, a student at Graveney School who completed work experience at British Library Labs in 2016.
British Library ; British Library Labs
books, Finland, metadata, and bibliographic
-
Dataset
Russian language books in the Digitised 19th century books dataset
A dataset which is a subset of the Digitised 19th Century books dataset comprising Russian language books. The spreadsheet contains metadata of 585 books in Russian. This dataset was compiled by Nadya Miryanova, a student at Lady Eleanor Holles who completed work experience at British Library Labs in 2017.
British Library ; British Library Labs
books, metadata, bibliographic, and Russia
-
Dataset
British and Irish Newspapers
A title-level list of British, Irish, British Overseas Territories and Crown Dependencies newspapers held by the British Library.
British Library
datasets, catalogues, media, newspapers, periodicals, and metadata
-
Dataset
Ground Truth transcriptions for training OCR of historical Arabic handwritten texts
This dataset comprises 120 digitised images (TIFF files) drawn from a selection of historical Arabic scientific manuscripts (10th-19th century) digitised through the British Library Qatar Foundation Partnership. Also contained are ground truth transcriptions (XML) for each page that can be used for training optical character recognition (OCR) or handwritten text...
British Library ; Keinan-Schoonbaert, Adi
Arabic, transcription, and OCR
-
Dataset
Judicial Committee of the Privy Council: Linked Appeals Data
The dataset in this collection contains Linked Data about appeal cases heard by the Judicial Committee of the Privy Council between 1860 and 1998. The Judicial Committee of the Privy Council (JCPC) is the final court of appeal for British overseas territories and Crown dependencies, as well as ecclesiastical and...
Middle, Sarah
-
Dataset
John Jaffray dataset; a hand list of printed books and scrap books compiled by Jaffray relating to bookbinding and trade unionism in (mainly) 19th century Victorian London
This set comprises 169 records on 222 pages of a PDF listing the contents of the Jaffray Collection (shelf mark Jaff 1 to Jaff 169) composed using free text. John Jaffray (1811-1869) was a bookbinder in Victorian London, interested in bookbinding, trade unionism and Chartism. This is a restricted collection...
Marks, P. J. M.
John Jaffray bookbinder, 19th century, trade unionism, and bookbinding
-
Dataset
Ground Truth transcriptions for training OCR of historical Bengali printed texts - Transkribus
This dataset comprises 74 digitised images (TIFF files) drawn from a selection of early printed Bengali books (1713-1914) digitised through the Two Centuries of Indian Print project (https://www.bl.uk/projects/two-centuries-of-indian-print). Also contained are ground truth transcriptions (XML) for each page that can be used for training optical character recognition software on historical...
British Library ; Derrick, Tom
OCR, transcription, and Indian
-
Dataset
Ground Truth transcriptions for training OCR of historical Bengali printed texts - Recognition of Early Indian Printed Documents competition
This dataset comprises 81 digitised images (TIFF files) drawn from a selection of early printed Bengali books (1713-1914) digitised through the Two Centuries of Indian Print project (https://www.bl.uk/projects/two-centuries-of-indian-print). Also contained are ground truth transcriptions (XML) for each page that can be used for training optical character recognition software on historical...
British Library ; Derrick, Tom
Indian, transcription, and OCR
-
Dataset
Software Citation workshop panel session voting data
This data was captured from an interactive panel discussion session at the Software Citation workshop at the British Library on Monday 13 June. The audience were asked to vote on a series of questions via Mentimeter, with the panel (and audience members) then discussing the results. Mentimeter allows export of...
Chue Hong, Neil ; Johnson, Jon ; Whitaker, Kirstie ; Madden, Frances
-
Dataset
Results From A 2015 Survey On Git/Distributed Version Control At Imperial College London
These are the anonymised results from a survey run at Imperial College London in November-December 2015. The survey was aimed at users of distributed version control systems, in particular Git. Before publishing the results I deleted all comments to avoid individuals being identified. The survey was designed to...
Reimer, Torsten ; Boakye, Gifty
higher education, survey, Git, distributed version control, and software development
-
Dataset
Digitised Hebrew Manuscripts: Or 74 to Stowe Ch 297
This dataset comprises 32 digitised Hebrew manuscripts (1000 - 1903), with their shelfmarks in alphabetical order (Or 74 to Stowe Ch 297). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for image...
King, Ellie
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Or 1389 to Sloane MS 3173
This dataset comprises 32 digitised Hebrew manuscripts (1200 - 1899), with their shelfmarks in alphabetical order (Or 1389 to Sloane MS 3173). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for image...
King, Ellie
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Or 2518 to Or 5834
This dataset comprises 33 digitised Hebrew manuscripts (900 - 1899), with their shelfmarks in alphabetical order (Or 2518 to Or 5834). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for image and...
King, Ellie
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Add MS 27169 to Or 12983
This dataset comprises 21 digitised Hebrew manuscripts (1100 - 1899), with their shelfmarks in alphabetical order (Add MS 27169 to Or 12983). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for image...
King, Ellie
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised maps of the former British East Africa
This dataset comprises 581 images of maps of the former British East Africa created between 1890 and 1940 and a spreadsheet of related catalogue records. All Open Government Licence v1.0 (OGL). A user-friendly geographical search index of the maps is available on Google Maps. These JPEG files were converted from...
Dykes, Nick
War Office Archive, documents, Intelligence, Uganda, East Africa, British East Africa, Maps, Military maps, and Kenya
-
Dataset
Italian Academies Project Database XML Records
Italian Academies 1525-1700 metadata. This dataset comprises 8598 XML records representing 587 Academies, 7100 people and 911 works. Some of the XML files in this dataset contain links to images, which are available in the Italian Academies Project Database Images dataset: https://doi.org/10.21250/iad2 XML nodes with the 'ImageId' attribute contain the filename...
Gianfrancesco, Lorenza ; Testa, Simone ; Everson, Jane ; Reidy, Dennis ; Sampson, Lisa …
-
Dataset
Italian Academies Project Database Images
Italian Academies 1525-1700 images. This dataset comprises 1084 image files in JPEG format. The images include emblems, portraits and title pages for academies, people and works. The XML metadata associated with these images contains references to their file names: https://doi.org/10.21250/iad1
Gianfrancesco, Lorenza ; Testa, Simone ; Everson, Jane ; Reidy, Dennis ; Sampson, Lisa …
-
Dataset
SherlockNet data
Using Convolutional Neural Networks to Explore Over 400 Years of Book Illustrations: Starting from February 2016, as part of the British Library Labs Competition, we embarked on a collaboration with the British Library Labs and the British Museum to tag and caption the entire British Library 1M Collection, a set...
Zhao, Luda ; Do, Brian ; Wang, Karen
tagging, Flickr, images, tags, digitised, Microsoft, sherlocknet, and books
-
Dataset
Linked Open British National Bibliography - Forthcoming Books N-Triples and RDF/XML
This dataset includes metadata for forthcoming books to be published or distributed in the UK.
Deliot, Corine
British National Bibliography, forthcoming, linked open data, BNB, N-Triples, RDF/XML, NT, CIP, and metadata
-
Dataset
Linked Open British National Bibliography - Books. 1950- N-Triples and RDF/XML.
This dataset includes metadata for books published or distributed in the UK since 1950.
Deliot, Corine
British National Bibliography, BNB, NT, linked open data, RDF/XML, N-Triples, books, and metadata
-
Dataset
UK Selective Web Archive Classification Dataset. 1996 - 2010. TSV.
The dataset comprises a manually curated selective archive produced by UKWA which includes the classification of sites into a two-tiered subject hierarchy. In partnership with the Internet Archive and JISC, UKWA had obtained access to the subset of the Internet Archive’s web collection that relates to the UK. The JISC...
UK Web Archive
archive, web domain dataset, JISC UK, classification dataset, UKWA Open Data, and 1996-2014
-
Dataset
Linked Open British National Bibliography - Serials. 1950- N-Triples and RDF/XML
This dataset includes metadata for serials published or distributed in the UK since 1950.
Deliot, Corine
British National Bibliography, serials, linked open data, BNB, N-Triples, RDF/XML, NT, and metadata
-
Dataset
Digitised Books - Images identified as Medium Sized Images. c. 1567 - c. 1900. JPG
The dataset comprises c. 217,101 images identified as 'Medium Sized Images' from the British Library's Flickr Commons collections, dating between c. 1567 - c. 1900. The images were algorithmically gathered from 49,455 digitised books, equating to 65,227 volumes (25+ million pages), published between c. 1510 - c. 1900; Medium Sized...
British Library ; British Library Labs
medium sized images, books, images, digitised, and Microsoft
-
Dataset
Digitised Books - Images of the bound covers of books. c. 1510 - c. 1900. JPG
The dataset comprises c. 61,561 images identified as 'Book Covers' from the British Library's Flickr Commons collections, dating between c. 1510 - c. 1900.
British Library ; British Library Labs
digitised, books, Microsoft, images, and bookcovers
-
Dataset
AAS Card Catalogues: Chinese (Wade Giles)
This dataset contains digitised cards from the Wade-Giles card catalogue.
British Library
-
Dataset
Digitised Books - Images identified as Plates. c. 1528 - c. 1900. JPG
The dataset comprises c. 385,237 images identified as 'Plates' from the British Library's Flickr Commons collections, dating between c. 1528 - c. 1900. The images were algorithmically gathered from 49,455 digitised books, equating to 65,227 volumes (25+ million pages), published between c. 1510 - c. 1900; Plates have currently been...
British Library ; British Library Labs
-
Dataset
Mapping Jewish Communities Of The Byzantine Empire
The aim of the project was to map the Jewish presence in the Byzantine empire using GIS (Geographical Information Systems). All references (published and unpublished) to Jewish communities in the Byzantine Empire were gathered and collated. The data were incorporated in a GIS which will be made freely available to...
Rees, Gethin ; Lange, Nicholas De ; Panayotov, Alexander ; Palm, Fredrik
-
Dataset
Early Buddhist Rock-Cut Monasteries Of The Western Ghats: Locations
CSV files containing locations of early Buddhist rock-cut monasteries in the Western Ghats and Konkan coast regions of India. Only the caves cut between second century BCE and the early fifth century CE are included: from the earliest period of cutting up until just prior to the introduction of the...
Rees, Gethin