Index Catalog // British Library

2020

Dataset

Books related to 19th Century British Colonies derived from the Digitised 19th Century books dataset

A dataset derived from the Digitised 19th Century Books dataset, containing books related to 19th Century British Colonies. The dataset of 1288 items was created by filtering on location keywords and was then manually checked for accuracy. The data was augmented with additional columns including 'City', 'Colony Name' and...

British Library ; British Library Labs

Africa, colonialism, Canada, Ceylon, metadata, bibliographic, India, Australia, books, British Colony, and British Colonies

2020

Dataset

Books related to War derived from the Digitised 19th Century Books Dataset

A dataset derived from the Digitised 19th Century Books dataset, comprising all non-fiction English language books related to armed conflicts. The dataset of 1127 items was refined using keywords such as 'war', 'battle', 'uprising', 'revolt', 'rebellion', 'invasion' and 'mutiny'. This dataset was curated by students...

British Library ; British Library Labs

non-fiction, War, books, metadata, and bibliographic

2020

Dataset

Books related to the Industrial Revolution derived from the Digitised 19th Century books dataset

A dataset which is a subset of the Digitised 19th Century Books dataset comprising books related to the Industrial Revolution in Britain. The subset of 354 items was refined by using keywords associated with placenames and the topic of industrialism. This dataset was curated by the Aepyi student group at...

British Library ; British Library Labs

books, industrialism, metadata, bibliographic, and Industrial Revolution

2019

Dataset

Latin American books in Digitised 19th century books

A dataset which is derived from the 19th Century Books dataset comprising c.1,100 books which are related to Latin America, written in Spanish, English, German, French, Italian, Swedish and Dutch.

British Library ; British Library Labs

books, Latin America, metadata, and bibliographic

2019

Dataset

Books divided by Genre from the Digitised 19th century books dataset

A dataset derived from the Digitised 19th Century Books dataset which classifies the books by genre (Drama, Poetry, Prose, Music and unidentified). For Drama, Music and Prose several types were identified. For Drama: comedy, play, recitation and tragedy. For Prose: novel, parody, romance, satire, story, history subset of story and...

British Library ; British Library Labs

Music, Genre, Prose, books, Poetry, metadata, bibliographic, and Drama

2019

Dataset

Books containing images about Finland

A dataset derived from the Digitised 19th Century books dataset comprising approximately 40 titles with images about Finland. This dataset was compiled by Ruby Dixon, a student at Graveney School, who completed work experience at British Library Labs in 2016.

British Library ; British Library Labs

books, Finland, metadata, and bibliographic

2019

Dataset

Russian language books in the Digitised 19th century books dataset

A dataset which is a subset of the Digitised 19th Century books dataset comprising Russian language books. The spreadsheet contains metadata for 585 books in Russian. This dataset was compiled by Nadya Miryanova, a student at Lady Eleanor Holles, who completed work experience at British Library Labs in 2017.

British Library ; British Library Labs

books, metadata, bibliographic, and Russia

2019

Dataset

UK Doctoral Thesis Metadata from EThOS

This dataset has been superseded by a more recent version: https://doi.org/10.23636/1188. If you require access to an earlier version, please email openaccess@bl.uk, including the dataset title, date, and DOI in your request. The data in this collection comprises the bibliographic metadata for all UK doctoral theses listed in EThOS, the...

British Library ; Rosie, Heather

dissertations, Higher Education, doctoral, and student

2019

Dataset

Ground Truth transcriptions for training OCR of historical Arabic handwritten texts

This dataset comprises 120 digitised images (TIFF files) drawn from a selection of historical Arabic scientific manuscripts (10th-19th century) digitised through the British Library Qatar Foundation Partnership. Also contained are ground truth transcriptions (XML) for each page that can be used for training optical character recognition (OCR) or handwritten text...

British Library ; Keinan-Schoonbaert, Adi

Arabic, transcription, and OCR

2018

Dataset

Judicial Committee of the Privy Council: Linked Appeals Data

The dataset in this collection contains Linked Data about appeal cases heard by the Judicial Committee of the Privy Council between 1860 and 1998. The Judicial Committee of the Privy Council (JCPC) is the final court of appeal for British overseas territories and Crown dependencies, as well as ecclesiastical and...

Middle, Sarah

2019

Dataset

Ground Truth transcriptions for training OCR of historical Bengali printed texts - Transkribus

This dataset comprises 74 digitised images (TIFF files) drawn from a selection of early printed Bengali books (1713-1914) digitised through the Two Centuries of Indian Print project (https://www.bl.uk/projects/two-centuries-of-indian-print). Also contained are ground truth transcriptions (XML) for each page that can be used for training optical character recognition software on historical...

British Library ; Derrick, Tom

OCR, transcription, and Indian

2019

Dataset

Ground Truth transcriptions for training OCR of historical Bengali printed texts - Recognition of Early Indian Printed Documents competition

This dataset comprises 81 digitised images (TIFF files) drawn from a selection of early printed Bengali books (1713-1914) digitised through the Two Centuries of Indian Print project (https://www.bl.uk/projects/two-centuries-of-indian-print). Also contained are ground truth transcriptions (XML) for each page that can be used for training optical character recognition software on historical...

British Library ; Derrick, Tom

Indian, transcription, and OCR

2018

Dataset

Digitised Hebrew Manuscripts: Or 74 to Stowe Ch 297

This dataset comprises 32 digitised Hebrew manuscripts (1000 - 1903), with their shelfmarks in alphabetical order (Or 74 to Stowe Ch 297). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for image...

King, Ellie

