Search Results
-
Dataset
Selected edge painting on British Library printed books: A work in progress
Bookbindings were (and are) sometimes decorated via painting the edges of the leaves, usually but not exclusively, the fore edges of text blocks. The painting can be visible when the book is closed, or hidden beneath a layer of gold, when the edges have been gilt. This dataset covers examples...Marks, P.J.M.
bindings, painting under gilt, foreedge, fanned out leaves, fore-edge painting, foreedge paintings, fore-edge paintings, hidden fore edge paintings, fore-edge, and bookbindings
-
Dataset
British Library Television News Programme-Level List
This list provides a programme-level record of all television news and current affairs programmes recorded by the British Library’s Broadcast News service between March 2010 and May 2022. All of the channels featured were receivable free-to-air in the UK and licensed by Ofcom. All of the programmes listed can be...British Library
television, news, and current affairs
-
Dataset
StopsGB: Structured Timeline of Passenger Stations in Great Britain
Michael Quick's book _Railway Passenger Stations in Great Britain: a Chronology_ offers a uniquely rich and detailed account of Britain's changing railway infrastructure. Its listing of over 12,000 stations allows us to reconstruct the coming of rail at both micro- and macro-scales. However, being published originally as a book (and...
-
Dataset
John Jaffray dataset; a hand list of printed books and scrap books compiled by Jaffray relating to bookbinding and trade unionism in (mainly) 19th century Victorian London
This set comprises 169 records on 222 pages of a PDF listing the contents of the Jaffray Collection (shelf mark Jaff 1 to Jaff 169) composed using free text. John Jaffray (1811-1869) was a bookbinder in Victorian London, interested in bookbinding, trade unionism and Chartism. This is a restricted collection...Marks, P. J. M.
John Jaffray bookbinder, 19th century, trade unionism, and bookbinding
-
Dataset
Digitised Books - Images of the bound covers of books. c. 1510 - c. 1900. JPG
The dataset comprises c. 61,561 images identified as 'Book Covers' from the British Library's Flickr Commons collections, dating between c. 1510 - c. 1900.British Library ; British Library Labs
digitised, books, Microsoft, images, and bookcovers
-
Dataset
Digitised Books - Images identified as Plates. c. 1528 - c. 1900. JPG
The dataset comprises c. 385,237 images identified as 'Plates' from the British Library's Flickr Commons collections, dating between c. 1528 – c. 1900. The images were algorithmically gathered from 49,455 digitised books, equating to 65,227 volumes (25+ million pages), published between c. 1510 - c. 1900; Plates have currently been...British Library ; British Library Labs
-
Dataset
Digitised Books - Images identified as Embellishments. c. 1510 - c. 1900. JPG
The images were algorithmically gathered from 49,455 digitised books, equating to 65,227 volumes (25+ million pages), published between c. 1510 - c. 1900. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The images are in .JPEG format.British Library ; British Library Labs
embellishments, digitised, books, Microsoft, and images
-
Dataset
Digitised Books - Images identified as Medium Sized Images. c. 1567 - c. 1900. JPG
The dataset comprises c. 217,101 images identified as 'Medium Sized Images' from the British Library's Flickr Commons collections, dating between c. 1567 - c. 1900. The images were algorithmically gathered from 49,455 digitised books, equating to 65,227 volumes (25+ million pages), published between c. 1510 - c. 1900; Medium Sized...British Library ; British Library Labs
digitised, books, Microsoft, images, and medium sized images
-
Dataset
Digitised Books. c. 1510 - c. 1900. JSON (OCR derived text)
The dataset comprises text created by OCR from the 49,455 digitised books, equating to 65,227 volumes (25+ million pages), published between c. 1510 - c. 1900. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in JavaScript Object Notation (JSON) text...British Library ; British Library Labs
-
Dataset
Digitised Books - Images identified as Medium Sized Images. c. 1567 - c. 1900. JPG
The dataset comprises c. 217,101 images identified as 'Medium Sized Images' from the British Library's Flickr Commons collections, dating between c. 1567 - c. 1900. The images were algorithmically gathered from 49,455 digitised books, equating to 65,227 volumes (25+ million pages), published between c. 1510 - c. 1900; Medium Sized...British Library ; British Library Labs
medium sized images, books, images, digitised, and Microsoft
-
Dataset
OCR text derived from digitised books published 1900 - c. 1946. ALTO XML.
Unfortunately we are unable to make this dataset available due to copyright reasons. This set consists of 1251 volumes, published between 1900-1946. The dataset comprises text from the collection of digitised books created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history,...British Library ; British Library Labs
-
Dataset
OCR text derived from digitised books published 1880 - 1889 in ALTO XML
This set consists of 10856 volumes, published between 1880-1889. The dataset comprises text from the collection of digitised books created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO)...British Library ; British Library Labs
XML, books, digitised, metadata, Microsoft, ALTO, and nineteenth century
-
Dataset
OCR text derived from digitised books published c. 1510 - 1699 in ALTO XML
This set consists of 693 volumes, published between c. 1510 - 1699. The dataset comprises text from the collection of digitised books created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and...British Library ; British Library Labs
XML, sixteenth century, books, digitised, seventeenth century, Microsoft, ALTO, and metadata
-
Dataset
OCR text derived from digitised books published 1840 - 1849 in ALTO XML
This set consists of 4070 volumes, published between 1840-1849. The dataset comprises text from the collection of digitised books created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO)...British Library ; British Library Labs
XML, books, digitised, metadata, Microsoft, nineteenth century, and ALTO
-
Dataset
OCR text derived from digitised books published 1700 - 1799 in ALTO XML.
The dataset comprises text from the collection of digitised books created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO) Extensible Markup Language (XML) format.British Library ; British Library Labs
XML, books, digitised, metadata, Microsoft, eighteenth century, and ALTO
-
Dataset
OCR text derived from digitised books published 1800 - 1809 in ALTO XML
This set consists of 1502 volumes, published between 1800-1809. The dataset comprises text from the collection of digitised books created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO)...British Library ; British Library Labs
XML, books, digitised, metadata, Microsoft, nineteenth century, and ALTO
-
Dataset
OCR text derived from digitised books published 1810 - 1819 in ALTO XML
This set consists of 2338 volumes, published between 1810-1819. The dataset comprises text from the collection of digitised books created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO)...British Library ; British Library Labs
XML, books, digitised, metadata, Microsoft, nineteenth century, and ALTO
-
Dataset
OCR text derived from digitised books published 1820 - 1829 in ALTO XML
This set consists of 2739 volumes, published between 1820-1829. The dataset comprises text from the collection of digitised books created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO)...British Library ; British Library Labs
XML, books, digitised, metadata, Microsoft, nineteenth century, and ALTO
-
Dataset
OCR text derived from digitised books published 1830 - 1839 in ALTO XML
This set consists of 2639 volumes, published between 1830-1839. The dataset comprises text from the collection of digitised books created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO)...British Library ; British Library Labs
XML, books, digitised, metadata, Microsoft, nineteenth century, and ALTO
-
Dataset
OCR text derived from digitised books published 1870 - 1879 in ALTO XML
This set consists of 8630 volumes, published between 1870-1879. The dataset comprises text from the collection of digitised books created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO)...British Library ; British Library Labs
XML, books, digitised, metadata, Microsoft, nineteenth century, and ALTO
-
Dataset
OCR text derived from digitised books published 1850 - 1859 in ALTO XML
This set consists of 5818 volumes, published between 1850-1859. The dataset comprises text from the collection of digitised books created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO)...British Library ; British Library Labs
XML, books, digitised, metadata, Microsoft, nineteenth century, and ALTO
-
Dataset
OCR text derived from digitised books published 1860 - 1869 in ALTO XML
This set consists of 7498 volumes, published between 1860-1869. The dataset comprises text from the collection of digitised books created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO)...British Library ; British Library Labs
XML, books, digitised, metadata, Microsoft, nineteenth century, and ALTO
-
Dataset
OCR text derived from digitised books published 1890 - 1899 in ALTO XML
This set consists of 14847 volumes, published between 1890-1899. The dataset comprises text from the collection of digitised books created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO)...British Library ; British Library Labs
XML, books, digitised, metadata, Microsoft, nineteenth century, and ALTO
-
Dataset
UK Doctoral Thesis Metadata from EThOS
This dataset has been superseded by a more recent version: https://doi.org/10.23636/j278-4b96 If you require access to an earlier version, please email openaccess@bl.uk, including the dataset title, date, and DOI in your request. The data in this collection comprises the bibliographic metadata for all UK doctoral theses listed in EThOS, the...British Library ; Rosie, Heather
higher education, student, EThOS, research, doctoral, thesis, PhD, UK, dissertations, and theses
-
Dataset
British Library Newspaper Title-level List: A list of catalogued newspaper titles held by the British Library
A title-level list of catalogued newspapers held by the British Library.British Library
datasets, catalogues, media, newspapers, periodicals, and metadata
-
Dataset
Persistent Identifiers as IRO Infrastructure: Survey 2 Data
The survey ran from 4 October to 8 November 2021 and was open to everyone working in Galleries, Libraries, Archives and Museums internationally, though it had a clear UK focus. Some responses have been removed or recoded to protect the identity of respondents.Kotarski, Rachael ; Madden, Frances
-
Dataset
Digitised Books. c. 1510 - c. 1900. JSONL (OCR derived text + metadata)
The dataset comprises metadata and OCR generated text from 49,455 digitised books published between c. 1510 - c. 1900. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in JSON Lines (JSONL) text format.British Library Labs ; British Library
OCR and monographs
-
Dataset
Digitised maps of the former British East Africa
This dataset comprises 581 images of maps of the former British East Africa created between 1890 and 1940 and a spreadsheet of related catalogue records. All Open Government Licence v1.0 (OGL). A user-friendly geographical search index of the maps is available on Google Maps. These JPEG files were converted from...Dykes, Nick
War Office Archive, documents, Intelligence, Uganda, East Africa, British East Africa, Maps, Military maps, and Kenya
-
Dataset
EThOS metadata files augmented with identifiers
A selection of files of the EThOS metadata augmented by the organisational identifiers listed below. These files were created to inform a deliverable which is part of the FREYA project which aims to gather enhanced provenance information in the EThOS metadata. The ReadMe file contains full details of the files, their...British Library
persistent identifiers, ISNI, ROR, and GRID
-
Dataset
UK Doctoral Thesis Metadata from EThOS
This dataset has been superseded by a more recent version: https://doi.org/10.23636/1137 If you require access to an earlier version, please email openaccess@bl.uk, including the dataset title, date, and DOI in your request. The data in this collection comprises the bibliographic metadata for all UK doctoral theses listed in EThOS, the UK's national thesis service. We...British Library ; Rosie, Heather
Higher education, dissertations, HE, research, doctoral, student, UK, theses, ethos, and thesis
-
Dataset
Mapping Irish Football
The Mapping Irish Football project called on the crowd to share any newspaper references they may have come across of women and any code of football prior to and including 1973. It is hoped that this project will start a conversation amongst researchers interested in Irish sports to do more...Byrne, Helena ; Bolton, Steve ; Carrier, John ; Farrell, Gerald ; Faller, Helge …
women's football, newspaper data, crowdsourcing, and Irish football history
-
Dataset
UK Doctoral Thesis Metadata from EThOS
This dataset has been superseded by a more recent version: https://doi.org/10.23636/ybpt-nh33 If you require access to an earlier version, please email openaccess@bl.uk, including the dataset title, date, and DOI in your request. The data in this collection comprises the bibliographic metadata for all UK doctoral theses listed in EThOS, the...British Library ; Rosie, Heather
higher education, ethos, dissertations, thesis, research, PhD, doctoral, student, UK, and theses
-
Dataset
19th Century Books - Metadata 05/2021
This dataset contains metadata for a selection of monographs that are identified in the catalogue as being published during the 19th Century. This metadata has been extracted from British Library catalogue records. The metadata held within our main catalogue is updated regularly. This metadata dataset should be considered a snapshot...British Library
metadata and monographs
-
Dataset
IMPACT Digitisation Centre of Competence Dataset
The Impact Centre of Competence dataset contains more than half a million representative text-based images compiled by a number of major European libraries. Covering texts from as early as 1500, and containing material from newspapers, books, pamphlets and typewritten notes, the dataset is an invaluable resource for future research into...Universitat d’Alacant ; Instituut voor de Nederlandse Taal ; Koninklijke Bibliotheek ; Bibliothèque Nationale de France ; British Library …
-
Dataset
Early Music Online
Access is provided via the Official URL. These digitised volumes contain approximately 10,000 musical compositions, which have been individually indexed. The volumes mainly consist of partbooks of vocal polyphony, but also include some early printed tablatures for keyboard or plucked string instruments. They include music printed in Italy, Germany, France...Rose, Stephen ; Tuppen, Sandra
-
Dataset
19th Century Books - metadata with additional crowdsourced annotations
This dataset contains metadata for resources belonging to the British Library’s digitised printed books (18th-19th century) collection (bl.uk/collection-guides/digitised-printed-books). This metadata has been extracted from British Library catalogue records. The metadata held within our main catalogue is updated regularly. This metadata dataset should be considered a snapshot of this metadata. For...British Library
metadata, zooniverse, and monographs
-
Dataset
Digitised 19th Century Books - Metadata - 01/09/2021
This dataset contains metadata for resources belonging to the British Library’s digitised printed books (18th-19th century) collection (https://www.bl.uk/collection-guides/digitised-printed-books). This metadata has been extracted from British Library catalogue records. The metadata held within our main catalogue is updated regularly. This metadata dataset should be considered a snapshot of this metadata. For...British Library ; British Library Labs
Microsoft, JSON, metadata, books, and bibliographic
-
Dataset
Collective Wisdom crowdsourcing organiser and volunteer survey results
Results from two short surveys run for the Collective Wisdom project. Funded by the UK Arts and Humanities Research Council (AHRC), the Collective Wisdom project captures the collective wisdom of researchers and practitioners in crowdsourcing, citizen history, citizen science and public / community participation in research with cultural heritage collections....Ridge, Mia ; Ferriter, Meghan ; Blickhan, Samantha
-
Dataset
Locating a National Collection Our Place audience survey results
Results of an audience survey conducted by Locating a National Collection and funded by the AHRC. The research has been led by the National Trust in collaboration with the British Library and Research Bods, a market research company who have delivered results using the NT’s ‘Our Place’ online audience research...Vitale, Valeria ; Rees, Gethin ; Hunt, Alex
-
Dataset
Developing identifiers workshop analysis
A collection of use cases gathered for the Developing Identifiers for Heritage Collections resource (https://tanc-ahrc.github.io/PIDResources/). It describes all the use cases for which PIDs are used and was used to inform the aspects described in the resource.Madden, Frances
-
Dataset
UK Doctoral Theses (EThOS) Abstracts and Metadata - 01/03/2015. XLS.
This dataset has been superseded by a more recent version: https://doi.org/10.22021/ETHOSCSV201810 If you require access to an earlier version, please email openaccess@bl.uk, including the dataset title, date, and DOI in your request. The data in this collection comprises the bibliographic metadata for all UK doctoral theses listed in EThOS, the UK's national thesis service. We...British Library ; Rosie, Heather
-
Dataset
Catalogue records of photographs (1850-1950)
A set of catalogue records for photographs (created 1850-1950) that are held at the British Library. Export from the Integrated Archives and Manuscripts System of only CC0 published records. Personal or sensitive information has been removed. This dataset was created specifically for the Legacies of Catalogue Descriptions and Curatorial Voice:...British Library ; Moretto, Nicolas
datasets, metadata, catalogues, and photographs
-
Dataset
al-Durr al-naqī fī fann al-mūsīqī (Add MS 23494)
This dataset is a PDF file containing the images and transcription of the manuscript titled al-Durr al-naqī fī fann al-mūsīqī الدرّ النقيّ في فنّ الموسيقي by Aḥmad ibn 'Abd al-Raḥmān al-Mawṣilī أحمد بن عبد الرحمن الموصلي. The manuscript was digitised through the British Library Qatar Foundation Partnership, and made available through...British Library ; Keinan-Schoonbaert, Adi
transcription, Arabic, and OCR
-
Dataset
Persistent Identifiers as IRO Infrastructure: Survey Data
The survey ran from 28 May to 14 September 2020 and was open to everyone working in Galleries, Libraries, Archives and Museums internationally, though it had a clear UK focus. Some responses have been removed or recoded to protect the identity of respondents. It is intended to re-run the...Kotarski, Rachael ; Madden, Frances
-
Dataset
DUKweb (Diachronic UK web)
We present DUKweb, a set of large-scale resources useful for the diachronic analysis of contemporary English. The dataset is derived from JISC UK Web Domain Dataset (1996-2013), which collects resources from the Internet Archive that were hosted on domains ending in ‘.uk’. The dataset includes co-occurrences matrices for each year...Basile, Pierpaolo ; Tsakalidis, Adam
-
Dataset
Indexes to the Dispatches of the East India Company Court of Directors to Indian Governments
The IOR/E/4 Correspondence with India comprises 1112 volumes dating from 1703-1858. The material is arranged into eight series: four series of letters received by the Court of Directors from the administration in India; and four series of dispatches sent by the Court to the same administrations. Subject, name and place...India Office Library and Records ; British Library ; Hailey, Alex
-
Dataset
UK Doctoral Thesis Metadata from EThOS
This dataset has been superseded by a more recent version (5): https://doi.org/10.23636/1344 If you require access to an earlier version, please email openaccess@bl.uk, including the dataset title, date, and DOI in your request. The data in this collection comprises the bibliographic metadata for all UK doctoral theses listed in EThOS,...British Library ; Rosie, Heather
higher education, ethos, dissertations, HE, research, PhD, doctoral, student, UK, theses, and thesis
-
Dataset
India Office Lists
The India Office Lists are annual reference works giving details of departments and post holders in: the India Office, London, 1858-1947; the Burma Office, 1935-47; the Government of India, Calcutta, later Delhi, 1858-1947; the main provincial administrations of Bengal, Bombay and Madras and minor administrations for the same dates. From...British Library
government, India Office, Bengal, Bombay, Colonial India, and Madras
-
Dataset
Government of India, Annual Administration Reports
Annual administration reports of the territories of British India for the following areas: Government of India 1870-1871; Government of Bengal, 1871-1936 ; Government of Burma, 1872-1899 ; Chin Hills, 1909-1923; Shan and Karenni States, 1889. For researchers seeking an overview of events and developments in the territories of British India...British Library
government, India Office, Bengal, Burma, Shan and Karenni States, administration reports, and Chin Hills
-
Dataset
Sir Hans Sloane's Catalogues of his Library and Manuscripts
The files in this dataset are derived from microfilm copies of the original library catalogue of Sir Hans Sloane, now presented across 9 volumes, Sloane MS 3972 C 1-8, and the name index to the Sloane library catalogue, Sloane MS 3972 D. The catalogues are crucial for understanding the development...British Library
library, catalogues, Sloane, and metadata
-
Dataset
Freemason manuscripts in the Modern Archive collections
Dataset of brief biographical entries for 365 prominent Freemasons, with links to relevant material held with the British Library’s Modern Archives and Manuscript Collections, from the 18th to the 20th century. The dataset was supported with funding from the American Friends of the British Library, and was created by Tabitha...British Library
Freemasons and archives
-
Dataset
Books related to theatre derived from the Digitised 19th Century Books dataset
A dataset derived from the Digitised 19th Century Books dataset which contains books pertaining to theatre written in English. The dataset of 841 items was created by filtering by keywords which are related to different genres of play including Drama, Act, Scene, Play, Comedy, Farce, Pantomime, Tragedy and Shakespeare and...British Library ; British Library Labs
act, genre, books, metadata, bibliographic, theatre, and play
-
Dataset
Books related to India from the Digitised 19th Century Books dataset
A dataset which is derived from the Digitised 19th Century books dataset focusing on books related to India. The dataset was created by refining the book title field using keywords related to names used for India during the period, places within India, cultural terms such as 'Hindu' and another term...British Library ; British Library Labs
books, metadata, bibliographic, and India
-
Dataset
Books related to 19th Century British Colonies derived from the Digitised 19th Century books dataset
A dataset derived from the Digitised 19th Century Books dataset which contains books related to 19th Century British Colonies. The dataset of 1288 items was created using filtering by keywords of locations and then manually checked for accuracy. The data was augmented with additional columns including 'City', 'Colony Name' and...British Library ; British Library Labs
Africa, colonialism, Canada, Ceylon, metadata, bibliographic, India, Australia, books, British Colony, and British Colonies
-
Dataset
Books related to War derived from the Digitised 19th Century Books Dataset
A dataset which is derived from the Digitised 19th Century Books dataset comprising all non-fiction English language books related to armed conflicts. The dataset of 1127 items was developed by refining based on keywords such as 'war', 'battle', 'uprising', 'revolt', 'rebellion', 'invasion' and 'mutiny'. This dataset was curated by students...British Library ; British Library Labs
non-fiction, War, books, metadata, and bibliographic
-
Dataset
Books related to the Industrial Revolution derived from the Digitised 19th Century books dataset
A dataset which is a subset of the Digitised 19th Century Books dataset comprising books related to the Industrial Revolution in Britain. The subset of 354 items was refined by using keywords associated with placenames and the topic of industrialism. This dataset was curated by the Aepyi student group at...British Library ; British Library Labs
books, industrialism, metadata, bibliographic, and Industrial Revolution
-
Dataset
Latin American books in Digitised 19th century books
A dataset which is derived from the 19th Century Books dataset comprising c.1,100 books which are related to Latin America, written in Spanish, English, German, French, Italian, Swedish and Dutch.British Library ; British Library Labs
books, Latin America, metadata, and bibliographic
-
Dataset
Books divided by Genre from the Digitised 19th century books dataset
A dataset derived from the Digitised 19th Century Books dataset which classifies the books by genre (Drama, Poetry, Prose, Music and unidentified). For Drama, Music and Prose several types were identified. For Drama: comedy, play, recitation and tragedy. For Prose: novel, parody, romance, satire, story, history subset of story and...British Library ; British Library Labs
Music, Genre, Prose, books, Poetry, metadata, bibliographic, and Drama
-
Dataset
Books containing images about Finland
A dataset derived from the Digitised 19th Century books dataset comprising books with images about Finland, approximately 40 titles. This dataset was compiled by Ruby Dixon, a student at Graveney School who completed work experience at British Library Labs in 2016.British Library ; British Library Labs
books, Finland, metadata, and bibliographic
-
Dataset
Russian language books in the Digitised 19th century books dataset
A dataset which is a subset of the Digitised 19th Century books dataset comprising Russian Language books. The spreadsheet contains metadata of 585 books in Russian. This dataset was compiled by Nadya Miryanova, a student at Lady Eleanor Holles who completed work experience at British Library Labs in 2017.British Library ; British Library Labs
books, metadata, bibliographic, and Russia
-
Dataset
UK Doctoral Thesis Metadata from EThOS
This dataset has been superseded by a more recent version: https://doi.org/10.23636/1188 If you require access to an earlier version, please email openaccess@bl.uk, including the dataset title, date, and DOI in your request. The data in this collection comprises the bibliographic metadata for all UK doctoral theses listed in EThOS, the...British Library ; Rosie, Heather
-
Dataset
British and Irish Newspapers
A title-level list of British, Irish, British Overseas Territories and Crown Dependencies newspapers held by the British Library.British Library
datasets, catalogues, media, newspapers, periodicals, and metadata
-
Dataset
Ground Truth transcriptions for training OCR of historical Arabic handwritten texts
This dataset comprises 120 digitised images (TIFF files) drawn from a selection of historical Arabic scientific manuscripts (10th-19th century) digitised through the British Library Qatar Foundation Partnership. Also contained are ground truth transcriptions (XML) for each page that can be used for training optical character recognition (OCR) or handwritten text...British Library ; Keinan-Schoonbaert, Adi
Arabic, transcription, and OCR
-
Dataset
Judicial Committee of the Privy Council: Linked Appeals Data
The dataset in this collection contains Linked Data about appeal cases heard by the Judicial Committee of the Privy Council between 1860 and 1998. The Judicial Committee of the Privy Council (JCPC) is the final court of appeal for British overseas territories and Crown dependencies, as well as ecclesiastical and...Middle, Sarah
-
Dataset
Ground Truth transcriptions for training OCR of historical Bengali printed texts - Transkribus
This dataset comprises 74 digitised images (TIFF files) drawn from a selection of early printed Bengali books (1713-1914) digitised through the Two Centuries of Indian Print project (https://www.bl.uk/projects/two-centuries-of-indian-print). Also contained are ground truth transcriptions (XML) for each page that can be used for training optical character recognition software on historical...British Library ; Derrick, Tom
OCR, transcription, and Indian
-
Dataset
Ground Truth transcriptions for training OCR of historical Bengali printed texts - Recognition of Early Indian Printed Documents competition
This dataset comprises 81 digitised images (TIFF files) drawn from a selection of early printed Bengali books (1713-1914) digitised through the Two Centuries of Indian Print project (https://www.bl.uk/projects/two-centuries-of-indian-print). Also contained are ground truth transcriptions (XML) for each page that can be used for training optical character recognition software on historical...British Library ; Derrick, Tom
Indian, transcription, and OCR
-
Dataset
Software Citation workshop panel session voting data
This data was captured from an interactive panel discussion session at the Software Citation workshop at the British Library on Monday 13 June. The audience were asked to vote on a series of questions via Mentimeter, with the panel (and audience members) then discussing the results. Mentimeter allows export of...Chue Hong, Neil ; Johnson, Jon ; Whitaker, Kirstie ; Madden, Frances
-
Dataset
Results From A 2015 Survey On Git/Distributed Version Control At Imperial College London
These are the - anonymised - results from a survey run at Imperial College London in November-December 2015. The survey was aimed at users of distributed version control systems, in particular Git. Before publishing the results I deleted all comments to avoid individuals being identified. The survey was designed to...Reimer, Torsten ; Boakye, Gifty
higher education, survey, Git, distributed version control, and software development
-
Dataset
Digitised Hebrew Manuscripts: Or 74 to Stowe Ch 297
This dataset comprises 32 digitised Hebrew manuscripts (1000 - 1903), with their shelfmarks in alphabetical order (Or 74 to Stowe Ch 297). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for image...King, Ellie
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Or 1389 to Sloane MS 3173
This dataset comprises 32 digitised Hebrew manuscripts (1200 - 1899), with their shelfmarks in alphabetical order (Or 1389 to Sloane MS 3173). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for image...King, Ellie
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Or 2518 to Or 5834
This dataset comprises 33 digitised Hebrew manuscripts (900 - 1899), with their shelfmarks in alphabetical order (Or 2518 to Or 5834). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for image and...King, Ellie
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Add MS 27169 to Or 12983
This dataset comprises 21 digitised Hebrew manuscripts (1100 - 1899), with their shelfmarks in alphabetical order (Add MS 27169 to Or 12983). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for image...King, Ellie
manuscripts, Jewish, and Hebrew
-
Dataset
Italian Academies Project Database Images
Italian Academies 1525-1700 images. This dataset comprises 1084 image files in JPEG format. The images include emblems, portraits and title pages for academies, people and works. The XML metadata associated with these images contains references to their file names: https://doi.org/10.21250/iad1
Gianfrancesco, Lorenza ; Testa, Simone ; Everson, Jane ; Reidy, Dennis ; Sampson, Lisa …
-
Dataset
Italian Academies Project Database XML Records
Italian Academies 1525-1700 metadata. This dataset comprises 8598 XML records representing 587 Academies, 7100 people and 911 works. Some of the XML files in this dataset contain links to images, which are available in the Italian Academies Project Database Images dataset: https://doi.org/10.21250/iad2 XML nodes with the 'ImageId' attribute contain the filename...Gianfrancesco, Lorenza ; Testa, Simone ; Everson, Jane ; Reidy, Dennis ; Sampson, Lisa …
-
Dataset
Linked Open British National Bibliography - Forthcoming Books N-Triples and RDF/XML
This dataset includes metadata for forthcoming books to be published or distributed in the UK.
Deliot, Corine
British National Bibliography, forthcoming, linked open data, BNB, N-Triples, RDF/XML, NT, CIP, and metadata
-
Dataset
Linked Open British National Bibliography - Books. 1950- N-Triples and RDF/XML.
This dataset includes metadata for books published or distributed in the UK since 1950.
Deliot, Corine
British National Bibliography, BNB, NT, linked open data, RDF/XML, N-Triples, books, and metadata
-
Dataset
UK Selective Web Archive Classification Dataset. 1996 - 2010. TSV.
The dataset comprises a manually curated selective archive produced by UKWA which includes the classification of sites into a two-tiered subject hierarchy. In partnership with the Internet Archive and JISC, UKWA had obtained access to the subset of the Internet Archive’s web collection that relates to the UK. The JISC...UK Web Archive
archive, web domain dataset, JISC UK, classification dataset, UKWA Open Data, and 1996-2014
-
Dataset
Linked Open British National Bibliography - Serials. 1950- N-Triples and RDF/XML
This dataset includes metadata for serials published or distributed in the UK since 1950.
Deliot, Corine
British National Bibliography, serials, linked open data, BNB, N-Triples, RDF/XML, NT, and metadata
-
Dataset
JISC UK Web Domain Dataset Host Link Graph. 1996 - 2010. TSV.
The dataset comprises ~2.5 billion 200 OK responses from the 1996 - 2010 tranche of the JISC UK Web Domain Dataset which have been scanned for hyperlinks. For each link, UKWA extracts the host that the link targets, and uses this to build up a picture of which hosts have...UKWA Open Data
archive, 1996-2012, web domain dataset, JISC UK, host link graph, and UKWA Open Data
-
Dataset
JISC UK Web Domain Dataset Format Profile. 1996 - 2010.
The dataset is a format profile, summarising media type (MIME type) data formats contained within all of the HTTP 200 OK responses in the 1996 - 2010 tranche of the JISC UK Web Domain Dataset. In partnership with the Internet Archive and JISC, UKWA had obtained access to the subset...UK Web Archive
archive, 1996-2010, web domain dataset, JISC UK, UKWA Open Data, and format profile
-
Dataset
JISC UK Web Domain Dataset Geoindex. 1996 - 2010. TSV.
The dataset comprises ~2.5 billion 200 OK responses in the 1996 - 2010 tranche of the JISC UK Web Domain Dataset which have been scanned for geographic references - specifically postcodes. This set of postcode citations, found at particular URLs and crawled at particular times, forms an historical geoindex...UK Web Archive
archive, 1996-2011, JISC UK, geoindex, UKWA Open Data, and web domain dataset
-
Dataset
JISC UK Web Domain Dataset Crawled URL Index. 1996 - 2013. CDX.
The dataset comprises original compound index (CDX) files that have been re-assembled into 18 separate CDX files for each year of crawling activity represented (1996 - 2013). Please note that the individual CDX files are not sorted. In order to enable access to web archives, UKWA uses CDX files to...UKWA Open Data
archive, 1996-2013, crawled URL index, web domain dataset, JISC UK, and UKWA Open Data
-
Dataset
AAS Card Catalogues: Chinese (Wade Giles)
This dataset contains digitised cards from the Wade-Giles card catalogue.
British Library
-
Dataset
Mapping Jewish Communities Of The Byzantine Empire
The aim of the project was to map the Jewish presence in the Byzantine Empire using GIS (Geographical Information Systems). All references (published and unpublished) to Jewish communities in the Byzantine Empire were gathered and collated. The data were incorporated in a GIS which will be made freely available to...Rees, Gethin ; Lange, Nicholas De ; Panayotov, Alexander ; Palm, Fredrik
-
Dataset
Early Buddhist Rock-Cut Monasteries Of The Western Ghats: Locations
CSV files containing locations of early Buddhist rock-cut monasteries in the Western Ghats and Konkan coast regions of India. Only the caves cut between the second century BCE and the early fifth century CE are included: from the earliest period of cutting up until just prior to the introduction of the...Rees, Gethin
-
Dataset
Early Buddhist Rock-Cut Monasteries Of The Western Ghats: Caves And Sculptures
CSV files containing basic information on early Buddhist rock-cut monasteries in the Western Ghats and Konkan coast regions of India. Only the caves cut between the second century BCE and the early fifth century CE are included: from the earliest period of cutting up until just prior to the introduction of...Rees, Gethin
-
Dataset
Theatrical playbills from Britain and Ireland
The dataset comprises 264 volumes of digitised theatrical playbills published between 1660 and 1902 (mostly 19th century) from England, Scotland, Wales and Ireland. They were digitised from the British Library's physical collection of over 500 volumes of playbills. The dataset is in Portable Document Format (PDF). The playbills cover theatres in Bath (Royal),...British Library Labs ; Kirk, Tanya
singlesheet, playbill, and playbills
-
Dataset
Digitised Hebrew Manuscripts: Add MS 10456 - Add MS 17058
This dataset comprises 25 digitised Hebrew manuscripts (1200 - 1599), with their shelfmarks in alphabetical order (Add MS 10456 - Add MS 17058). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for...Keinan-Schoonbaert, Adi
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Or 2510 - Or 2588
This dataset comprises 40 digitised Hebrew manuscripts (900 - 1747; unknown date), with their shelfmarks in alphabetical order (Or 2510 - Or 2588). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for...Keinan-Schoonbaert, Adi
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Add MS 18229 - Add MS 26897
This dataset comprises 27 digitised Hebrew manuscripts (1100 - 1799; unknown date), with their shelfmarks in alphabetical order (Add MS 18229 - Add MS 26897). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be...Keinan-Schoonbaert, Adi
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Or 2626 - Or 6425
This dataset comprises 43 digitised Hebrew manuscripts (920 - 1845; unknown date), with their shelfmarks in alphabetical order (Or 2626 - Or 6425). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for...Keinan-Schoonbaert, Adi
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Harley MS 5709 - Or 11016
This dataset comprises 25 digitised Hebrew manuscripts (1100 - 1699; unknown date), with their shelfmarks in alphabetical order (Harley MS 5709 - Or 11016). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used...Keinan-Schoonbaert, Adi
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Add MS 9399 - Harley MS 5708
This dataset comprises 27 digitised Hebrew manuscripts (1200 - 1499; unknown date), with their shelfmarks in alphabetical order (Add MS 9399 - Harley MS 5708). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be...Keinan-Schoonbaert, Adi
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Add MS 26938 - Add MS 9398
This dataset comprises 35 digitised Hebrew manuscripts (1100 - 1816; unknown date), with their shelfmarks in alphabetical order (Add MS 26938 - Add MS 9398). These manuscripts are out of copyright. We would appreciate it if users could read our Ethical terms of use guide before reusing our Hebrew manuscripts...Keinan-Schoonbaert, Adi
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Or 2210 - Or 2364
This dataset comprises 21 digitised Hebrew manuscripts (1000 - 1655; unknown date), with their shelfmarks in alphabetical order (Or 2210 - Or 2364). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for...Keinan-Schoonbaert, Adi
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Or 2365 - Or 2405
This dataset comprises 27 digitised Hebrew manuscripts (1200 - 1867; unknown date), with their shelfmarks in alphabetical order (Or 2365 - Or 2405). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for...Keinan-Schoonbaert, Adi
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Or 1103 - Or 2201
This dataset comprises 46 digitised Hebrew manuscripts (1250 - 1699; unknown date), with their shelfmarks in alphabetical order (Or 1103 - Or 2201). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for...Keinan-Schoonbaert, Adi
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Or 2406 - Or 2509
This dataset comprises 41 digitised Hebrew manuscripts (1300 - 1799; unknown date), with their shelfmarks in alphabetical order (Or 2406 - Or 2509). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for...Keinan-Schoonbaert, Adi
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Or 63 - Or 9882
This dataset comprises 35 digitised Hebrew manuscripts (900 - 1899), with their shelfmarks in alphabetical order (Or 63 - Or 9882). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for image and...Keinan-Schoonbaert, Adi
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Add 10455 to Or 1099
This dataset comprises 39 digitised Hebrew manuscripts (1200 - 1799), with their shelfmarks in alphabetical order (Add 10455 - Or 1099). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for image and...Cronin, Catherine
manuscripts, Jewish, and Hebrew