Ricerca
Risultati della ricerca
-
Dataset
OCR text derived from digitised books published 1850 - 1859 in ALTO XML
This set consists 5818 volumes, published between 1850-1859. The dataset comprises text from the collection of digitised books created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO)...British Library ; British Library Labs
XML, books, digitised, metadata, Microsoft, nineteenth century, and ALTO
-
Dataset
OCR text derived from digitised books published 1860 - 1869 in ALTO XML
This set consists 7498 volumes, published between 1860-1869. The dataset comprises text from the collection of digitised books created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO)...British Library ; British Library Labs
XML, books, digitised, metadata, Microsoft, nineteenth century, and ALTO
-
Dataset
OCR text derived from digitised books published 1870 - 1879 in ALTO XML
This set consists 8630 volumes, published between 1870-1879. The dataset comprises text from the collection of digitised books created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO)...British Library ; British Library Labs
XML, books, digitised, metadata, Microsoft, nineteenth century, and ALTO
-
Dataset
OCR text derived from digitised books published 1880 - 1889 in ALTO XML
This set consists 10856 volumes, published between 1880-1889. The dataset comprises text from the collection of digitised books created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO)...British Library ; British Library Labs
XML, books, digitised, metadata, Microsoft, ALTO, and nineteenth century
-
Dataset
Pelagios Project: Single Sheets and Supplementary Materials
This dataset comprises 30 images selected among individual maps and non-map based medieval materials such as travel accounts. The digitisation was sponsored by A. W. Mellon Foundation through the Pelagios Project. Due to UK copyright law they are technically in copyright until 2039. However, given the age of the manuscripts...British Library
XML, medieval, metadata, manuscripts, and maps
-
Dataset
Pelagios Project: Liber insularum Cycladum. Arundel MS 93.art.7
This dataset comprises 45 images from the Liber insularum Cycladum produced by Christophori Bondelmonti around 1422. The digitisation was sponsored by A. W. Mellon Foundation through the Pelagios Project. Due to UK copyright law they are technically in copyright until 2039. However, given the age of the manuscripts and their...British Library
XML, medieval, metadata, manuscripts, and maps
-
Dataset
Pelagios Project: Digitised Insularium Illustratum. Additional MS 15760
This dataset comprises 123 images from the Insularium Illustratum, an account of the islands of the Mediterranean, and of some others produced by Henricus Martellus Germanus in 1495. The digitisation was sponsored by A. W. Mellon Foundation through the Pelagios Project. Due to UK copyright law they are technically in...British Library
XML, medieval, metadata, manuscripts, and maps
-
Dataset
Pelagios Project: Digitised Liber insularum Arcipelagi Cotton MS Vespasian a.XIII.art.1
This dataset comprises 82 images from the Liber insularum Arcipegelagi, an illustrated account of the islands and major ports of the Mediterranean produced by Christophori Bondelmonti around 1422. The digitisation was sponsored by A. W. Mellon Foundation through the Pelagios Project. Due to UK copyright law they are technically in...British Library
XML, medieval, metadata, manuscripts, and maps
-
Dataset
Pelagios Project: Maps after Ptolemy's Geographia. Burney MS 111
This dataset comprises 68 images from a Greek manuscript edition of Ptolemy's Geography containing many diagrams and coloured maps, and produced between the 1375 and 1425. The digitisation was sponsored by A. W. Mellon Foundation through the Pelagios Project. Due to UK copyright law they are technically in copyright until...British Library
XML, medieval, metadata, manuscripts, and maps
-
Dataset
Pelagios Project: Portolano. Egerton MS 2855
This dataset comprises 7 images from a Portolano produced in 1473 by Grazioso Benincasa. The digitisation was sponsored by A. W. Mellon Foundation through the Pelagios Project. Update – 9/3/2018: Please note that there were technical issues with this dataset. If you downloaded before 9/3/2018, please replace with this repaired...British Library
XML, medieval, metadata, manuscripts, and maps
-
Dataset
Theatrical playbills from Britain and Ireland (OCR text only)
The dataset comprises 264 volumes of digitised theatrical playbills published between 1660 – 1902 (mostly 19th century) from England, Scotland, Wales and Ireland. Digitised from the British Library's physical collection of over 500 volumes of playbills. The dataset contains text files (.TXT) in Optical Character Recognition (OCR) format. The playbills...British Library Labs
singlesheet, text, playbill, OCR, and playbills
-
Dataset
Digitised Books - Images identified as Medium Sized Images. c. 1567 - c. 1900. JPG
The dataset comprises c. 217,101 images identified as 'Medium Sized Images' from the British Library's Flickr Commons collections, dating between c. 1567 - c. 1900. The images were algorithmically gathered from 49,455 digitised books, equating to 65,227 volumes (25+ million pages), published between c. 1510 - c. 1900; Medium Sized...British Library ; British Library Labs
digitised, books, Microsoft, images, and medium sized images
-
Dataset
Digitised Books - Images identified as Embellishments. c. 1510 - c. 1900. JPG
The images were algorithmically gathered from 49,455 digitised books, equating to 65,227 volumes (25+ million pages), published between c. 1510 - c. 1900. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The images are in .JPEG format.British Library ; British Library Labs
embellishments, digitised, books, Microsoft, and images
-
Dataset
OCR text derived from digitised books published 1890 - 1899 in ALTO XML
This set consists 14847 volumes, published between 1890-1899. The dataset comprises text from the collection of digitised books created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO)...British Library ; British Library Labs
XML, books, digitised, metadata, Microsoft, nineteenth century, and ALTO
-
Dataset
Portraits of actors, views of theatres and playbills (covering 1750 - 1821 in a single volume)
166 page PDF of collated portraits and views (with OCR-derived text) The dataset comprises one digitised volume (166 pages) of a collection of portraits of celebrated actors and actresses, views of theatres and playbills, dating 1750 - 1821. The dataset is in Portable Document Format (PDF).British Library
text, theatres, views, portraits, actors, OCR, and playbills
-
Dataset
Volumes of signs of taverns in England and Wales. 1628 - 1858
The dataset comprises 14 digitised volumes (as PDFs) of a collection of tavern signs in and England and Wales dating 1628 – 1858 (with OCR-derived text.)British Library
-
Dataset
Volumes of performances connecting Sir Henry Irving. 1879 - 1905.
Sir Henry Irving's American and Provincial Tours 1883 - 1905; miscellaneous performances, including some given by Royal Command, 1883 - 1903; Lyceum Theatre 1879 – 1902; and Drury Lane Theatre, 1903 and 1905. The collection was formed by Bram Stoker.British Library
-
Dataset
Volumes of Lysons Collectanea (Trades), comprising advertisements, cuttings, and illustrations relating to trades, professions, medical cures. 1660-1825.
The dataset comprises the OCR text derived from four digitised volumes of a collection of advertisements, cuttings and illustrations relating to trades, professions and medical cures from 1660 - 1825.British Library
text, newspapers, OCR, trades, and adverts
-
Dataset
Volumes of Lysons Collectanea (Amusements), comprising broadsides, cuttings, advertisements on amusements 1660-1840
The dataset comprises nine digitised volumes of a collection of broadsides, cuttings and advertisements, relating to public exhibitions and places of amusement from 1660 - 1840 (with OCR-derived text.) Part of the Lysons Collectanea collection.British Library
amusements, text, newspapers, broadsides, OCR, and adverts
-
Dataset
Volumes of portraits and biographies of officers in the South African wars collected by John Malcolm Bulloch. 1900 - 1902.
The dataset comprises six digitised volumes (in PDF) of a collection of portraits and biographical details of some officers distinguished in the South African War (1900 - 1902) (with OCR-derived text.) The collection was formed by John Malcolm Bulloch..British Library
South Africa, text, portraits, war, army, OCR, biographies, and biography
-
Dataset
Volumes of Madden's cuttings, views, and pamphlets about the British Museum. 1755-1870.
The dataset comprises four digitised volumes of a collection of cuttings, views and pamphlets made by Sir Frederic Madden about the British Museum, dating 1755 - 1870 (with OCR-derived text.)British Library
British Museum, text, and OCR
-
Dataset
Volume of Christmas ballads and broadsides. 1750 - 1840
110 page PDF of miscellaneous Christmas ballads and prose broadsides (with OCR-derived text.) The dataset comprises one digitised volume (110 pages) of a collection of Christmas ballads and prose broadsides chiefly printed in London by J. Pitts between 1750 - 1840. The dataset is in Portable Document Format (PDF).British Library
-
Dataset
Digitised Quarterly Lists XML and Metadata
Two Centuries of Indian Print 1867-1947. The files in this dataset are derived from the British Library’s collection of bound volume Quarterly Lists: printed catalogue records of Indian books published quarterly and by province of British India between 1867 and 1947. The dataset comprises text from the collection of digitised...Derrick, Tom
-
Dataset
Digitised Quarterly Lists PDFs and Metadata
The files in this dataset are derived from the British Library’s collection of bound volume Quarterly Lists: printed catalogue records of Indian books published quarterly and by province of British India between 1867 and 1947. The dataset comprises full-text searchable PDFs of 215 volumes as well as the associated metadata...Derrick, Tom
-
Dataset
Early German Bookbindings in the British Library
This dataset comprises approx. 650 records relating to mainly early printed books (and some manuscripts) in the British Library that have bookbindings from Germany or German influenced or speaking regions. Most were bound from 15c to 17c. Workshops have been identified using standard online and printed resources (see separate document...Marks, P. J. M.
finishing tools, tools, blind tooled, workshop, leather, blind tooled bookbinding, pigskin binding, Einbanddatenbank, German binding, Kyriss, calf, Schwenke/Schunke, and bookbinding
-
Dataset
UK Doctoral Thesis Metadata from EThOS
This dataset has been superseded by a more recent version: https://doi.org/10.23636/1137 If you require access to an earlier version, please email openaccess@bl.uk, including the dataset title, date, and DOI in your request. The data in this collection comprises the bibliographic metadata for all UK doctoral theses listed in EThOS, the UK's national thesis service. We...British Library ; Rosie, Heather
Higher education, dissertations, HE, research, doctoral, student, UK, theses, ethos, and thesis
-
Dataset
Digitised Hebrew Manuscripts: Metadata
This dataset contains metadata (TEI XML catalogue records) of all British Library digitised Hebrew manuscripts. These TEI XML files can be opened and manipulated using an XML editor.Keinan-Schoonbaert, Adi
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Add MS 5242 to Arundel Or 50
This dataset comprises 22 digitised Hebrew manuscripts (1100 - 1799), with their shelfmarks in alphabetical order (Add MS 5242 to Arundel Or 50). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for...King, Ellie
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Or 46 to Sloane 2642
This dataset comprises 35 digitised Hebrew manuscripts (1000 - 1899), with their shelfmarks in alphabetical order (Or 46 - Sloane 2642). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for image and...Cronin, Catherine
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Add Ch 1250 to Or 11625
This dataset comprises 78 digitised Hebrew manuscripts (1182 - 1895), with their shelfmarks in alphabetical order (Add Ch 1250 to Or 11625). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for image...Cronin, Catherine
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Harley 5772 to Or 14580
This dataset comprises 42 digitised Hebrew manuscripts (1200 - 1871), with their shelfmarks in alphabetical order (Harley 5772 - Or 14580). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for image and...Cronin, Catherine
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Add 10455 to Or 1099
This dataset comprises 39 digitised Hebrew manuscripts (1200 - 1799), with their shelfmarks in alphabetical order (Add 10455 - Or 1099). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for image and...Cronin, Catherine
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Or 63 - Or 9882
This dataset comprises 35 digitised Hebrew manuscripts (0900 - 1899), with their shelfmarks in alphabetical order (Or 63 - Or 9882). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for image and...Keinan-Schoonbaert, Adi
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Or 2406 - Or 2509
This dataset comprises 41 digitised Hebrew manuscripts (1300 - 1799; unknown date), with their shelfmarks in alphabetical order (Or 2406 - Or 2509). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for...Keinan-Schoonbaert, Adi
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Or 1103 - Or 2201
This dataset comprises 46 digitised Hebrew manuscripts (1250 - 1699; unknown date), with their shelfmarks in alphabetical order (Or 1103 - Or 2201). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for...Keinan-Schoonbaert, Adi
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Or 2365 - Or 2405
This dataset comprises 27 digitised Hebrew manuscripts (1200 - 1867; unknown date), with their shelfmarks in alphabetical order (Or 2365 - Or 2405). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for...Keinan-Schoonbaert, Adi
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Or 2210 - Or 2364
This dataset comprises 21 digitised Hebrew manuscripts (1000 - 1655; unknown date), with their shelfmarks in alphabetical order (Or 2210 - Or 2364). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for...Keinan-Schoonbaert, Adi
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Add MS 26938 - Add MS 9398
This dataset comprises 35 digitised Hebrew manuscripts (1100 - 1816; unknown date), with their shelfmarks in alphabetical order (Add MS 26938 - Add MS 9398). These manuscripts are out of copyright. We would appreciate it if users could read our Ethical terms of use guide before reusing our Hebrew manuscripts...Keinan-Schoonbaert, Adi
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Add MS 9399 - Harley MS 5708
This dataset comprises 27 digitised Hebrew manuscripts (1200 - 1499; unknown date), with their shelfmarks in alphabetical order (Add MS 9399 - Harley MS 5708). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be...Keinan-Schoonbaert, Adi
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Harley MS 5709 - Or 11016
This dataset comprises 25 digitised Hebrew manuscripts (1100 - 1699; unknown date), with their shelfmarks in alphabetical order (Harley MS 5709 - Or 11016). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used...Keinan-Schoonbaert, Adi
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Or 2626 - Or 6425
This dataset comprises 43 digitised Hebrew manuscripts (920 - 1845; unknown date), with their shelfmarks in alphabetical order (Or 2626 - Or 6425). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for...Keinan-Schoonbaert, Adi
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Add MS 18229 - Add MS 26897
This dataset comprises 27 digitised Hebrew manuscripts (1100 - 1799; unknown date), with their shelfmarks in alphabetical order (Add MS 18229 - Add MS 26897). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be...Keinan-Schoonbaert, Adi
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Or 2510 - Or 2588
This dataset comprises 40 digitised Hebrew manuscripts (900 - 1747; unknown date), with their shelfmarks in alphabetical order (Or 2510 - Or 2588). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for...Keinan-Schoonbaert, Adi
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Add MS 10456 - Add MS 17058
This dataset comprises 25 digitised Hebrew manuscripts (1200 - 1599), with their shelfmarks in alphabetical order (Add MS 10456 - Add MS 17058). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for...Keinan-Schoonbaert, Adi
manuscripts, Jewish, and Hebrew
-
Dataset
Theatrical playbills from Britain and Ireland
The dataset comprises 264 volumes of digitised theatrical playbills published between 1660 – 1902 (mostly 19th century) from England, Scotland, Wales and Ireland. Digitised from the British Library's physical collection of over 500 volumes of playbills. The dataset in Portable Document Format (PDF). The playbills cover theatres in Bath (Royal),...British Library Labs ; Kirk, Tanya
singlesheet, playbill, and playbills
-
Dataset
Early Buddhist Rock-Cut Monasteries Of The Western Ghats: Caves And Sculptures
CSV files containing basic information on early Buddhist rock-cut monasteries in the Western Ghats and Konkan coast regions of India. Only the caves cut between second century BCE and the early fifth century CE are included: from the earliest period of cutting up until just prior to the introduction of...Rees, Gethin
-
Dataset
Early Buddhist Rock-Cut Monasteries Of The Western Ghats: Locations
CSV files containing locations of early Buddhist rock-cut monasteries in the Western Ghats and Konkan coast regions of India. Only the caves cut between second century BCE and the early fifth century CE are included: from the earliest period of cutting up until just prior to the introduction of the...Rees, Gethin
-
Dataset
Mapping Jewish Communities Of The Byzantine Empire
The aim of the project was to map the Jewish presence in the Byzantine empire using GIS (Geographical Information Systems). All references (published and unpublished) to Jewish communities in the Byzantine Empire were gathered and collated. The data were incorporated in a GIS which will be made freely available to...Rees, Gethin ; Lange, Nicholas De ; Panayotov, Alexander ; Palm, Fredrik
-
Dataset
Digitised Books - Images identified as Plates. c. 1528 - c. 1900. JPG
The dataset comprises c. 385,237 images identified as 'Plates' from the British Library's Flickr Commons collections, dating between c. 1528 – c. 1900. The images were algorithmically gathered from 49,455 digitised books, equating to 65,227 volumes (25+ million pages), published between c. 1510 - c. 1900; Plates have currently been...British Library ; British Library Labs
-
Dataset
AAS Card Catalogues: Chinese (Wade Giles)
This dataset contains digitised cards from the Wade-Giles card catalogue.British Library