Search results
-
Journal article
Automated Language Identification of Bibliographic Resources
This article describes experiments in the use of machine learning techniques at the British Library to assign language codes to catalog records, in order to provide information about the language of content of the resources described. In the first phase of the project, language codes were assigned to 1.15 million...
Morris, Victoria
machine learning, automatic metadata generation, legacy record enhancement, metadata, and language identification
-
Journal article
MARC transformed: MARC and XML – the perfect partnership?
I first met a very British version of MARC (Machine Readable Cataloguing) in 1983, straight out of university. I didn't know anything about cataloguing, indexing, classification, or data. MARC made sense of it all. AACR2 (Anglo-American Cataloguing Rules, 2nd Edition) was impenetrable without MARC as a framework. LCSH (Library of...
Rosie, Heather
MarcEdit, LCSH, EThOS, Dublin Core, TDM, XML, MARC Report, metadata, MARC 21, MARC, OAI-PMH, and AACR2
-
Dataset
OCR text derived from digitised books (unknown precise publication dates) in ALTO XML
This set consists of 284 volumes. The dataset comprises text from the collection of digitised books created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO) Extensible Markup Language...
British Library ; British Library Labs
XML, microsoft, books, digitised, metadata, nineteenth century, and ALTO
-
Dataset
Digitised 19th Century Books - Metadata - 01/09/2013
The dataset holds metadata on the books digitised within this collection, providing a quick means to connect a book identifier with some of the key bibliographic metadata about it. The metadata is held in JSON notation for ease of reuse.
British Library ; British Library Labs
JSON, microsoft, books, metadata, and bibliographic
-
Dataset
Pelagios Project: Digitised Cornaro Atlas. Egerton MS 73
This dataset comprises 37 images from a Portolano executed by different Venetian artists between 1489 and 1492, known as the 'Cornaro Atlas'. The digitisation was sponsored by the A. W. Mellon Foundation through the Pelagios Project. Due to UK copyright law they are technically in copyright until 2039. However, given the...
British Library
XML, medieval, metadata, manuscripts, and maps
-
Dataset
Pelagios Project: Single Sheets and Supplementary Materials
This dataset comprises 30 images selected from among individual maps and non-map-based medieval materials such as travel accounts. The digitisation was sponsored by the A. W. Mellon Foundation through the Pelagios Project. Due to UK copyright law they are technically in copyright until 2039. However, given the age of the manuscripts...
British Library
XML, medieval, metadata, manuscripts, and maps
-
Dataset
Pelagios Project: Liber insularum Cycladum. Arundel MS 93.art.7
This dataset comprises 45 images from the Liber insularum Cycladum produced by Christophori Bondelmonti around 1422. The digitisation was sponsored by the A. W. Mellon Foundation through the Pelagios Project. Due to UK copyright law they are technically in copyright until 2039. However, given the age of the manuscripts and their...
British Library
XML, medieval, metadata, manuscripts, and maps
-
Dataset
Pelagios Project: Digitised Insularium Illustratum. Additional MS 15760
This dataset comprises 123 images from the Insularium Illustratum, an account of the islands of the Mediterranean and of some others, produced by Henricus Martellus Germanus in 1495. The digitisation was sponsored by the A. W. Mellon Foundation through the Pelagios Project. Due to UK copyright law they are technically in...
British Library
XML, medieval, metadata, manuscripts, and maps
-
Dataset
Pelagios Project: Digitised Liber insularum Arcipelagi Cotton MS Vespasian a.XIII.art.1
This dataset comprises 82 images from the Liber insularum Arcipelagi, an illustrated account of the islands and major ports of the Mediterranean produced by Christophori Bondelmonti around 1422. The digitisation was sponsored by the A. W. Mellon Foundation through the Pelagios Project. Due to UK copyright law they are technically in...
British Library
XML, medieval, metadata, manuscripts, and maps
-
Dataset
Pelagios Project: Maps after Ptolemy's Geographia. Burney MS 111
This dataset comprises 68 images from a Greek manuscript edition of Ptolemy's Geography containing many diagrams and coloured maps, produced between 1375 and 1425. The digitisation was sponsored by the A. W. Mellon Foundation through the Pelagios Project. Due to UK copyright law they are technically in copyright until...
British Library
XML, medieval, metadata, manuscripts, and maps
-
Dataset
Pelagios Project: Portolano. Egerton MS 2855
This dataset comprises 7 images from a Portolano produced in 1473 by Grazioso Benincasa. The digitisation was sponsored by the A. W. Mellon Foundation through the Pelagios Project. Update – 9/3/2018: Please note that there were technical issues with this dataset. If you downloaded before 9/3/2018, please replace with this repaired...
British Library
XML, medieval, metadata, manuscripts, and maps
-
Dataset
Digitised Quarterly Lists PDFs and Metadata
The files in this dataset are derived from the British Library’s collection of bound volume Quarterly Lists: printed catalogue records of Indian books published quarterly and by province of British India between 1867 and 1947. The dataset comprises full-text searchable PDFs of 215 volumes as well as the associated metadata...
Derrick, Tom
-
Dataset
Digitised Quarterly Lists XML and Metadata
Two Centuries of Indian Print 1867-1947. The files in this dataset are derived from the British Library’s collection of bound volume Quarterly Lists: printed catalogue records of Indian books published quarterly and by province of British India between 1867 and 1947. The dataset comprises text from the collection of digitised...
Derrick, Tom
-
Dataset
Linked Open British National Bibliography - Serials. 1950- N-Triples and RDF/XML
This dataset includes metadata for serials published or distributed in the UK since 1950.
Deliot, Corine
British National Bibliography, serials, linked open data, BNB, N-Triples, RDF/XML, NT, and metadata
-
Dataset
Linked Open British National Bibliography - Books. 1950- N-Triples and RDF/XML.
This dataset includes metadata for books published or distributed in the UK since 1950.
Deliot, Corine
British National Bibliography, BNB, NT, linked open data, RDF/XML, N-Triples, books, and metadata
-
Dataset
Linked Open British National Bibliography - Forthcoming Books N-Triples and RDF/XML
This dataset includes metadata for forthcoming books to be published or distributed in the UK.
Deliot, Corine
British National Bibliography, forthcoming, linked open data, BNB, N-Triples, RDF/XML, NT, CIP, and metadata
-
Dataset
British and Irish Newspapers
A title-level list of British, Irish, British Overseas Territories and Crown Dependencies newspapers held by the British Library.
British Library
datasets, catalogues, media, newspapers, periodicals, and metadata
-
Dataset
Russian language books in the Digitised 19th century books dataset
A subset of the Digitised 19th Century Books dataset comprising Russian-language books. The spreadsheet contains metadata for 585 books in Russian. This dataset was compiled by Nadya Miryanova, a student at Lady Eleanor Holles, who completed work experience at British Library Labs in 2017.
British Library ; British Library Labs
books, metadata, bibliographic, and Russia
-
Dataset
Books containing images about Finland
A dataset derived from the Digitised 19th Century Books dataset comprising books with images about Finland, approximately 40 titles. This dataset was compiled by Ruby Dixon, a student at Graveney School, who completed work experience at British Library Labs in 2016.
British Library ; British Library Labs
books, Finland, metadata, and bibliographic
-
Dataset
Books divided by Genre from the Digitised 19th century books dataset
A dataset derived from the Digitised 19th Century Books dataset which classifies the books by genre (Drama, Poetry, Prose, Music and unidentified). For Drama, Music and Prose several types were identified. For Drama: comedy, play, recitation and tragedy. For Prose: novel, parody, romance, satire, story, history subset of story and...
British Library ; British Library Labs
Music, Genre, Prose, books, Poetry, metadata, bibliographic, and Drama