Buscar
Resultados de la búsqueda
-
Dataset
JISC UK Web Domain Dataset Geoindex. 1996 - 2010. TSV.
The dataset comprises ~2.5 billion 200 OK responses in the 1996 - 2010 tranche of the JISC UK Web Domain Dataset Dataset which have been scanned for geographic references - specifically postcodes. This set of postcode citations, found at particular URLs and crawled at particular times, forms an historical geoindex...UK Web Archive
archive, 1996-2011, JISC UK, geoindex, UKWA Open Data, and web domain dataset
-
Dataset
JISC UK Web Domain Dataset Format Profile. 1996 - 2010.
The dataset is a format profile, summarising media type (MIME type) data formats contained within all of the HTTP 200 OK responses in the 1996 - 2010 tranche of the JISC UK Web Domain Dataset. In partnership with the Internet Archive and JISC, UKWA had obtained access to the subset...UK Web Archive
archive, 1996-2010, web domain dataset, JISC UK, UKWA Open Data, and format profile
-
Dataset
JISC UK Web Domain Dataset Host Link Graph. 1996 - 2010. TSV.
The dataset comprises ~2.5 billion 200 OK responses from the 1996 - 2010 tranche of the JISC UK Web Domain Dataset which have been scanned for hyperlinks. For each link, UKWA extracts the host that the link targets, and uses this to build up a picture of which hosts have...UKWA Open Data
archive, 1996-2012, web domain dataset, JISC UK, host link graph, and UKWA Open Data
-
Dataset
Linked Open British National Bibliography - Serials. 1950- N-Triples and RDF/XML
This dataset includes metadata for serials published or distributed in the UK since 1950.Deliot, Corine
British National Bibliography, serials, linked open data, BNB, N-Triples, RDF/XML, NT, and metadata
-
Dataset
UK Selective Web Archive Classification Dataset. 1996 - 2010. TSV.
The dataset comprises a manually curated selective archive produced by UKWA which includes the classification of sites into a two-tiered subject hierarchy. In partnership with the Internet Archive and JISC, UKWA had obtained access to the subset of the Internet Archive’s web collection that relates to the UK. The JISC...UK Web Archive
archive, web domain dataset, JISC UK, classification dataset, UKWA Open Data, and 1996-2014
-
Dataset
Linked Open British National Bibliography - Forthcoming Books N-Triples and RDF/XML
This dataset includes metadata for forthcoming books to be published or distributed in the UK.Deliot, Corine
British National Bibliography, forthcoming, linked open data, BNB, N-Triples, RDF/XML, NT, CIP, and metadata
-
Dataset
Italian Academies Project Database XML Records
Italian Academies 1525-1700 metadata.This dataset comprises 8598 XML records representing 587 Academies, 7100 people and 911 works. Some of the XML files in this dataset contain links to images, which are available in the Italian Academies Project Database Images dataset: https://doi.org/10.21250/iad2 XML nodes with the 'ImageId' attribute contain the filename...Gianfrancesco, Lorenza ; Testa, Simone ; Everson, Jane ; Reidy, Dennis ; Sampson, Lisa …
-
Dataset
Italian Academies Project Database Images
Italian Academies 1525-1700 images. This dataset comprises 1084 image files in JPEG format. The images include emblems, portraits and title pages for academies, people and works. The XML metadata associated with these images contains references to their file names: https://doi.org/10.21250/iad1Gianfrancesco, Lorenza ; Testa, Simone ; Everson, Jane ; Reidy, Dennis ; Sampson, Lisa …
-
Dataset
Digitised Hebrew Manuscripts: Add MS 27169 to Or 12983
This dataset comprises 21 digitised Hebrew manuscripts (1100 - 1899), with their shelfmarks in alphabetical order (Add MS 27169 to Or 12983). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for image...King, Ellie
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Or 2518 to Or 5834
This dataset comprises 33 digitised Hebrew manuscripts (900 - 1899), with their shelfmarks in alphabetical order (Or 2518 to Or 5834). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for image and...King, Ellie
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Or 1389 to Sloane MS 3173
This dataset comprises 32 digitised Hebrew manuscripts (1200 - 1899), with their shelfmarks in alphabetical order (Or 1389 to Sloane MS 3173). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for image...King, Ellie
manuscripts, Jewish, and Hebrew
-
Dataset
Digitised Hebrew Manuscripts: Or 74 to Stowe Ch 297
This dataset comprises 32 digitised Hebrew manuscripts (1000 - 1903), with their shelfmarks in alphabetical order (Or 74 to Stowe Ch 297). These manuscripts are out of copyright. These JPEG files were converted from TIFF files using IrfanView, and then further compressed using JPEGMini. They can be used for image...King, Ellie
manuscripts, Jewish, and Hebrew
-
Dataset
Ground Truth transcriptions for training OCR of historical Bengali printed texts - Recognition of Early Indian Printed Documents competition
This dataset comprises 81 digitised images (TIFF files) drawn from a selection of early printed Bengali books (1713-1914) digitised through the Two Centuries of Indian Print project (https://www.bl.uk/projects/two-centuries-of-indian-print). Also contained are ground truth transcriptions (XML) for each page that can be used for training optical character recognition software on historical...British Library ; Derrick, Tom
Indian, transcription, and OCR
-
Dataset
Ground Truth transcriptions for training OCR of historical Bengali printed texts - Transkribus
This dataset comprises 74 digitised images (TIFF files) drawn from a selection of early printed Bengali books (1713-1914) digitised through the Two Centuries of Indian Print project (https://www.bl.uk/projects/two-centuries-of-indian-print). Also contained are ground truth transcriptions (XML) for each page that can be used for training optical character recognition software on historical...British Library ; Derrick, Tom
OCR, transcription, and Indian
-
Dataset
Judicial Committee of the Privy Council: Linked Appeals Data
The dataset in this collection contains Linked Data about appeal cases heard by the Judicial Committee of the Privy Council between 1860 and 1998. The Judicial Committee of the Privy Council (JCPC) is the final court of appeal for British overseas territories and Crown dependencies, as well as ecclesiastical and...Middle, Sarah
-
Dataset
Ground Truth transcriptions for training OCR of historical Arabic handwritten texts
This dataset comprises 120 digitised images (TIFF files) drawn from a selection of historical Arabic scientific manuscripts (10th-19th century) digitised through the British Library Qatar Foundation Partnership. Also contained are ground truth transcriptions (XML) for each page that can be used for training optical character recognition (OCR) or handwritten text...British Library ; Keinan-Schoonbaert, Adi
Arabic, transcription, and OCR
-
Dataset
UK Doctoral Thesis Metadata from EThOS
This dataset has been superseded by a more recent version: https://doi.org/10.23636/1188 If you require access to an earlier version, please email openaccess@bl.uk, including the dataset title, date, and DOI in your request. The data in this collection comprises the bibliographic metadata for all UK doctoral theses listed in EThOS, the...British Library ; Rosie, Heather
-
Dataset
Russian language books in the Digitised 19th century books dataset
A dataset which is a subset of the Digitised 19th Century books dataset comprising Russian Language books. The spreadsheet contains metadata of 585 books in Russian. This dataset was compiled by Nadya Miryanova a student at Lady Eleanor Holles who completed work experience at British Library Labs in 2017.British Library ; British Library Labs
books, metadata, bibliographic, and Russia
-
Dataset
Books containing images about Finland
A dataset derived from the Digitised 19th Century books dataset comprising books with images about Finland, approximately 40 titles. This dataset was compiled by Ruby Dixon a student at Graveney School who completed work experience at British Library Labs in 2016.British Library ; British Library Labs
books, Finland, metadata, and bibliographic
-
Dataset
Books divided by Genre from the Digitised 19th century books dataset
A dataset derived from the Digitised 19th Century Books dataset which classifies the books by genre (Drama, Poetry, Prose, Music and unidentified). For Drama, Music and Prose several types were identified. For Drama: comedy, play, recitation and tragedy. For Prose: novel, parody, romance, satire, story, history subset of story and...British Library ; British Library Labs
Music, Genre, Prose, books, Poetry, metadata, bibliographic, and Drama
- « Anterior
- Siguiente »
- 1
- 2
- 3
- 4
- 5
- 6
- 7