Search Constraints
Search Results
-
Dataset
Denton and Haughton Examiner
Denton and Haughton Examiner was a weekly newspaper which has been digitised by the British Library for the Living with Machines project. Variant titles are 1873�74 The Denton, Haughton, & District Weekly News. 1874�75 Denton & Haughton Weekly News, and Audenshaw, Hooley Hill, and Dukinfield Advertiser, 1875�78 Denton Examiner, Audenshaw,...British Library
-
Dataset
Barrow Herald and Furness Advertiser
Barrow Herald and Furness Advertiser. (1863 - 1914) was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
Atherstone, Nuneaton, and Warwickshire Times
Atherstone, Nuneaton, and Warwickshire Times was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
Swansea and Glamorgan Herald, and South Wales Free Press
Swansea and Glamorgan Herald, and South Wales Free Press. (1847 - 1890) was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
Cradley Heath & Stourbridge Observer
Cradley Heath & Stourbridge Observer. (1864 - 1888) was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
Potteries Examiner
Potteries Examiner (1871 - 1881) was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
Glasgow Chronicle
Glasgow Chronicle was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
Northern Guardian (Hartlepool)
Northern Guardian (Hartlepool) (1891 - 1902) was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
North Cumberland Reformer
North Cumberland Reformer (1890 - 1898) was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
Lancaster Standard and County Advertiser
Lancaster Standard and County Advertiser was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
Lancaster Herald and Town and County Advertiser
Lancaster Herald and Town and County Advertiser was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
Alston Herald, and East Cumberland Advertiser
Alston Herald, and East Cumberland Advertiser was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
Weekly Journal
The file consists of the OCR (Optical Character Recognition) text in XML format for one year of Weekly Journal (Hartlepool) 1901. The full digitised newspaper comprises no. 1–407 (29 Nov.1901 – 17 Sep.1909). The digitised page images are available on the British Newspaper Archive website, https://www.britishnewspaperarchive.co.uk/titles/weekly-journal-hartlepool The British Newspaper Archive...British Library
-
Dataset
Tamworth Miners' Examiner and Working Men's Journal
Tamworth Miners' Examiner and Working Men's Journal (1873 - 1876) was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
Stretford and Urmston Examiner
Stretford and Urmston Examiner. (1879 - 1880) was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
Stalybridge Examiner
Stalybridge Examiner (1876) which has been digitised by the British Library for the Living with Machines project.British Library
-
Dataset
St. Helens Examiner
St. Helens Examiner was a weekly newspaper which has been digitised by the British Library for the Living with Machines project.British Library
-
Dataset
The Stockton Examiner
The Stockton Examiner (1878-1879) was a weekly newspaper which has been digitised by the British Library for the Living with Machines project.British Library
-
Dataset
The Cannock Chase Examiner
The Cannock Chase Examiner (1874-1877) was a weekly newspaper which has been digitised by the British Library for the Living with Machines project.British Library
-
Dataset
Midland Examiner and Wolverhampton Times
Midland Examiner and Wolverhampton Times (1874-1878) was a weekly newspaper which has been digitised by the British Library for the Living with Machines project.British Library
-
Dataset
Shropshire Examiner
Shropshire Examiner (1874-1877) was a weekly newspaper which has been digitised by the British Library for the Living with Machines project.British Library
-
Dataset
South Staffordshire Examiner
South Staffordshire Examiner (1874) was a weekly newspaper which has been digitised by the British Library for the Living with Machines project.British Library
-
Dataset
Forest of Dean Examiner
Forest of Dean Examiner (1873-1877) was a weekly newspaper which has been digitised by the British Library for the Living with Machines project.British Library
-
Dataset
Cotton Factory Times
Cotton Factory Times (1885-1889, 1891-1901) was a weekly newspaper which has been digitised by the British Library for the Living with Machines projectBritish Library
-
Dataset
Selected edge painting on British Library printed books: A work in progress
Bookbindings were (and are) sometimes decorated via painting the edges of the leaves, usually but not exclusively, the fore edges of text blocks. The painting can be visible when the book is closed, or hidden beneath a layer of gold, when the edges have been gilt. This dataset covers examples...Marks, P.J.M.
bindings, painting under gilt, foreedge , fanned out leaves, fore-edge painting, foreedge paintings , fore-edge paintings, hidden fore edge paintings, fore-edge, and bookbindings
-
Dataset
Supporting documentation for A Literature Review of Palm Leaf Manuscript Conservation: Parts 1 and 2
Part 1: a historic overview, leaf preparation, materials and media, palm leaf manuscripts at the British Library and the common types of damage Part 2: historic and current conservation treatments, boxing and storage, religious and ethical issues, recommendations The closure of the British Library during the 2020-2021 Covid-19 pandemic allowed... -
Dataset
Frederick May's London Press Dictionary and Advertiser's Handbook (1883-1911)
Newspaper directories produced and published annually in contemporary 19th Britain by advertising agent Frederick May and successors, containing information on newspapers, magazines and periodicals and arranged in alphabetical and sometimes tabular order. Information for each title included price publisher office political and religious leaningFrederick May & Son ; British Library
-
Dataset
The Newspaper Press Directory (1881-1920)
Newspaper directories produced and published annually in contemporary 19th Britain by advertising agent Charles Mitchell. Newspapers listed primarily listed in alphabetical order of the town the newspaper where the title was published. Information for each title included: features connected with the district such as population and trade; principal towns in...C. Mitchell and Co. ; British Library
-
Dataset
UK Doctoral Thesis Metadata from EThOS
This dataset has been superseded by a more recent version: If you require access to an earlier version, please email openaccess@bl.uk, including the dataset title, date, and DOI in your request. The data in this collection comprises the bibliographic metadata for all UK doctoral theses listed in EThOS, the UK's...British Library ; Rosie, Heather
higher education, EThOS, student, research, UK, theses, dissertations, thesis, doctoral, and PhD
-
Dataset
Persistent Identifiers as IRO Infrastructure: Survey 2 Data
The survey ran from 4 October to 8 November 2021 and was open to everyone working in Galleries, Libraries, Archives and Museums internationally but the survey had a clear UK focus. Some responses have been removed or recoded to protect the identity of respondents.Kotarski, Rachael ; Madden, Frances
-
Dataset
Digitised Books. c. 1510 - c. 1900. JSONL (OCR derived text + metadata)
The dataset comprises metadata and OCR generated text from 49,455 digitised books published between c. 1510 - c. 1900. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in JSON Lines (JSONL) text format.British Library Labs ; British Library
OCR and monographs
-
Dataset
May's British and Irish Press Guide and Advertiser's Handbook & Dictionary etc. (1871-1880)
Newspaper directories produced and published annually in contemporary 19th Britain by advertising agent Frederick May and successors, containing information on newspapers, magazines and periodicals and arranged in alphabetical and sometimes tabular order. Information for each title included price, publisher, office, political and religious leaning.Frederick May & Son ; British Library
-
Dataset
The Newspaper Press Directory (1846-1880)
Newspaper directories produced and published annually in contemporary 19th Britain by advertising agent Charles Mitchell. Newspapers listed primarily listed in alphabetical order of the town the newspaper where the title was published. Information for each title included: features connected with the district such as population and trade; principal towns in...C. Mitchell and Co. ; British Library
-
Dataset
UK Doctoral Thesis Metadata from EThOS
This dataset has been superseded by a more recent version: https://doi.org/10.23636/j278-4b96 If you require access to an earlier version, please email openaccess@bl.uk, including the dataset title, date, and DOI in your request. The data in this collection comprises the bibliographic metadata for all UK doctoral theses listed in EThOS, the...British Library ; Rosie, Heather
higher education, student, EThOS, research, doctoral, thesis, PhD, UK, dissertations, and theses
-
Dataset
Ordnance Survey Old / First series England and Wales 1:63360 (georeferenced sheet images)
Map sheet images for the Ordnance Survey Old Series / First Series England and Wales 1:63360, georeferenced and cropped at the neatlike (can be viewed together as a seamless composite). Geotiff format. The original (ungeoreferenced) sheet images can be found at: https://commons.wikimedia.org/wiki/Category:Ordnance_Survey_Old/First_series_England_and_Wales_1:63360_(full_sheets). The sheets were georeferenced by relating the sheet...Vane, Olivia
England, Wales, Ordnance Survey, maps, First Series, and Old Series
-
Dataset
Collective Wisdom crowdsourcing organiser and volunteer survey results
Results from two short surveys run for the Collective Wisdom project. Funded by the UK Arts and Humanities Research Council (AHRC), the Collective Wisdom project captures the collective wisdom of researchers and practitioners in crowdsourcing, citizen history, citizen science and public / community participation in research with cultural heritage collections....Ridge, Mia ; Ferriter, Meghan ; Blickhan, Samantha
-
Dataset
Locating a National Collection Our Place audience survey results
Results of an audience survey conducted by Locating a National Collection and funded by the AHRC. The research has been led by the National Trust in collaboration with the British Library and Research Bods, a market research company who have delivered results using the NT’s ‘Our Place’ online audience research...Vitale, Valeria ; Rees, Gethin ; Hunt, Alex
-
Dataset
Developing identifiers workshop analysis
A collection of use cases gathered for the Developing Identifiers for Heritage Collections resource (https://tanc-ahrc.github.io/PIDResources/). It describes all the use cases for which PIDs are used and was used to inform the aspects described in the resource.Madden, Frances
-
Dataset
Catalogue records of photographs (1850-1950)
A set of catalogue records for photographs (created 1850-1950) that are held at the British Library. Export from the Integrated Archives and Manuscripts System of only CC0 published records. Personal or sensitive information has been removed. This dataset was created specifically for the Legacies of Catalogue Descriptions and Curatorial Voice:...British Library ; Moretto, Nicolas
photographs, catalogues, datasets, and metadata
-
Dataset
UK Doctoral Thesis Metadata from EThOS
This dataset has been superseded by a more recent version: https://doi.org/10.23636/ybpt-nh33 If you require access to an earlier version, please email openaccess@bl.uk, including the dataset title, date, and DOI in your request. The data in this collection comprises the bibliographic metadata for all UK doctoral theses listed in EThOS, the...British Library ; Rosie, Heather
higher education, ethos, dissertations, thesis, research, PhD, doctoral, student, UK, and theses
-
Dataset
IMPACT Digitisation Centre of Competence Dataset
The Impact Centre of Competence dataset contains more than half a million representative text-based images compiled by a number of major European libraries. Covering texts from as early as 1500, and containing material from newspapers, books, pamphlets and typewritten notes, the dataset is an invaluable resource for future research into...Universitat d’Alacant ; Instituut voor de Nederlandse Taal ; Koninklijke Bibliotheek ; Bibliothèque Nationale de France ; British Library …
-
Dataset
The Express
The Express (1846-1869) was an evening newspaper companion to the Daily News (1846-1912), published by Bradbury & Evans, and advocating reformist principles.British Library
-
Dataset
The Press.
The Press (1853-1866) was a weekly conservative newspaper, to which Benjamin Disraeli regularly contributed.British Library
-
Dataset
The Star
The Star (1788-1831, dataset 1801-1831) was the first daily London evening newspaper. Its circulation was facilitated by the success of the mail-coach service.British Library
-
Dataset
National Register.
The National Register (1808-1823) was a Conservative Sunday newspaper, owned by John Browne Bell, which was hostile to parliamentary reform.British Library
-
Dataset
The British Press; or, Morning Literary Advertiser
The British Press (1803-1826) was a daily newspaper founded in January 1803 in opposition to The Morning Post, with a conservative orientation. It printed the latest news, from home and abroad, for a London readership, and provided early journalistic employment for Charles Dickens.British Library
-
Dataset
Living Machines atypical animacy dataset
Atypical animacy detection dataset, based on nineteenth-century sentences in English extracted from an open dataset of nineteenth-century books digitized by the British Library (available via https://doi.org/10.21250/db14, British Library Labs, 2014). This dataset contains 598 sentences containing mentions of machines. Each sentence has been annotated according to the animacy and humanness... -
Dataset
Persistent Identifiers as IRO Infrastructure: Survey Data
The survey ran from 28 May to 14 September 2020 and was open to everyone working in Galleries, Libraries, Archives and Museums internationally but the survey had a clear UK focus. Some responses have been removed or recoded to protect the identity of respondents. It is intended to re-run the...Kotarski, Rachael ; Madden, Frances
-
Dataset
Living with Machines alpha and beta Zooniverse 'accident' task data
Data created through crowdsourcing tasks hosted on the Zooniverse platform. Members of the public were asked to look at a selection of articles from 19th century newspapers that mentioned machines and decide if they described an industrial accident. A further task asked participants to transcribe personal, organisational and place names...Zooniverse volunteers
citizen history, digital history, digital humanities, Living with Machines, newspapers, and crowdsourcing
-
Dataset
UK Doctoral Thesis Metadata from EThOS
This dataset has been superseded by a more recent version (5): https://doi.org/10.23636/1344 If you require access to an earlier version, please email openaccess@bl.uk, including the dataset title, date, and DOI in your request. The data in this collection comprises the bibliographic metadata for all UK doctoral theses listed in EThOS,...British Library ; Rosie, Heather
higher education, ethos, dissertations, HE, research, PhD, doctoral, student, UK, theses, and thesis
-
Dataset
Faber Music and Music Sales Publications 2013 to 2018
The ‘Faber Music and Music Sales Publications 2013 to 2018’ dataset is an .xlsx (Excel Workbook) file containing metadata describing 57,202 digital and printed music publications published by Faber Music and Music Sales between 2013 and 2018 and deposited at the British Library under legal deposit legislation. The data was...Roper, Amelie ; British Library
music sales, legal deposit, Faber Music, and digital music publications
-
Dataset
Text extracted from digitised maps of eastern Africa circa 1880-1940
This dataset comprises an Excel spreadsheet of text extracted from almost 2,000 digital images of maps and documents held in the War Office Archive, covering a large part of eastern Africa between c.1880 and 1940. The items were catalogued and digitised with generous funding from Indigo Trust. The harvested text...Dykes, Nick
military maps, East Africa, computer vision, ethnography, War Office Archive, land use, place names, colonial history, and text extraction
-
Dataset
The Sun
The Sun was a daily evening newspaper founded in 1792 with the support of then Prime Minister, William Pitt, and his Tory government. By the mid-1830s the politics of the newspaper had shifted, and it was advocating liberal and free trade principles. Ran 1792-1871, with dataset covering 1801-1871.British Library
-
Dataset
The Liverpool Standard etc
The Liverpool Standard and General Commercial Advertiser (1832-1856, with two changes of title) was a Conservative newspaper established by local politicians to counter the rise of Radicalism and promote “Church and State” ideology.British Library
-
Dataset
Colored News
Colored News (1855) was an illustrated general interest weekly newspaper. It was the first British newspaper to publish illustrations in colour.British Library
-
Dataset
The Northern Daily Times etc
The Liverpool-based Northern Daily Times (1853-1861, with two changes of title) was the first provincial daily newspaper in England to enjoy a sustained run. It was also one of the very first one penny dailies.British Library
-
Dataset
EThOS metadata files augmented with identifiers
A selection of files of the EThOS metadata augmented by the organisational identifiers listed below. These files were created to inform a deliverable which is part of the FREYA project which aims to gather enhanced provenance information in the EThOS metadata. The ReadMe file contains full details of the files,their...British Library
persistent identifiers, ISNI, ROR, and GRID
-
Dataset
Books related to theatre derived from the Digitised 19th Century Books dataset
A dataset derived from the Digitised 19th Century Books dataset which contains books pertaining to theatre written in English. The dataset of 841 items was created by filtering by keywords which are related to different genre of play including Drama, Act, Scene, Play, Comedy, Farce, Pantomime, Tragedy and Shakespeare and...British Library ; British Library Labs
act, genre, books, metadata, bibliographic, theatre, and play
-
Dataset
Books related to India from the Digitised 19th Century Books dataset
A dataset which is derived from the Digitised 19th Century books dataset focusing on books related to India. The dataset was created by refining the book title field using keywords related to names used for India during the period, places within India, cultural terms such as 'Hindu' and another term...British Library ; British Library Labs
books, metadata, bibliographic, and India
-
Dataset
Books related to 19th Century British Colonies derived from the Digitised 19th Century books dataset
A dataset derived from the Digitised 19th Century Books dataset which contains books related to 19th Century British Colonies. The dataset of 1288 items was created using filtering by keywords of locations and then manually checked for accuracy. The data was augmented with additional columns including 'City', 'Colony Name' and...British Library ; British Library Labs
Africa, colonialism, Canada, Ceylon, metadata, bibliographic, India, Australia, books, British Colony, and British Colonies
-
Dataset
Books related to War derived from the Digitised 19th Century Books Dataset
A dataset which is derived from the Digitised 19th Century Books dataset comprising all non-fiction English language books related to armed conflicts. The dataset of 1127 items was developed by refining based on keywords such as 'war', 'battle', 'uprising', 'revolt', 'rebellion', 'invasion' and 'mutiny'. This dataset was curated by students...British Library ; British Library Labs
non-fiction, War, books, metadata, and bibliographic
-
Dataset
Books related to the Industrial Revolution derived from the Digitised 19th Century books dataset
A dataset which is a subset of the Digitised 19th Century Books dataset comprising books related to the Industrial Revolution in Britain. The subset of 354 items was refined by using keywords associated with placenames and the topic of industrialism. This dataset was curated by the Aepyi student group at...British Library ; British Library Labs
books, industrialism, metadata, bibliographic, and Industrial Revolution
-
Dataset
Latin American books in Digitised 19th century books
A dataset which is derived from the 19th Century Books dataset comprising c.1,100 books which are related to Latin America, written in Spanish, English, German, French, Italian, Swedish and Dutch.British Library ; British Library Labs
books, Latin America, metadata, and bibliographic
-
Dataset
Books divided by Genre from the Digitised 19th century books dataset
A dataset derived from the Digitised 19th Century Books dataset which classifies the books by genre (Drama, Poetry, Prose, Music and unidentified). For Drama, Music and Prose several types were identified. For Drama: comedy, play, recitation and tragedy. For Prose: novel, parody, romance, satire, story, history subset of story and...British Library ; British Library Labs
Music, Genre, Prose, books, Poetry, metadata, bibliographic, and Drama
-
Dataset
Books containing images about Finland
A dataset derived from the Digitised 19th Century books dataset comprising books with images about Finland, approximately 40 titles. This dataset was compiled by Ruby Dixon a student at Graveney School who completed work experience at British Library Labs in 2016.British Library ; British Library Labs
books, Finland, metadata, and bibliographic
-
Dataset
Russian language books in the Digitised 19th century books dataset
A dataset which is a subset of the Digitised 19th Century books dataset comprising Russian Language books. The spreadsheet contains metadata of 585 books in Russian. This dataset was compiled by Nadya Miryanova a student at Lady Eleanor Holles who completed work experience at British Library Labs in 2017.British Library ; British Library Labs
books, metadata, bibliographic, and Russia
-
Dataset
UK Doctoral Thesis Metadata from EThOS
This dataset has been superseded by a more recent version: https://doi.org/10.23636/1188 If you require access to an earlier version, please email openaccess@bl.uk, including the dataset title, date, and DOI in your request. The data in this collection comprises the bibliographic metadata for all UK doctoral theses listed in EThOS, the...British Library ; Rosie, Heather
-
Dataset
John Jaffray dataset; a hand list of printed books and scrap books compiled by Jaffray relating to bookbinding and trade unionism in (mainly) 19th century Victorian London
This set comprises 169 records on 222 pages of a PDF listing the contents of the Jaffray Collection (shelf mark Jaff 1 to Jaff 169) composed using free text. John Jaffray (1811-1869) was a bookbinder in Victorian London, interested in bookbinding, trade unionism and Chartism. This is a restricted collection...Marks, P. J. M.
John Jaffray bookbinder, 19th century, trade unionism, and bookbinding
-
Dataset
Results From A 2015 Survey On Git/Distributed Version Control At Imperial College London
These are the - anonymised - results from a survey run at Imperial College London in November-December 2015. The survey was aimed at user of distributed version control systems, in particular Git. Before publishing the results I deleted all comments to avoid individuals being identified. The survey was designed to...Reimer, Torsten ; Boakye, Gifty
higher education, survey, Git, distributed version control, and software development
-
Dataset
Digitised maps of the former British East Africa
This dataset comprises 581 images of maps of the former British East Africa created between 1890 and 1940 and a spreadsheet of related catalogue records. All Open Government Licence v1.0 (OGL). A user-friendly geographical search index of the maps is available on Google Maps. These JPEG files were converted from...Dykes, Nick
War Office Archive, documents, Intelligence, Uganda, East Africa, British East Africa, Maps, Military maps, and Kenya
-
Dataset
SherlockNet data
Using Convolutional Neural Networks to Explore Over 400 Years of Book Illustrations: Starting from February 2016, as part of the British Library Labs Competition, we embarked on a collaboration with the British Library Labs and the British Museum to tag and caption the entire British Library 1M Collection, a set...Zhao, Luda ; Do, Brian ; Wang, Karen
sherlocknet, books, images, digitised, tags, Flickr, tagging, and Microsoft
-
Dataset
Linked Open British National Bibliography - Forthcoming Books N-Triples and RDF/XML
This dataset includes metadata for forthcoming books to be published or distributed in the UK.Deliot, Corine
British National Bibliography, forthcoming, linked open data, BNB, N-Triples, RDF/XML, NT, CIP, and metadata
-
Dataset
Linked Open British National Bibliography - Books. 1950- N-Triples and RDF/XML.
This dataset includes metadata for books published or distributed in the UK since 1950.Deliot, Corine
British National Bibliography, BNB, NT, linked open data, RDF/XML, N-Triples, books, and metadata
-
Dataset
UK Selective Web Archive Classification Dataset. 1996 - 2010. TSV.
The dataset comprises a manually curated selective archive produced by UKWA which includes the classification of sites into a two-tiered subject hierarchy. In partnership with the Internet Archive and JISC, UKWA had obtained access to the subset of the Internet Archive’s web collection that relates to the UK. The JISC...UK Web Archive
archive, web domain dataset, JISC UK, classification dataset, UKWA Open Data, and 1996-2014
-
Dataset
Linked Open British National Bibliography - Serials. 1950- N-Triples and RDF/XML
This dataset includes metadata for serials published or distributed in the UK since 1950.Deliot, Corine
British National Bibliography, serials, linked open data, BNB, N-Triples, RDF/XML, NT, and metadata
-
Dataset
JISC UK Web Domain Dataset Format Profile. 1996 - 2010.
The dataset is a format profile, summarising media type (MIME type) data formats contained within all of the HTTP 200 OK responses in the 1996 - 2010 tranche of the JISC UK Web Domain Dataset. In partnership with the Internet Archive and JISC, UKWA had obtained access to the subset...UK Web Archive
archive, 1996-2010, web domain dataset, JISC UK, UKWA Open Data, and format profile
-
Dataset
JISC UK Web Domain Dataset Host Link Graph. 1996 - 2010. TSV.
The dataset comprises ~2.5 billion 200 OK responses from the 1996 - 2010 tranche of the JISC UK Web Domain Dataset which have been scanned for hyperlinks. For each link, UKWA extracts the host that the link targets, and uses this to build up a picture of which hosts have...UKWA Open Data
archive, 1996-2012, web domain dataset, JISC UK, host link graph, and UKWA Open Data
-
Dataset
JISC UK Web Domain Dataset Geoindex. 1996 - 2010. TSV.
The dataset comprises ~2.5 billion 200 OK responses in the 1996 - 2010 tranche of the JISC UK Web Domain Dataset Dataset which have been scanned for geographic references - specifically postcodes. This set of postcode citations, found at particular URLs and crawled at particular times, forms an historical geoindex...UK Web Archive
archive, 1996-2011, JISC UK, geoindex, UKWA Open Data, and web domain dataset
-
Dataset
JISC UK Web Domain Dataset Crawled URL Index. 1996 - 2013. CDX.
The dataset comprises original compound index (CDX) files that have been re-assembled into 18 separate CDX files for each year of crawling activity represented (1996 - 2013). Please note that the individual CDX files are not sorted. In order to enable access to web archives, UKWA uses CDX files to...UKWA Open Data
archive, 1996-2013, crawled URL index, web domain dataset, JISC UK, and UKWA Open Data
-
Dataset
Digitised Books - Images identified as Medium Sized Images. c. 1567 - c. 1900. JPG
The dataset comprises c. 217,101 images identified as 'Medium Sized Images' from the British Library's Flickr Commons collections, dating between c. 1567 - c. 1900. The images were algorithmically gathered from 49,455 digitised books, equating to 65,227 volumes (25+ million pages), published between c. 1510 - c. 1900; Medium Sized...British Library ; British Library Labs
medium sized images, books, images, digitised, and Microsoft
-
Dataset
Digitised Books - Images of the bound covers of books. c. 1510 - c. 1900. JPG
The dataset comprises c. 61,561 images identified as 'Book Covers' from the British Library's Flickr Commons collections, dating between c. 1510 - c. 1900.British Library ; British Library Labs
digitised, books, Microsoft, images, and bookcovers
-
Dataset
Digitised Books - Images identified as Plates. c. 1528 - c. 1900. JPG
The dataset comprises c. 385,237 images identified as 'Plates' from the British Library's Flickr Commons collections, dating between c. 1528 – c. 1900. The images were algorithmically gathered from 49,455 digitised books, equating to 65,227 volumes (25+ million pages), published between c. 1510 - c. 1900; Plates have currently been...British Library ; British Library Labs
-
Dataset
Theatrical playbills from Britain and Ireland
The dataset comprises 264 volumes of digitised theatrical playbills published between 1660 – 1902 (mostly 19th century) from England, Scotland, Wales and Ireland. Digitised from the British Library's physical collection of over 500 volumes of playbills. The dataset in Portable Document Format (PDF). The playbills cover theatres in Bath (Royal),...British Library Labs ; Kirk, Tanya
singlesheet, playbill, and playbills
-
Dataset
Digitised Quarterly Lists XML and Metadata
Two Centuries of Indian Print 1867-1947. The files in this dataset are derived from the British Library’s collection of bound volume Quarterly Lists: printed catalogue records of Indian books published quarterly and by province of British India between 1867 and 1947. The dataset comprises text from the collection of digitised...Derrick, Tom
-
Dataset
Digitised Quarterly Lists PDFs and Metadata
The files in this dataset are derived from the British Library’s collection of bound volume Quarterly Lists: printed catalogue records of Indian books published quarterly and by province of British India between 1867 and 1947. The dataset comprises full-text searchable PDFs of 215 volumes as well as the associated metadata...Derrick, Tom
-
Dataset
Volumes of performances connecting Sir Henry Irving. 1879 - 1905.
Sir Henry Irving's American and Provincial Tours 1883 - 1905; miscellaneous performances, including some given by Royal Command, 1883 - 1903; Lyceum Theatre 1879 – 1902; and Drury Lane Theatre, 1903 and 1905. The collection was formed by Bram Stoker.British Library
-
Dataset
Volumes of Lysons Collectanea (Trades), comprising advertisements, cuttings, and illustrations relating to trades, professions, medical cures. 1660-1825.
The dataset comprises the OCR text derived from four digitised volumes of a collection of advertisements, cuttings and illustrations relating to trades, professions and medical cures from 1660 - 1825.British Library
text, newspapers, OCR, trades, and adverts
-
Dataset
Volumes of Lysons Collectanea (Amusements), comprising broadsides, cuttings, advertisements on amusements 1660-1840
The dataset comprises nine digitised volumes of a collection of broadsides, cuttings and advertisements, relating to public exhibitions and places of amusement from 1660 - 1840 (with OCR-derived text.) Part of the Lysons Collectanea collection.British Library
amusements, text, newspapers, broadsides, OCR, and adverts
-
Dataset
Volumes of portraits and biographies of officers in the South African wars collected by John Malcolm Bulloch. 1900 - 1902.
The dataset comprises six digitised volumes (in PDF) of a collection of portraits and biographical details of some officers distinguished in the South African War (1900 - 1902) (with OCR-derived text.) The collection was formed by John Malcolm Bulloch..British Library
South Africa, text, portraits, war, army, OCR, biographies, and biography
-
Dataset
Volumes of Madden's cuttings, views, and pamphlets about the British Museum. 1755-1870.
The dataset comprises four digitised volumes of a collection of cuttings, views and pamphlets made by Sir Frederic Madden about the British Museum, dating 1755 - 1870 (with OCR-derived text.)British Library
British Museum, text, and OCR
-
Dataset
Volume of Christmas ballads and broadsides. 1750 - 1840
110 page PDF of miscellaneous Christmas ballads and prose broadsides (with OCR-derived text.) The dataset comprises one digitised volume (110 pages) of a collection of Christmas ballads and prose broadsides chiefly printed in London by J. Pitts between 1750 - 1840. The dataset is in Portable Document Format (PDF).British Library
-
Dataset
Portraits of actors, views of theatres and playbills (covering 1750 - 1821 in a single volume)
166 page PDF of collated portraits and views (with OCR-derived text) The dataset comprises one digitised volume (166 pages) of a collection of portraits of celebrated actors and actresses, views of theatres and playbills, dating 1750 - 1821. The dataset is in Portable Document Format (PDF).British Library
text, theatres, views, portraits, actors, OCR, and playbills
-
Dataset
Volumes of signs of taverns in England and Wales. 1628 - 1858
The dataset comprises 14 digitised volumes (as PDFs) of a collection of tavern signs in and England and Wales dating 1628 – 1858 (with OCR-derived text.)British Library
-
Dataset
OCR text derived from digitised books published 1890 - 1899 in ALTO XML
This set consists 14847 volumes, published between 1890-1899. The dataset comprises text from the collection of digitised books created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO)...British Library ; British Library Labs
XML, books, digitised, metadata, Microsoft, nineteenth century, and ALTO
-
Dataset
Digitised Books - Images identified as Embellishments. c. 1510 - c. 1900. JPG
The images were algorithmically gathered from 49,455 digitised books, equating to 65,227 volumes (25+ million pages), published between c. 1510 - c. 1900. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The images are in .JPEG format.British Library ; British Library Labs
embellishments, digitised, books, Microsoft, and images
-
Dataset
Digitised Books - Images identified as Medium Sized Images. c. 1567 - c. 1900. JPG
The dataset comprises c. 217,101 images identified as 'Medium Sized Images' from the British Library's Flickr Commons collections, dating between c. 1567 - c. 1900. The images were algorithmically gathered from 49,455 digitised books, equating to 65,227 volumes (25+ million pages), published between c. 1510 - c. 1900; Medium Sized...British Library ; British Library Labs
digitised, books, Microsoft, images, and medium sized images
-
Dataset
Theatrical playbills from Britain and Ireland (OCR text only)
The dataset comprises 264 volumes of digitised theatrical playbills published between 1660 – 1902 (mostly 19th century) from England, Scotland, Wales and Ireland. Digitised from the British Library's physical collection of over 500 volumes of playbills. The dataset contains text files (.TXT) in Optical Character Recognition (OCR) format. The playbills...British Library Labs
singlesheet, text, playbill, OCR, and playbills
-
Dataset
OCR text derived from digitised books published 1880 - 1889 in ALTO XML
This set consists 10856 volumes, published between 1880-1889. The dataset comprises text from the collection of digitised books created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO)...British Library ; British Library Labs
XML, books, digitised, metadata, Microsoft, ALTO, and nineteenth century
-
Dataset
OCR text derived from digitised books published 1870 - 1879 in ALTO XML
This set consists 8630 volumes, published between 1870-1879. The dataset comprises text from the collection of digitised books created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO)...British Library ; British Library Labs
XML, books, digitised, metadata, Microsoft, nineteenth century, and ALTO