%0 Online Database %T OCR text derived from digitised books published 1700 - 1799 in ALTO XML. %A British Library; British Library Labs %C London, UK %D 2014 %8 2018-12-18 %I British Library %X The dataset comprises text from the collection of digitised books created using Optical Character Recognition (OCR) technology. The books cover a wide range of subject areas including philosophy, history, poetry and literature. The dataset is in Analysed Layout and Text Object (ALTO) Extensible Markup Language (XML) format. %G English %[ 2024-03-29 %9 Dataset %~ Hyku %W British Library