Search Results
-
Dataset
Digitised Quarterly Lists XML and Metadata
Two Centuries of Indian Print 1867-1947. The files in this dataset are derived from the British Library’s collection of bound volume Quarterly Lists: printed catalogue records of Indian books published quarterly and by province of British India between 1867 and 1947. The dataset comprises text from the collection of digitised...
Derrick, Tom
-
Dataset
Digitised Quarterly Lists PDFs and Metadata
The files in this dataset are derived from the British Library’s collection of bound volume Quarterly Lists: printed catalogue records of Indian books published quarterly and by province of British India between 1867 and 1947. The dataset comprises full-text searchable PDFs of 215 volumes as well as the associated metadata...
Derrick, Tom
-
Conference paper (unpublished)
ICDAR2019 Competition on Recognition of Early Indian Printed Documents – REID2019
This paper presents an objective comparative evaluation of page analysis and recognition methods for historical documents with text mainly in Bengali language and script. It describes the competition rules, dataset, and evaluation methodology. Results are presented for five methods - three submitted, one re-run, and one open source state-of-the-art system....
Clausner, Christian ; Antonacopoulos, Apostolos ; Derrick, Tom ; Pletschacher, Stefan
-
Conference paper (published)
Cross-disciplinary Collaborations to Enrich Access to Non-Western Language Material in the Cultural Heritage Sector
The British Library is home to millions of items representing every age of written civilisation, including books, manuscripts and newspapers in all written languages. Large digitisation programmes currently underway are opening up access to this rich and unique historical content on an ever increasing scale. However, particularly for historical material...
Derrick, Tom ; McGregor, Nora
HTR, page analysis, layout analysis, recognition, Bangla script, Arabic script, OCR, and datasets