Risultati della ricerca
DatasetNewspaper directories produced and published annually in contemporary 19th Britain by advertising agent Charles Mitchell. Newspapers listed primarily listed in alphabetical order of the town the newspaper where the title was published. Information for each title included: features connected with the district such as population and trade; principal towns in...
C. Mitchell and Co. ; British Library
AbstractResearch in computational linguistics has made successful attempts at modelling word meaning at scale, but much remains to be done to put these computational models to the test of historical scholarship (see e.g. Beelen et al. 2021). More importantly, a lot of computational research looks at texts in a historical...
Ridge, Mia ; Tolfo, Giorgia ; Westerling, Kalle ; Pedrazzini, Nilo ; McGillivray, Barbara
Journal articleThis article offers the edition of a divorce settlement of 369 housed in the British Library. The papyrus is one of the few fourth-century deeds of divorce. In addition to the standard clauses found in such settlements, provisions for the care of the minor children of the ex-couple are also...
Working paperThere is a growing interest in utilising Machine Learning (ML) techniques within Galleries, Libraries, Archives and Museums (GLAM), and a corresponding demand for training to enable practitioners to engage confidently in this area. Staff at these institutions are seeking practical knowledge and skills in ML concepts and methods specific to...
van Strien, Daniel ; Bell, Mark ; McGregor, Nora Rose ; Trizna, Michael
Documenting Geographically and Contextually Diverse Data Sources: The BigScience Catalogue of Language Data and ResourcesIn recent years, large-scale data collection efforts have prioritized the amount of data collected in order to improve the modeling capabilities of large language models. This prioritization, however, has resulted in concerns with respect to the rights of data subjects represented in data collections, particularly when considering the difficulty in...
McMillan-Major, Angelina ; Alyafeai, Zaid ; Biderman, Stella ; Chen, Kimbo ; De Toni, Francesco …
Working paperIn this work, we explore whether the recently demonstrated zero-shot abilities of the T0 model extend to Named Entity Recognition for out-of-distribution languages and time periods. Using a historical newspaper corpus in 3 languages as test-bed, we use prompts to extract possible named entities. Our results show that a naive...
De Toni, Francesco ; Akiki, Christopher ; de la Rosa, Javier ; Fourrier, Clémentine ; Manjavacas, Enrique …
DatasetThe data in this collection comprises the bibliographic metadata for all UK doctoral theses listed in EThOS, the UK's national thesis service. We estimate the data covers around 98% of all PhDs ever awarded by UK Higher Education institutions, dating back to 1787. Thesis metadata from every PhD-awarding university in...
British Library ; Rosie, Heather
Research reportBunnyfoot undertook an exploratory study for the British Library to investigate users’ perspectives on the preservation and use of Emerging Formats in the Library context. These findings are intended to help inform requirements for the collection, preservation, discovery and use of these materials at legal deposit libraries.
Tope, Andrew ; Kayukala, Mila