Search
Search results
-
Conference paper (published)
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
As language models grow ever larger, the need for large-scale high-quality text datasets has never been more pressing, especially in multilingual settings. The BigScience workshop, a 1-year international and multidisciplinary initiative, was formed with the goal of researching and training large language models as a values-driven undertaking, putting issues of...Laurençon, Hugo ; Saulnier, Lucile ; Wang, Thomas ; Akiki, Christopher ; Villanova del Moral, Albert …
-
Conference paper (published)
Entities, Dates, and Languages: Zero-Shot on Historical Texts with T0
In this work, we explore whether the recently demonstrated zero-shot abilities of the T0 model extend to Named Entity Recognition for out-of-distribution languages and time periods. Using a historical newspaper corpus in 3 languages as test-bed, we use prompts to extract possible named entities. Our results show that a naive...De Toni, Francesco ; Akiki, Christopher ; De La Rosa, Javier ; Fourrier, Clémentine ; Manjavacas, Enrique …
-
Presentation
WARCnet Special Report - An overview of Skills, Tools & Knowledge Ecologies in Web Archive Research
This is a recording of a presentation for the Aarhus Closing Conference 2022, 18 October 2022 on the WARCnet Special Report, ‘An overview of Skills, Tools & Knowledge Ecologies in Web Archive Research’ with the accompanying slides. It was presented by Sharon Healy, Helena Byrne, Nicola Bingham and Katharina Schmid....Healy, Sharon ; Byrne, Helena ; Bingham, Nicola ; Schmid, Katharina
-
Conference paper (published)
Towards a Collections Model for Preservation Planning at the British Library
The development of a framework for preservation planning at the British Library has highlighted the need for a more-structured understanding of its digital collections, in particular with regard to identifying the specific sets of objects that would be the focus of preservation plans. Work has recently commenced on developing a...Day, Michael ; Pennock, Maureen
-
Conference paper (published)
Design Patterns in Digital Preservation: Understanding Information Flows
This paper proposes a framework to help understand the different ways digital preservation goals can be achieved, and the contextual factors these choices depend on. This is done through a worked example: three different design patterns representing the three possible modes of archival information flow, each illustrated with realistic examples and...Jackson, Andrew N
OAIS, design patterns, community, risk management, and innovation
-
Poster (published)
Quality Assurance for Born-Digital Interactive Narratives: The New Media Writing Prize Collection as a case study
The UK Legal Deposit Libraries have been researching and building experimental collections of emerging formats for the past five years, including curated collections of web-based interactive narratives in the UK Web Archive. The New Media Writing Prize Collection is one of such collections, created using web archiving tools to capture...Rossi, Giulia Carla ; Pyke, Tegan
innovation, resilience, quality assurance, New Media Writing Prize, digital interactive narratives, web archiving, and emerging formats
-
Conference paper (unpublished)
Closing remarks to Open and Engaged conference 2022.
Rachael's closing remarks summing up the discussion of the 2022 Open and Engaged event. Kotarski, Rachael
-
Conference paper (unpublished)
Shared Research Repository Service and competency framework for cultural heritage professionals
This presentation covers an overview of the British Library’s Shared Research Repository Service to make GLAM research discoverable and a proposed training programme for GLAM professionals to manage repository services as a result of a recent scoping report conducted by the British Library. British Library’s Shared Research Repository Service features...Basford, Jenny ; Holt, Ilkay
-
Conference paper (unpublished)
Climate change approach at the British Library
Maja outlines the initiatives the British Library is taking to reach Net Zero including infrastructure issues to reduce emissions. Collection initiatives include the UK Web Archive curated Collection on Climate Change, Voices of Science, Contemporary Science Collections Guide on Climate Change and highlighting the use of nineteenth century maps to...Maricevic, Maja
-
Conference paper (unpublished)
“Forever or 5 years”: Sustainability planning for Digital Research Infrastructure for Arts and Humanities
This short presentation will focus on the requirements for sustainability planning for Digital Research Infrastructure for Arts and Humanities, and more specifically for the federated repositories & data services’ ecosystem as part of the AHRC programme “Infrastructure for Digital Innovation and Curation in Arts and Humanities” (iDAH). From technical choices,...Sichani, Anna Maria
-
Conference paper (unpublished)
Climate justice at the Royal Botanic Garden Edinburgh
Founded in 1670, the Royal Botanic Garden Edinburgh delivers world-leading plant science, conservation and education programmes. The Garden’s mission is to ‘Explore, Conserve and Explain the World of Plants for a Better Future’ and the new five-year strategy, ‘Responding to the Biodiversity Crisis and Climate Emergency’, was launched in...Mitchell, Lorna
-
Conference paper (unpublished)
Valuing the breadth and depth of skills in the research library
Libraries are centres of technical and specialist expertise, yet the skills their staff can contribute to the wider organisation and as partners in research have been overlooked. This talk follows on from the RLUK-AHRC event The Technician Commitment and the role of research and academic libraries as centres of technical and...Knowles, Claire
-
Conference paper (published)
Exploring Software, Tools and Methods used in Web Archive Research
This paper is one part of a larger research project titled Web Archives - Researcher Skills and Tools (WARST). In this poster we focus on the data from the WARST study which examines the software, tools and methods used in the web archive research lifecycle. Schmid, Katharina ; Healy, Sharon ; Byrne, Helena
web archiving, web archive research, web archive users, and web archive creators
-
Presentation
Identifiers for Practice Research
Evans, Jenny
-
Abstract
Historic machines from 'prams' to 'Parliament': new avenues for collaborative linguistic research
Research in computational linguistics has made successful attempts at modelling word meaning at scale, but much remains to be done to put these computational models to the test of historical scholarship (see e.g. Beelen et al. 2021). More importantly, a lot of computational research looks at texts in a historical...Ridge, Mia ; Tolfo, Giorgia ; Westerling, Kalle ; Pedrazzini, Nilo ; McGillivray, Barbara
crowdsourcing, computational linguistics, and digital humanities
-
Presentation
Locating a national collection through audience research
Locating a National Collection (LaNC) aims to help cultural heritage organisations to use location data — such as where objects were made and used or the places they depict and describe — to connect collections and engage audiences. Location-based interfaces such as web maps offer opportunities to open up collections...Rees, Gethin
Interface design, Geography, Location, Web maps, Cultural Heritage, and Metadata
-
Presentation
Locating a National Collection
Locating a National Collection (LaNC) aims to help cultural heritage organisations to use location data — such as where objects were made and used or the places they depict and describe — to connect collections and engage audiences. Location-based interfaces such as web maps offer opportunities to open up collections...Rees, Gethin ; Vitale, Valeria
Interface design, Geography, Location, Web maps, Cultural Heritage, and Metadata
-
Conference paper (unpublished)
Developing Identifiers for Heritage Collections
'Developing Identifiers for Heritage Collections' is a tool aimed at heritage professionals to support decision making for persistent identifier (PID) use. The project’s 2020 survey found the value of PIDs, for example in creating trustworthy links, was understood but needs to be more clearly addressed directly to decision makers...Madden, Frances
persistent identifiers, Towards a National Collection, and PIDs
-
Abstract
PIDs for Repositories: DOIs and URNs and Handles, oh my….
This presentation, given at a workshop on PID requirements in the UKRI Open Access Policy, gives an overview of implementation options to meet technical requirement 5a: "PIDs for research outputs must be implemented according to international recognised standards, examples of international standards include DOI, URN or Handle". Cope, Jez
PIDs, Open Access, and UKRI
-
Presentation
Publisher Submission Portal User Survey - Autumn 2021
A presentation given to the Collection Development and Acquisitions sub-group (CDAS) of the Legal Deposit Implementation Group in December 2021. CDAS brings together representatives from the six UK legal deposit libraries, to report on operational matters and plan policy for co-operation on managing the legal deposit collections. The Publisher Submission...Hazell, Lottie
-
Conference paper (published)
When Time Makes Sense: A Historically-Aware Approach to Targeted Sense Disambiguation
As languages evolve historically, making computational approaches sensitive to time can improve performance on specific tasks. In this work, we assess whether applying historical language models and time-aware methods help with determining the correct sense of polysemous words. We outline the task of time-sensitive Targeted Sense Disambiguation (TSD), which aims...Beelen, Kaspar ; Nanni, Federico ; Coll Ardanuy, Mariona ; Hosseini, Kasra ; Tolfo, Giorgia …
-
Conference paper (published)
Living Machines: A study of atypical animacy
This paper proposes a new approach to animacy detection, the task of determining whether an entity is represented as animate in a text. In particular, this work is focused on atypical animacy and examines the scenario in which typically inanimate objects, specifically machines, are given animate attributes. To address it,...Coll Ardanuy, Mariona ; Nanni, Federico ; Beelen, Kaspar ; Hosseini, Kasra ; Ahnert, Ruth …
nineteenth-century English, living machines, BERT, and animacy
-
Conference paper (unpublished)
A Deep Learning Approach to Geographical Candidate Selection through Toponym Matching
Recognizing toponyms and resolving them to their real-world referents is required for providing advanced semantic access to textual data. This process is often hindered by the high degree of variation in toponyms. Candidate selection is the task of identifying the potential entities that can be referred to by a toponym...
-
Abstract
Using smart annotations to map the geography of newspapers
Geographic information is a key component in the description of collection objects, and yet its format is often unsuited for use with methods of geographic analysis. Catalogue entries are often inconsistent, in plain text, and without geographic coordinates (much less coordinates linked to authority records). Georesolution of the relevant fields...Ryan, Yann ; Coll Ardanuy, Mariona ; van Strien, Daniel ; Hosseini, Kasra ; Beelen, Kaspar …
-
Conference paper (unpublished)
Contextualizing Victorian Newspapers
Beelen, Kaspar ; Ahnert, Ruth ; Beavan, David ; Coll Ardanuy, Mariona ; Hosseini, Kasra …
-
Conference paper (unpublished)
Defoe: A Spark-Based Toolbox for Analysing Digital Historical Textual Data
This work presents defoe, a new scalable and portable digital eScience toolbox that enables historical research. It allows for running text mining queries across large datasets, such as historical newspapers and books in parallel via Apache Spark. It handles queries against collections that comprise several XML schemas and physical representations....
-
Presentation
Mapping Irish Women’s Football
The Mapping Irish Football project is calling on the crowd to share any newspaper references they may have come across of women and any code of football prior to and including 1973. It is hoped that this project will start a conversation amongst researchers interested in Irish sports to do...Byrne, Helena ; Gibbs, Stuart
-
Conference paper (published)
Station to Station: Linking and Enriching Historical British Railway Data
The transformative impact of the railway on nineteenth-century British society has been widely recognized, but understanding that process at scale remains challenging because the Victorian rail network was both vast and in a state of constant flux. Michael Quick’s reference work Railway Passenger Stations in Great Britain: a Chronology offers...Coll Ardanuy, Mariona ; Beelen, Kaspar ; Lawrence, Jon ; McDonough, Katherine ; Nanni, Federico …
-
Conference paper (published)
Towards a Foundation for Collaborative Digital Archiving with Local Concert-Giving Organisations
The centenaries of former chapters of the British Music Society (BMS), established in 1918, have prompted their governing bodies to take stock of their histories and build on the cataloguing, documentation and preservation of their archival collections. The InterMusE project aims to support this shared instinct to archive by capturing...
-
Conference paper (unpublished)
What does the future hold for "open" and cultural heritage institutions?
GLAMs’ [Galleries, Libraries, Archives and Museums] public interest mission is squarely aligned with the open access ethos. Indeed, making their collections as openly accessible, shareable, and reusable as possible is the best way for GLAMs to achieve their mission as they digitize and offer their collections online. But only a...Vézina, Brigitte
-
Conference paper (unpublished)
The Ethics of Open Access in the Endangered Archives Programme
The Endangered Archives Programme (also known as EAP) gives funding to people running projects to digitise and preserve archival materials at risk of destruction. These can date from any time before the middle of the twentieth century, and from most parts of the world except Europe and North America. The...Schaik, Sam van
-
Conference paper (unpublished)
Users understand OpenGLAM. Do GLAMs?
For more than a decade, a dedicated bunch of Galleries, Libraries, Archives and Museums [GLAM] around the world have been advocating for opening up cultural heritage collections while pushing for openness in their own institutions. Today, more than 1,200 GLAMs worldwide feature open access to their digitised assets – making...Sanderhoff, Merete
-
Conference paper (unpublished)
Increasing engagement through Towards a National Collection
At the centre of the £18.9m research development programme Towards a National Collection is the aim to increase engagement with the cultural heritage collections of the UK. Funded by the Arts and Humanities Research Council, the programme is working to link collections and encourage cross-searching of multiple collection types, to...Bailey, Rebecca
-
Presentation
Developing Identifiers Resource and Q&A
A presentation as part of 'Developing Identifiers for Heritage Collections' webinar held on 21st April 2021. Madden, Frances
-
Presentation
Persistent Identifiers Demonstrator
A presentation as part of 'Developing Identifiers for Heritage Collections' webinar held on 21st April 2021. Page, Roderic
-
Presentation
Question and answer session 1 : Increasing Engagement with Cultural Heritage Collections
Recording of the Question and Answer discussion from Session 1: Increasing engagement with cultural heritage collections. Vézina, Brigitte ; Schaik, Sam van ; Sanderhoff, Merete ; Bailey, Rebecca
-
Conference paper (unpublished)
Impact cannot be measured, and other sad half-truths about impact measurement
Not everything that can be counted counts, and not everything that counts can be counted. In this talk, I look at bringing algorithmic fairness to impact measurement, from web-scale attention tracking to computer-assisted data story-telling. Drawing on my experience with altmetrics, I argue that many proxies for impact correlate not...Boruta, Luc
-
Conference paper (unpublished)
Assessing the broader value of research culture: The hidden REF experience
The hidden REF was an experiment to counteract existing evaluation methods. UK REF Impact Case Studies have a narrative linearity which fails to appreciate the amazing plethora of interactions, individuals and different types of output that are part of our research culture. The hidden REF exercise aims to celebrate the...Derrick, Gemma
-
Conference paper (unpublished)
Making a Difference and 'Partnering for Impact'
This presentation will reflect on impact as defined in the Research Excellence Framework (REF2021) and that forms a key element of the current dual funding structure of research for Higher Education Institutions. Although impact in its broadest sense extends beyond research, it is most prominently highlighted in the REF as...Boddington, Anne
-
Conference paper (unpublished)
Best of both: combining arts and science to measure the benefits of online culture for mental health in young people
An inter-disciplinary project undertaken by museum and psychiatry staff at the University of Oxford in 2020 set out to find out if online cultural content could be effective against common mental health issues such as anxiety and depression. The O-ACE (Online Active Community Engagement) project used traditional arts engagement research...Adams, Helen
-
Conference paper (unpublished)
Question and answer session 2 : Measuring and evaluating impact beyond journal articles
Recording of the Question and Answer discussion from Session 2: Measuring and evaluating impact beyond journal articles. Boruta, Luc ; Derrick, Gemma ; Boddington, Anne ; Adams, Helen
-
Presentation
Developing Identifiers for Heritage Collections: Introduction and Case Studies
A presentation as part of 'Developing Identifiers for Heritage Collections' webinar held on 21st April 2021. Photo Credit: Manchester and Lancashire Family History Society © ArchivePlus/Max Bamber 2016. Madden, Frances
-
Presentation
Research Data Management for IROCs: A whistle-stop tour!
Presentation delivered as part of the IROC Open Research Workshop, August 2021. Includes: - what is Research Data Management? - how do I do it? - what else is there? - further resources. Cope, Jez
data management planning, research data management, open research, outputs management, and FAIR data
-
Poster (published)
Creating and Archiving Electronic Literature During the Pandemic
The 2020 COVID-19 pandemic has had a considerable impact on the way cultural heritage organisations engage with their audiences. At a time when public exhibitions and events have to be postponed indefinitely or cancelled, many GLAM institutions have chosen to increase their online presence instead, looking at virtual platforms as...Rossi, Giulia Carla