Ricerca
Risultati della ricerca
-
Dataset
UK Selective Web Archive Classification Dataset. 1996 - 2010. TSV.
The dataset comprises a manually curated selective archive produced by UKWA which includes the classification of sites into a two-tiered subject hierarchy. In partnership with the Internet Archive and JISC, UKWA had obtained access to the subset of the Internet Archive’s web collection that relates to the UK. The JISC...UK Web Archive
archive, web domain dataset, JISC UK, classification dataset, UKWA Open Data, and 1996-2014
-
Dataset
JISC UK Web Domain Dataset Crawled URL Index. 1996 - 2013. CDX.
The dataset comprises original compound index (CDX) files that have been re-assembled into 18 separate CDX files for each year of crawling activity represented (1996 - 2013). Please note that the individual CDX files are not sorted. In order to enable access to web archives, UKWA uses CDX files to...UKWA Open Data
archive, 1996-2013, crawled URL index, web domain dataset, JISC UK, and UKWA Open Data
-
Dataset
JISC UK Web Domain Dataset Format Profile. 1996 - 2010.
The dataset is a format profile, summarising media type (MIME type) data formats contained within all of the HTTP 200 OK responses in the 1996 - 2010 tranche of the JISC UK Web Domain Dataset. In partnership with the Internet Archive and JISC, UKWA had obtained access to the subset...UK Web Archive
archive, 1996-2010, web domain dataset, JISC UK, UKWA Open Data, and format profile
-
Dataset
JISC UK Web Domain Dataset Host Link Graph. 1996 - 2010. TSV.
The dataset comprises ~2.5 billion 200 OK responses from the 1996 - 2010 tranche of the JISC UK Web Domain Dataset which have been scanned for hyperlinks. For each link, UKWA extracts the host that the link targets, and uses this to build up a picture of which hosts have...UKWA Open Data
archive, 1996-2012, web domain dataset, JISC UK, host link graph, and UKWA Open Data
-
Dataset
JISC UK Web Domain Dataset Geoindex. 1996 - 2010. TSV.
The dataset comprises ~2.5 billion 200 OK responses in the 1996 - 2010 tranche of the JISC UK Web Domain Dataset Dataset which have been scanned for geographic references - specifically postcodes. This set of postcode citations, found at particular URLs and crawled at particular times, forms an historical geoindex...UK Web Archive
archive, 1996-2011, JISC UK, geoindex, UKWA Open Data, and web domain dataset
-
Research report
UK Web Archive Statistics September 2017 Quarterly Report
This document replaces our monthly “Web Archiving Statistics” report. Following a review of the monthly report, and in consultation with stakeholders, the new document aims to convey a clearer, more narrative account of our web archiving statistics, reflecting key figures such as growth in our curated titles and usage of...Lelkes-Rarugal, Carlos
-
Research report
UK Web Archive Statistics December 2017 Quarterly Report
This document replaces our monthly “Web Archiving Statistics” report and is the second report in this new format. Following a review of the monthly report, and in consultation with stakeholders, the new document aims to convey a clearer, more narrative account of our web archiving statistics, reflecting key figures such...Lelkes-Rarugal, Carlos
-
Book
Quality Assurance Guide for the UK Web Archive: An introduction to reviewing the quality of archived websites through ACT
This guidebook contains strategies that can be used to support Quality Assurance (QA) undertaken with the UK Web Archive’s Annotation Curation Tool (ACT). The guide has several aims: to provide an introduction to web archiving generally and to the UK Web Archive in particular; to provide hints and tips on...Bingham, Nicola ; Byrne, Helena
-
Research report
UKWA Interim Report Second Quarter 2018
This is the second instalment of the Web Archiving Statistics Quarterly Report. Following a review of the previous format of the monthly report, and in consultation with stakeholders, this document aims to convey a clearer, more narrative account of our web archiving statistics, reflecting key figures such as growth in...Lelkes-Rarugal, Carlos
-
Research report
UKWA Interim Statistics Report First Quarter 2018
Covering months: April, May and June (2018).Lelkes-Rarugal, Carlos