Preliminary Agenda items (proposals) for the non-technical workshop
- Introduction
Presentation of participants Expectations Review/update of agenda Step by step experience from Netarchive.dk concerning broad crawlsPreparation Selection of sites How to manage deduplication Actual impact on computing and storage Experience during the crawl QA Metrics from the past domain crawls : how much, how many, how fast, etc. CollectionWhat's a collection? User management NetarchiveSuiteDifferent set of roles using the NetarchiveSuite
A simple user interface for people who are not very familiar with webarchiving. Statistics moduleBase for all kinds of calculations and general information about the webarchive Comparing results of crawls for quality control. AccessComparison of legal basis regarding access Access with Wayback