411
Comment:
|
774
|
Deletions are marked like this. | Additions are marked like this. |
Line 11: | Line 11: |
Experience from Netarchive.dk concerning broad crawls:: * Preparation procedure |
Step by step experience from Netarchive.dk concerning broad crawls:: * Preparation * How to manage deduplication * Actual impact on computing and storage |
Line 15: | Line 17: |
* Metrics from the past domain crawls : how much, how many, how fast, etc. | |
Line 16: | Line 19: |
Support of WARC:: * Status of support of WARC in !NetarchiveSuite * Experience with ARC -> WARC tools * Status of transferring old webarchives into Netarkivet.dk |
|
Line 18: | Line 24: |
Collection:: * What's a collection? |
Preliminary Agenda items for the non-technical workshop
- Introduction
Status of support of WARC in NetarchiveSuite
Experience with ARC -> WARC tools