Differences between revisions 9 and 10
Revision 9 as of 2009-03-26 15:11:25
Size: 906
Comment:
Revision 10 as of 2009-03-27 09:13:51
Size: 1161
Comment:


Preliminary Agenda items (proposals) for the non-technical workshop

Introduction
  • Presentation of participants
  • Expectations
  • Review/update of agenda
  • Step-by-step experience from Netarchive.dk with broad crawls
  • Preparation
  • Selection of sites
  • How to manage deduplication
  • Actual impact on computing and storage
  • Experience during the crawl
  • QA
  • Metrics from past domain crawls: how much, how many, how fast, etc.
  • Collection
  • What's a collection?
  • User management in NetarchiveSuite
  • Different sets of roles for using NetarchiveSuite
  • A simple user interface for people who are not very familiar with web archiving
  • Statistics module
  • Basis for all kinds of calculations and general information about the web archive
  • Comparing results of crawls for quality control
  • WARC
  • Has your institution decided to use WARC?
    • When do you expect to start storing harvested material in WARC?
  • Has your institution decided to convert existing ARC files to WARC?
    • When do you expect to start the conversion process?

PreliminaryAgendaItemsNonTech (last edited 2010-08-16 10:24:58 by localhost)