1363
Comment:
|
1474
|
Line 4: | Line 4: |
Introduction:: | Introduction (CHH):: * Overall presentation of Netarkivet.dk and !NetarchiveSuite |
Line 6: | Line 7: |
* Expectations * Review/update of agenda |
* Expectations (CLO) * Review/update of agenda (CLO) |
Line 9: | Line 10: |
Step by step experience from Netarchive.dk concerning broad crawls:: | Step by step experience from Netarchive.dk concerning broad crawls (CLO):: |
Line 26: | Line 27: |
User management NetarchiveSuite:: | User management NetarchiveSuite (CLO):: |
Line 30: | Line 31: |
Statistics module:: | Statistics module (CLO):: |
Line 34: | Line 35: |
Access:: | Access (CLO):: |
Line 38: | Line 39: |
Collection:: | Collection (CLO):: |
Preliminary Agenda items (proposals) for the non-technical workshop
- Introduction (CHH)
Overall presentation of Netarkivet.dk and NetarchiveSuite
- Scheduling
- Responsibilities, roles of participants in a broad crawl
- Defining crawl target (number of URLs, scope, seed lists, politeness, budget...)
- Dealing with junk data
- Sorting and splitting seed lists into different jobs
- Running test crawl
- Using frontier reports
- Modifying settings, creating overrides
- Visual QA
- Running a patch crawl
Different sets of roles using NetarchiveSuite