1. Introduction

edit

This System Design description is still undergoing restructuring. It is intended that those parts which are complete should be accurate.

This document only describes the underlying design of the NetarchiveSuite software, i.e. it does not describe how to install, run, or use NetarchiveSuite, for that see the Installation Manual and the User Manual.

The first section gives an overview, and the remainder of the document gives more details about the design.

The reader is expected to be familiar with Java programming and have an understanding of the core issues involved in large-scale web harvesting. Having used Heritrix before is a definite plus, and an elementary understanding of SQL databases is required for some parts.

The code is available in the downloaded package (see Release Overview) or from our subversion repository.

System Design 3.16/Introduction (last edited 2011-04-26 11:21:10 by MikisSethSorensen)