System overview

edit

The primary function of the NetarchiveSuite is to plan, schedule and archive web harvests of parts of the internet. We use Heritrix as our webcrawler. The NetarchiveSuite can organize three different kinds of harvests:

The NetarchiveSuite is split into three main modules corresponding to harvesting, archiving and accessing via viewerproxy.

Please refer to the overview description for more details.

Quick Start Manual 3.14/System overview (last edited 2010-08-16 10:24:35 by localhost)