== System overview ==
<<Action(edit)>>

The primary function of the !NetarchiveSuite is to plan, schedule and archive web harvests of parts of the internet. We use Heritrix as our webcrawler. The !NetarchiveSuite can organize three different kinds of harvests:

 * Event harvesting (organize harvests of a set of domains related to a specific event, e.g. 9/11, Royal Weddings, and Elections).
 * Selective harvesting (recurrent harvests of a set of domains).
 * Snapshot harvesting (organizing a complete snapshot of all known domains)
The !NetarchiveSuite is split into three main modules corresponding to harvesting, archiving and accessing via viewerproxy.

 . {{attachment:Overview 3.14/Netarchive_structure_simplified2.png}}
Please refer to the [[Overview 3.14|overview]] description for more details.