Heritrix Configurations

For Heritrix configuration specific to NetarchiveSuite, please refer to the section Configure Heritrix Process.

For more specific Heritrix configurations, please refer to Appendix B and Appendix C of this document.

By default, crawling in NetarchiveSuite uses deduplication. This feature, and how to disable it, is described in the Configuration Manual, Section 8.1.2.

Configuration Manual 3.12/Heritrix Configurations (last edited 2010-08-16 10:24:40 by localhost)