Heritrix Configurations

For configuration related to NetarchiveSuite, please refer to the section on Configure Heritrix Process.

For more specific Heritrix configurations, please refer to Appendix B and Appendix C of this document.

By default, crawling in NetarchiveSuite uses deduplication. This feature, and how to disable it, is described in the Configuration Manual, Section 8.1.2.

Configuration Manual 3.14/Heritrix Configurations (last edited 2010-08-16 10:24:35 by localhost)