Heritrix Configurations
For configuration related to NetarchiveSuite, please refer to section on Configure Heritrix Process.
For more specific Heritrix configurations, please refer to appendix B and appendix C of this document.
The crawling in NetarchiveSuite uses by default Deduplication. This feature and how to disable it is described in Configuration Manual, Section 8.1.2.