Differences between revisions 3 and 5 (spanning 2 versions)
Revision 3 as of 2009-10-23 13:24:59
Size: 594
Editor: TueLarsen
Comment:
Revision 5 as of 2010-08-16 10:25:09
Size: 592
Editor: localhost
Comment: converted to 1.6 markup
Deletions are marked like this. Additions are marked like this.
Line 2: Line 2:
[[Action(edit)]] <<Action(edit)>>
Line 4: Line 4:
For configuration related to !NetarchiveSuite, please refer to section on [:Configuration Manual 3.10#ConfigureHeritrixProcess:Configure Heritrix Process]. For configuration related to !NetarchiveSuite, please refer to section on [[Configuration Manual 3.10#ConfigureHeritrixProcess|Configure Heritrix Process]].
Line 6: Line 6:
For more specific Heritrix configurations, please refer to [:Configuration Manual 3.10#ManagingHeritrixHarvestTemplates:appendix B] and [:Configuration Manual 3.10#MigrateHeritrixTemplatesTo36:appendix C] of this document. For more specific Heritrix configurations, please refer to [[Configuration Manual 3.10#ManagingHeritrixHarvestTemplates|appendix B]] and [[Configuration Manual 3.10#MigrateHeritrixTemplatesTo36|appendix C]] of this document.
Line 8: Line 8:
The crawling in NetarchiveSuite uses by default Deduplication. This feature and how to disable it is described in (cf. Configuration Manual, Section 8.1.2). The crawling in NetarchiveSuite uses by default Deduplication. This feature and how to disable it is described in Configuration Manual, Section 8.1.2.

Heritrix Configurations

edit

For configuration related to NetarchiveSuite, please refer to section on Configure Heritrix Process.

For more specific Heritrix configurations, please refer to appendix B and appendix C of this document.

The crawling in NetarchiveSuite uses by default Deduplication. This feature and how to disable it is described in Configuration Manual, Section 8.1.2.

Configuration Manual 3.10/Heritrix Configurations (last edited 2010-08-16 10:25:09 by localhost)