⇤ ← Revision 1 as of 2010-05-04 13:16:11
590
Comment: Generated documentation branch for 3.14
|
← Revision 2 as of 2010-08-16 10:24:35 ⇥
593
converted to 1.6 markup
|
Deletions are marked like this. | Additions are marked like this. |
Line 2: | Line 2: |
[[Action(edit)]] | <<Action(edit)>> |
Line 4: | Line 4: |
For configuration related to !NetarchiveSuite, please refer to section on [:Configuration Manual 3.14#ConfigureHeritrixProcess:Configure Heritrix Process]. | For configuration related to !NetarchiveSuite, please refer to section on [[Configuration Manual 3.14#ConfigureHeritrixProcess|Configure Heritrix Process]]. |
Line 6: | Line 6: |
For more specific Heritrix configurations, please refer to [:Configuration Manual 3.14#ManagingHeritrixHarvestTemplates:appendix B] and [:Configuration Manual 3.14#MigrateHeritrixTemplatesTo36:appendix C] of this document. | For more specific Heritrix configurations, please refer to [[Configuration Manual 3.14#ManagingHeritrixHarvestTemplates|appendix B]] and [[Configuration Manual 3.14#MigrateHeritrixTemplatesTo36|appendix C]] of this document. |
Heritrix Configurations
For configuration related to NetarchiveSuite, please refer to section on Configure Heritrix Process.
For more specific Heritrix configurations, please refer to appendix B and appendix C of this document.
The crawling in NetarchiveSuite uses by default Deduplication. This feature and how to disable it is described in Configuration Manual, Section 8.1.2.