== Tools in Harvester Module ==

=== dk.netarkivet.harvester.tools.CreateCDXMetadataFile (deprecated) ===
Given a specific jobID (e.g. 42), this tool creates a metadata-1.arc file containing the CDX entries for all ARC files belonging to that job.

==== Prerequisites and arguments ====
You need to specify the repository client used for accessing your archived data. If you use the default client, JMSArcRepositoryClient, you also need to specify the archive replica to use (defined by the setting "settings.common.useReplicaId"), the environmentName, the applicationName, and the applicationInstanceId. These can all be defined on the command line as overrides of the default values, or in a local settings.xml file.

Jar files needed in the classpath: dk.netarkivet.harvester.jar and, if using the default repository client, dk.netarkivet.archive.jar.

The tool takes a single argument: the jobID.

==== Sample usage of this tool ====
{{{
export INSTALLDIR=/home/test/netarchive
export CLASSPATH=$INSTALLDIR/lib/dk.netarkivet.harvester.jar:$INSTALLDIR/lib/dk.netarkivet.archive.jar
java -Ddk.netarkivet.settings.file=localsettings.xml dk.netarkivet.harvester.tools.CreateCDXMetadataFile 42
}}}

=== dk.netarkivet.harvester.tools.CreateLogsMetadataFile (deprecated) ===
Originally, the metadata-1.arc files did not include the Heritrix logs. This tool was written to create a metadata-2.arc file containing the Heritrix logs associated with a given job. Consider this tool deprecated. For further information, see the javadoc for this class.