Browse in data from the first event harvest only
This page describes how to look at data harvested in the first event harvest
Do the following in a browser that is set up to be local forward port (http://netarchive.dk/suite/NetarkivInstallStd)
Go to http://$GUIadminserver:$http-port/HarvestDefinition/
- where GUIadminserver and http-port are specified in the deploy configuration file under the application named dk.netarkivet.common.webinterface.GUIApplication
In the one-machine setup (deploy_example_one_machine.xml ) the link will be : http://localhost:8074
Look at data from the <eh. name> harvest
Click 'Definitions'->'Selective Harvests' in the left menu
Click 'History' in column 7 on the line with the event harvest <eh. name>
- Click 'Show jobs' in column 'Total number of jobs' on the line with 'Run number' 0
- Click 'Select these jobs for QA with viewerproxy' (it may take some time to create page)
- Check following in the 'Current Viewerproxy status'
- No errors are reported
- Check the "Currently does _not_ collect missing URLs." appear
- Check the "Current list of missing URLs contains 0 URLs."
Check there is a line expressing index used from harvest <eh. name>, run 0 and built on jobs being looked at.
- Open a New tab or window in the browser (optionally, and in same kind of browser)
Go to page http://www.netarkivet.dk
- Check that an error occurs saying that www.netarkivet.dk was not found (DOES NOT WORK: NAS-2076)
Go to page http://www.kaarefc.dk
- Check that this page contains data
Go to page http://www.kaarefc.dk/wop/
- This page should exist.
Go to page http://indvandrerbiblioteket.dk
- Check that an error occurs saying that www.indvandrerbiblioteket.dk was not found
Go to page http://kb-prod-udv-001.kb.dk/netarchivesuite/clock.php
- Check that a page containing date and time of the first harvest appears