Revision 8 as of 2012-09-05 15:23:54
Browse in data from the second event harvest only
This page describes how to look at data harvested in the second event harvest.
Do the following in a browser that is set up with local port forwarding (see http://netarchive.dk/suite/NetarkivInstallStd):
Go to http://$GUIadminserver:$http-port/HarvestDefinition/
- where GUIadminserver and http-port are specified in the deploy configuration file under the application named dk.netarkivet.common.webinterface.GUIApplication.
In the one-machine setup (deploy_example_one_machine.xml), the link will be: http://localhost:8074
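As a minimal sketch, the address can be assembled from the two deploy-configuration values. The function name and its parameters are hypothetical and only mirror the $GUIadminserver and $http-port placeholders above; the values used in the example are those of the one-machine setup.

```python
def harvest_definition_url(gui_admin_server: str, http_port: int) -> str:
    """Build the HarvestDefinition address from the deploy-configuration values.

    gui_admin_server and http_port are hypothetical parameter names that stand
    for the $GUIadminserver and $http-port placeholders in the text.
    """
    return f"http://{gui_admin_server}:{http_port}/HarvestDefinition/"


# One-machine setup (deploy_example_one_machine.xml): localhost, port 8074.
print(harvest_definition_url("localhost", 8074))
# prints http://localhost:8074/HarvestDefinition/
```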
Look at data from the <eh. name> harvest
Click 'Definitions'->'Selective Harvests' in the left menu
Click 'History' in column 6 on the line with event harvest <eh. name>
- Click 'Show jobs' in column 'Total number of jobs' on the line with 'Run number' 1
- Click 'Select these jobs for QA with viewerproxy' (it may take some time to create the page)
- Check the following in the 'Current Viewerproxy status'
- No errors are reported
- Check that the message "Currently does _not_ collect missing URLs." appears
- Check that the message "Current list of missing URLs contains 0 URLs." appears
Check that there is a line stating that the index in use is from harvest <eh. name>, run 0, and is built on the jobs being looked at.
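If the status text is copied out of the page, the two fixed messages can be checked mechanically. The sketch below is only an illustration: the function name is hypothetical, and it assumes the page renders the emphasised word as plain "not". The "no errors" check and the index line, which depends on <eh. name>, are left to the tester.

```python
# Messages expected in the 'Current Viewerproxy status' section.
# Assumption: the emphasised "_not_" is rendered as plain "not" on the page.
EXPECTED_STATUS_LINES = [
    "Currently does not collect missing URLs.",
    "Current list of missing URLs contains 0 URLs.",
]


def missing_status_lines(status_text: str) -> list[str]:
    """Return the expected messages that do not occur in status_text."""
    # Collapse line breaks and repeated spaces so page layout does not matter.
    normalized = " ".join(status_text.split())
    return [line for line in EXPECTED_STATUS_LINES if line not in normalized]


sample = """Currently does not collect missing URLs.
Current list of missing URLs contains 0 URLs."""
print(missing_status_lines(sample))  # prints []
```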
- Open a new tab or window in the browser (optional, but use the same kind of browser)
Go to page http://www.netarkivet.dk
- Check that an error occurs saying that www.netarkivet.dk was not found
Go to page http://www.kaarefc.dk
- Check that this page contains data
Click on a local link (e.g. http://www.kaarefc.dk/wop/, the link for 'Here').
- Check that this page contains data
Go to page http://indvandrerbiblioteket.dk
- Check that an error occurs saying that www.indvandrerbiblioteket.dk was not found
Go to page http://kb-prod-udv-001.kb.dk/netarchivesuite/clock.php
- Check that a page containing the date and time of the second harvest appears (Note: "Refresh" may be necessary)
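The page checks above could be sketched as a small script that requests each address through the proxy and compares the outcome with the expectation. This is an illustration under stated assumptions, not part of NetarchiveSuite: the helper names are hypothetical, the proxy address must be supplied by the tester, and it is assumed that a page missing from the index is answered with an error or a non-200 status. The date and time on the clock page still have to be read by eye.

```python
import urllib.error
import urllib.request
from typing import Callable, Dict, List

# Expected outcome per address, taken from the steps above:
# True means the page should contain data, False means "not found" is expected.
EXPECTATIONS: Dict[str, bool] = {
    "http://www.netarkivet.dk": False,
    "http://www.kaarefc.dk": True,
    "http://www.kaarefc.dk/wop/": True,
    "http://indvandrerbiblioteket.dk": False,
    "http://kb-prod-udv-001.kb.dk/netarchivesuite/clock.php": True,
}


def make_proxy_fetch(proxy_url: str) -> Callable[[str], bool]:
    """Return a function telling whether a URL yields data through the proxy.

    Assumption: a page that is not in the index is answered with an HTTP
    error or a non-200 status rather than with a normal page.
    """
    opener = urllib.request.build_opener(
        urllib.request.ProxyHandler({"http": proxy_url})
    )

    def fetch(url: str) -> bool:
        try:
            with opener.open(url, timeout=30) as response:
                return response.status == 200 and len(response.read()) > 0
        except (urllib.error.URLError, OSError):
            return False

    return fetch


def check_pages(fetch: Callable[[str], bool],
                expectations: Dict[str, bool] = EXPECTATIONS) -> List[str]:
    """Return the addresses whose outcome differs from the expectation."""
    return [url for url, expected in expectations.items()
            if fetch(url) != expected]
```

A run would look like `check_pages(make_proxy_fetch(proxy_address))`, where `proxy_address` is whatever address the browser's proxy setting points to; an empty list means every page behaved as described. Passing the fetch function in separately keeps the comparison logic testable without network access.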