Differences between revisions 1 and 8 (spanning 7 versions)
Revision 1 as of 2009-10-28 09:54:37
Size: 1812
Editor: TueLarsen
Comment:
Revision 8 as of 2012-09-05 15:23:54
Size: 1997
Comment:
Line 1: Line 1:
---+ Browse in data from the second event harvest only
'''Browse in data from the second event harvest only'''
Line 5: Line 5:
---++ Do following in a browser that is set up to be local forward port:
Start program
   * Go to =http://kb-test-adm-001.kb.dk:807?/HarvestDefinition/= (where '807?' is the port number)
Look at data from the <eh. name> harvest
   * Click 'Definitions'->'Selective Harvests' in the left menu
   * Click 'History' in column 6 on the line with event harvest <eh. name>
   * Click 'Show jobs' in column 'Total number of jobs' on the line with 'Run number' 1
   * Click 'Select these jobs for QA with viewerproxy' (it may take some time to create page)
   * Check following in the 'Current Viewerproxy status'
      * No errors are reported
      * Check the "Currently does _not_ collect missing URLs." appear
      * Check the "Current list of missing URLs contains 0 URLs."
      * Check there is a line expressing index used from harvest <eh. name>, run 0 and built on jobs being looked at.
   * Open a New tab or window in the browser (optionally, and in same kind of browser)
   * Go to page =http://www.netarkivet.dk=
   * Check that an error occurs saying that www.netarkivet.dk was not found
   * Go to page =http://www.kaarefc.dk=
   * Check that this page contains data
   * Click on a local link (e.g. =http://www.kaarefc.dk/wop in link for= 'Here').
   * Check that this page contains data
   * Go to page =http://indvandrerbiblioteket.dk=
   * Check that an error occurs saying that www.indvandrerbiblioteket.dk was not found
   * Go to page =http://sb-test-net-001.statsbiblioteket.dk/website/testsite/clock.php=
   * Check that a page containing date and time of the second harvest appears (Note: "Refresh" may be necessary)
Do the following in a browser that is set up to be local forward port (http://netarchive.dk/suite/NetarkivInstallStd)

 * Go to http://$GUIadminserver:$http-port/HarvestDefinition/
  . where GUIadminserver and http-port are specified in the deploy configuration file under the application named dk.netarkivet.common.webinterface.GUIApplication
  . In the one-machine setup (deploy_example_one_machine.xml) the link will be: http://localhost:8074
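
The placeholder substitution described above can be sketched in a few lines of Python. This is only an illustration: the function name `gui_url` and its parameters are hypothetical, and the host and port must still be read manually from the deploy configuration file.

```python
def gui_url(host: str, port: int, path: str = "/HarvestDefinition/") -> str:
    """Build the GUI address from the $GUIadminserver and $http-port values.

    `host` and `port` stand in for the values found in the deploy
    configuration file; this helper is hypothetical and not part of
    NetarchiveSuite.
    """
    return f"http://{host}:{port}{path}"


# One-machine setup (deploy_example_one_machine.xml): host localhost, port 8074.
print(gui_url("localhost", 8074))
```

For the one-machine setup this yields the `http://localhost:8074` address mentioned above, with the `/HarvestDefinition/` path appended.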
Look at data from the <eh. name> harvest

 * Click 'Definitions'->'Selective Harvests' in the left menu
 * Click 'History' in column 6 on the line with event harvest <eh. name>
 * Click 'Show jobs' in column 'Total number of jobs' on the line with 'Run number' 1
 * Click 'Select these jobs for QA with viewerproxy' (it may take some time to create page)
 * Check following in the 'Current Viewerproxy status'
  * No errors are reported
  * Check the "Currently does _not_ collect missing URLs." appear
  * Check the "Current list of missing URLs contains 0 URLs."
  * Check there is a line expressing index used from harvest <eh. name>, run 0 and built on jobs being looked at.
 * Open a New tab or window in the browser (optionally, and in same kind of browser)
 * Go to page http://www.netarkivet.dk
 * Check that an error occurs saying that www.netarkivet.dk was not found
 * Go to page http://www.kaarefc.dk
 * Check that this page contains data
 * Click on a local link (e.g. =http://www.kaarefc.dk/wop/ in link for= 'Here').
 * Check that this page contains data
 * Go to page http://indvandrerbiblioteket.dk
 * Check that an error occurs saying that www.indvandrerbiblioteket.dk was not found
 * Go to page http://kb-prod-udv-001.kb.dk/netarchivesuite/clock.php
 * Check that a page containing date and time of the second harvest appears (Note: "Refresh" may be necessary)

Browse in data from the second event harvest only

This page describes how to look at data harvested in the second event harvest

Do the following in a browser that is set up for local port forwarding (see http://netarchive.dk/suite/NetarkivInstallStd)

  • Go to http://$GUIadminserver:$http-port/HarvestDefinition/
      where GUIadminserver and http-port are specified in the deploy configuration file under the application named dk.netarkivet.common.webinterface.GUIApplication
      In the one-machine setup (deploy_example_one_machine.xml) the link will be: http://localhost:8074

Look at data from the <eh. name> harvest

  • Click 'Definitions'->'Selective Harvests' in the left menu
  • Click 'History' in column 6 on the line with event harvest <eh. name>
  • Click 'Show jobs' in column 'Total number of jobs' on the line with 'Run number' 1
  • Click 'Select these jobs for QA with viewerproxy' (it may take some time to create the page)
  • Check the following in the 'Current Viewerproxy status'
    • No errors are reported
    • Check that the line "Currently does _not_ collect missing URLs." appears
    • Check that the line "Current list of missing URLs contains 0 URLs." appears
    • Check that there is a line stating that the index in use is from harvest <eh. name>, run 0, and is built on the jobs being looked at.
  • Open a new tab or window in the browser (optionally, and in the same kind of browser)
  • Go to page http://www.netarkivet.dk
  • Check that an error occurs saying that www.netarkivet.dk was not found
  • Go to page http://www.kaarefc.dk
  • Check that this page contains data
  • Click on a local link (e.g. http://www.kaarefc.dk/wop/, the link for 'Here').
  • Check that this page contains data
  • Go to page http://indvandrerbiblioteket.dk
  • Check that an error occurs saying that www.indvandrerbiblioteket.dk was not found
  • Go to page http://kb-prod-udv-001.kb.dk/netarchivesuite/clock.php
  • Check that a page containing date and time of the second harvest appears (Note: "Refresh" may be necessary)
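
The browsing checks above can also be written down as a small table of expected outcomes, which makes it easier to record which step deviated. The following Python sketch is not part of NetarchiveSuite. The names `EXPECTED`, `check_browse_results` and `make_proxy_fetcher` are hypothetical, and the optional fetcher assumes that the viewerproxy answers a missing URL with an HTTP error status, which this page does not confirm. The manual check in the browser remains the reference.

```python
import urllib.error
import urllib.request
from typing import Callable, List

# Expected outcome per URL, taken from the steps above:
# True = the page should contain data, False = a "not found" error is expected.
EXPECTED = {
    "http://www.netarkivet.dk": False,
    "http://www.kaarefc.dk": True,
    "http://www.kaarefc.dk/wop/": True,
    "http://indvandrerbiblioteket.dk": False,
    "http://kb-prod-udv-001.kb.dk/netarchivesuite/clock.php": True,
}


def check_browse_results(has_data: Callable[[str], bool]) -> List[str]:
    """Return the URLs whose observed result differs from the expectation."""
    return [url for url, expected in EXPECTED.items() if has_data(url) != expected]


def make_proxy_fetcher(proxy_url: str, timeout: float = 30.0) -> Callable[[str], bool]:
    """Build a has_data callable that fetches pages through an HTTP proxy.

    Assumption: a missing URL is reported with an HTTP error status.
    `proxy_url` is whatever address the browser is configured to use.
    """
    opener = urllib.request.build_opener(
        urllib.request.ProxyHandler({"http": proxy_url})
    )

    def has_data(url: str) -> bool:
        try:
            with opener.open(url, timeout=timeout) as response:
                return response.status == 200 and len(response.read()) > 0
        except urllib.error.HTTPError:
            # 4xx/5xx answers are treated as "not found".
            return False

    return has_data
```

A tester could call `check_browse_results(make_proxy_fetcher(...))` with the proxy address used by the browser. An empty list would mean that every page behaved as the steps above expect.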

It18BrowseOnlyJob2 (last edited 2012-09-05 15:23:54 by SoerenCarlsen)