Differences between revisions 1 and 18 (spanning 17 versions)
Revision 1 as of 2009-10-28 09:53:35
Size: 1806
Editor: TueLarsen
Comment:
Revision 18 as of 2012-09-05 15:23:12
Size: 1938
Comment:
Line 1: Line 1:
---+ Browse in data from the first event harvest only
'''Browse in data from the first event harvest only'''
Line 5: Line 5:
---++ Do the following in a browser that is set up with local port forwarding:
Start program
   * Go to =http://kb-test-adm-001.kb.dk:807?/HarvestDefinition/= (where ‘807?’ is the port number)
Look at data from the <eh. name> harvest
   * Click 'Definitions'->'Selective Harvests' in the left menu
   * Click 'History' in column 7 on the line with the event harvest <eh. name>
   * Click 'Show jobs' in column 'Total number of jobs' on the line with 'Run number' 0
   * Click 'Select these jobs for QA with viewerproxy' (it may take some time to create the page)
   * Check the following in the 'Current Viewerproxy status'
      * No errors are reported
      * Check that "Currently does _not_ collect missing URLs." appears
      * Check that "Current list of missing URLs contains 0 URLs." appears
      * Check that there is a line stating that the index in use is from harvest <eh. name>, run 0, and is built on the jobs being looked at.
   * Open a new tab or window in the browser (optional; use the same kind of browser)
   * Go to page =http://www.netarkivet.dk=
   * Check that an error occurs saying that www.netarkivet.dk was not found
   * Go to page =http://www.kaarefc.dk=
   * Check that this page contains data
   * Click on a local link (e.g. =http://www.kaarefc.dk/wop= in the link for 'Here').
   * This page should exist.
   * Go to page =http://indvandrerbiblioteket.dk=
   * Check that an error occurs saying that www.indvandrerbiblioteket.dk was not found
   * Go to page =http://sb-test-net-001.statsbiblioteket.dk/website/testsite/clock.php=
   * Check that a page containing date and time of the first harvest appears
-- Main.kfc - 27 Apr 2006
Do the following in a browser that is set up with local port forwarding (http://netarchive.dk/suite/NetarkivInstallStd)

 * Go to http://$GUIadminserver:$http-port/HarvestDefinition/
  . where GUIadminserver and http-port are specified in the deploy configuration file under the application named dk.netarkivet.common.webinterface.GUIApplication
  . In the one-machine setup (deploy_example_one_machine.xml) the link will be: http://localhost:8074
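
As a minimal sketch of how the address above is put together, the following Python helper joins the two deploy settings into the GUI URL. The function name and parameter names are hypothetical and only for illustration; the values `localhost` and `8074` come from the one-machine example mentioned above.

```python
def harvest_definition_url(gui_admin_server: str, http_port: int) -> str:
    """Build the HarvestDefinition GUI address from the two deploy settings.

    gui_admin_server and http_port are illustrative names for the values
    configured under dk.netarkivet.common.webinterface.GUIApplication.
    """
    return f"http://{gui_admin_server}:{http_port}/HarvestDefinition/"


if __name__ == "__main__":
    # One-machine setup from deploy_example_one_machine.xml
    print(harvest_definition_url("localhost", 8074))
```

For the one-machine setup this yields the same host and port as the link given above, with the `/HarvestDefinition/` path appended.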
Look at data from the <eh. name> harvest

 * Click 'Definitions'->'Selective Harvests' in the left menu
 * Click 'History' in column 7 on the line with the event harvest <eh. name>
 * Click 'Show jobs' in column 'Total number of jobs' on the line with 'Run number' 0
 * Click 'Select these jobs for QA with viewerproxy' (it may take some time to create the page)
 * Check the following in the 'Current Viewerproxy status'
  * No errors are reported
  * Check that "Currently does _not_ collect missing URLs." appears
  * Check that "Current list of missing URLs contains 0 URLs." appears
  * Check that there is a line stating that the index in use is from harvest <eh. name>, run 0, and is built on the jobs being looked at.
 * Open a new tab or window in the browser (optional; use the same kind of browser)
 * Go to page http://www.netarkivet.dk
 * Check that an error occurs saying that www.netarkivet.dk was not found (DOES NOT WORK: NAS-2076)
 * Go to page http://www.kaarefc.dk
 * Check that this page contains data
 * Go to page http://www.kaarefc.dk/wop/
 * This page should exist.
 * Go to page http://indvandrerbiblioteket.dk
 * Check that an error occurs saying that www.indvandrerbiblioteket.dk was not found
 * Go to page http://kb-prod-udv-001.kb.dk/netarchivesuite/clock.php
 * Check that a page containing date and time of the first harvest appears

Browse in data from the first event harvest only

This page describes how to look at data harvested in the first event harvest.

Do the following in a browser that is set up with local port forwarding (http://netarchive.dk/suite/NetarkivInstallStd)

  • Go to http://$GUIadminserver:$http-port/HarvestDefinition/
      where GUIadminserver and http-port are specified in the deploy configuration file under the application named dk.netarkivet.common.webinterface.GUIApplication
      In the one-machine setup (deploy_example_one_machine.xml) the link will be: http://localhost:8074

Look at data from the <eh. name> harvest

  • Click 'Definitions'->'Selective Harvests' in the left menu

  • Click 'History' in column 7 on the line with the event harvest <eh. name>

  • Click 'Show jobs' in column 'Total number of jobs' on the line with 'Run number' 0
  • Click 'Select these jobs for QA with viewerproxy' (it may take some time to create the page)
  • Check the following in the 'Current Viewerproxy status'
    • No errors are reported
    • Check that "Currently does _not_ collect missing URLs." appears
    • Check that "Current list of missing URLs contains 0 URLs." appears
    • Check that there is a line stating that the index in use is from harvest <eh. name>, run 0, and is built on the jobs being looked at.

  • Open a new tab or window in the browser (optional; use the same kind of browser)
  • Go to page http://www.netarkivet.dk

  • Check that an error occurs saying that www.netarkivet.dk was not found (DOES NOT WORK: NAS-2076)
  • Go to page http://www.kaarefc.dk

  • Check that this page contains data
  • Go to page http://www.kaarefc.dk/wop/

  • This page should exist.
  • Go to page http://indvandrerbiblioteket.dk

  • Check that an error occurs saying that www.indvandrerbiblioteket.dk was not found
  • Go to page http://kb-prod-udv-001.kb.dk/netarchivesuite/clock.php

  • Check that a page containing date and time of the first harvest appears
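
The browsing checks above could also be scripted. The following Python sketch lists the expected outcome for each URL and reports any deviation. It rests on assumptions that are not stated on this page: that the viewerproxy can be reached as an ordinary HTTP proxy at an address you supply, and that "was not found" shows up as a non-200 HTTP status. The names `EXPECTED`, `find_mismatches` and `make_proxy_fetcher` are hypothetical.

```python
import urllib.error
import urllib.request
from typing import Callable, List, Tuple

# (URL, should the page be found?) as described in the checklist above.
# Note: the first check is marked "DOES NOT WORK: NAS-2076" in the checklist.
EXPECTED: List[Tuple[str, bool]] = [
    ("http://www.netarkivet.dk", False),
    ("http://www.kaarefc.dk", True),
    ("http://www.kaarefc.dk/wop/", True),
    ("http://indvandrerbiblioteket.dk", False),
    ("http://kb-prod-udv-001.kb.dk/netarchivesuite/clock.php", True),
]


def find_mismatches(is_found: Callable[[str], bool]) -> List[Tuple[str, bool, bool]]:
    """Return (url, expected, actual) for every URL that deviates."""
    mismatches = []
    for url, expected in EXPECTED:
        actual = is_found(url)
        if actual != expected:
            mismatches.append((url, expected, actual))
    return mismatches


def make_proxy_fetcher(proxy: str, timeout: float = 30.0) -> Callable[[str], bool]:
    """Create an is_found() function that fetches through the given proxy.

    Assumption: a missing page is signalled by an HTTP error status or a
    failed connection rather than by a 200 response with an error text.
    """
    opener = urllib.request.build_opener(
        urllib.request.ProxyHandler({"http": proxy})
    )

    def is_found(url: str) -> bool:
        try:
            with opener.open(url, timeout=timeout) as response:
                return response.status == 200
        except urllib.error.URLError:  # also covers HTTPError
            return False

    return is_found
```

A run would look like `find_mismatches(make_proxy_fetcher("http://<proxy-host>:<proxy-port>"))`, where the proxy address is whatever your local port forwarding exposes. An empty list means every page behaved as the checklist expects. The clock.php check still needs a manual look, since the sketch does not verify that the date and time shown are those of the first harvest.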

It18BrowseOnlyJob1 (last edited 2012-09-05 15:23:12 by SoerenCarlsen)