Differences between revisions 3 and 4
Revision 3 as of 2009-10-29 08:06:22
Size: 2042
Editor: TueLarsen
Revision 4 as of 2010-01-27 12:10:35
Size: 1987

Browse in data from the second event harvest only

This page describes how to look at data harvested in the second event harvest.

Do the following in a browser that is set up to use the local forwarded port: Start Program

Look at data from the <eh. name> harvest

  • Click 'Definitions'->'Selective Harvests' in the left menu

  • Click 'History' in column 6 on the line with event harvest <eh. name>

  • Click 'Show jobs' in column 'Total number of jobs' on the line with 'Run number' 1
  • Click 'Select these jobs for QA with viewerproxy' (it may take some time to create page)
  • Check the following in the 'Current Viewerproxy status'
    • No errors are reported
    • Check that the line "Currently does _not_ collect missing URLs." appears
    • Check that the line "Current list of missing URLs contains 0 URLs." appears
    • Check that there is a line stating that the index used is from harvest <eh. name>, run 0, and is built on the jobs being looked at.

  • Open a new tab or window in the browser (optionally in another browser, as long as it is the same kind of browser)
  • Go to page =http://www.netarkivet.dk=

  • Check that an error occurs saying that www.netarkivet.dk was not found
  • Go to page =http://www.kaarefc.dk=

  • Check that this page contains data
  • Click on a local link (e.g. =http://www.kaarefc.dk/wop/= in the link for 'Here').

  • Check that this page contains data
  • Go to page =http://indvandrerbiblioteket.dk=

  • Check that an error occurs saying that www.indvandrerbiblioteket.dk was not found
  • Go to page =http://sb-test-net-001.statsbiblioteket.dk/website/testsite/clock.php=

  • Check that a page containing the date and time of the second harvest appears (note: a "Refresh" may be necessary)
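
The URL checks above could also be scripted. The following is only a minimal sketch, not part of the documented test procedure. It assumes that the viewerproxy is reachable as an ordinary HTTP proxy, and that a "not found" result shows up either as an HTTP error status or as a page containing the words "not found"; both are assumptions, as are the helper names and the example proxy address. The URLs and the expected outcomes are taken from the steps above.

```python
from typing import Callable, Dict, List, Tuple
import urllib.error
import urllib.request

# Expected outcome per URL, taken from the checklist:
#   True  = the page should contain data
#   False = the viewerproxy should report the page as not found
EXPECTATIONS: Dict[str, bool] = {
    "http://www.netarkivet.dk": False,
    "http://www.kaarefc.dk": True,
    "http://www.kaarefc.dk/wop/": True,
    "http://indvandrerbiblioteket.dk": False,
    "http://sb-test-net-001.statsbiblioteket.dk/website/testsite/clock.php": True,
}

# A fetcher takes a URL and returns (HTTP status, response body).
Fetcher = Callable[[str], Tuple[int, str]]


def make_proxy_fetcher(proxy: str) -> Fetcher:
    """Build a fetcher that sends requests through an HTTP proxy.

    `proxy` is a hypothetical address such as "http://localhost:8070";
    use whatever local forwarded port the browser is configured with.
    """
    opener = urllib.request.build_opener(
        urllib.request.ProxyHandler({"http": proxy})
    )

    def fetch(url: str) -> Tuple[int, str]:
        try:
            with opener.open(url, timeout=30) as resp:
                return resp.status, resp.read().decode("utf-8", errors="replace")
        except urllib.error.HTTPError as err:
            # HTTP error responses still carry a status code and a body.
            return err.code, err.read().decode("utf-8", errors="replace")

    return fetch


def has_data(status: int, body: str) -> bool:
    """Assumption: 'not found' appears as an error status or in the page text."""
    return status < 400 and "not found" not in body.lower()


def find_mismatches(fetch: Fetcher) -> List[str]:
    """Return the URLs whose outcome differs from the checklist."""
    return [
        url
        for url, expected in EXPECTATIONS.items()
        if has_data(*fetch(url)) != expected
    ]
```

Calling `find_mismatches(make_proxy_fetcher("http://localhost:8070"))` with the real proxy address would return an empty list if every URL behaves as the checklist expects. The sketch does not verify that the clock page shows the date and time of the second harvest, so that step still needs to be checked by eye.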

It18BrowseOnlyJob2 (last edited 2012-09-05 15:23:54 by SoerenCarlsen)