Differences between revisions 6 and 7
Revision 6 as of 2010-08-16 10:25:11
Size: 3866
Editor: localhost
Comment: converted to 1.6 markup
Revision 7 as of 2011-10-31 17:16:23
Size: 3934
Comment:
Deletions are marked like this. Additions are marked like this.
Line 12: Line 12:
Line 19: Line 20:
 * Check that you can change number of rows to be displayed per page e.g. 1 and 
 * Check that you can press next and previous page and 
 * Check that the reset button resets all changes to default
 * Check that you can change number of rows to be displayed per page e.g. 1 and
 * Check that you can press next and previous page and
 * Check that the reset button resets all changes to default(note that the display value is also blanked, but is 100 by default)

Verify that the harvest is activated and done

This page describes how to verify that a harvest is carried out correctly.

Do the following in a browser:

Start Program

Verify job status

  • Click 'Harvest status'->'All Jobs' in the left menu

  • Select "All" in "Only display job status" to the right from the menu
  • Click the "Show" button, until the jobs have stepped through statuses "NEW", "SUBMITTED", "STARTED", "DONE"
  • Wait until all jobs have got status "DONE"
  • Check that you can search on Harvest name, start and end date
  • Check that you can change number of rows to be displayed per page e.g. 1 and
  • Check that you can press next and previous page and
  • Check that the reset button resets all changes to default(note that the display value is also blanked, but is 100 by default)
  • Check the following for the domains =raeder.dk= and =statsbiblioteket.dk=: (Using page Harvest Status -> All jobs per domain)

    • Check that the domain has been harvested by one job of the name <eh. name>

    • Check that this job has configuration <eh. name>_forsider

    • Check that there is a number for 'Run number' and 'Job ID'
    • Check that the 'Start time' and 'End time' columns approximately corresponds to time of test with <eh. name> harvest

    • Check that the 'Bytes Harvested' and 'Documents Harvested' columns contains positive numbers
    • Check that the 'Stopped due to' columns contain "Domain Completed"
  • Check the following job details for the domain =netarkivet.dk=: (Using page SelectiveHarvests->History->Run Number 0 ->JobID 1)FR 1108 + 1485 + 1146

    • Check that the 'Submit time', 'Start time' and 'End time' columns approximately corresponds to time of test with <eh. name> harvest FR 1146

    • Click on "Browse reports for jobs"
    • Check that you don't get any errors when you click on some of the links
    • Click on "Browse harvest files for job"
    • Check that you don't get any errors when you click on some of the links
    • Click on "Browse only relevant crawl-log lines for domain netarkivet.dk"
    • Check that you don't get any errors when you click on some of the links
  • Check the following for the domain =netarkivet.dk=: (Using page Harvest Status -> All jobs per domain)

    • Check that the domain has been harvested by 2 jobs of the name <eh. name>

    • Check that one of the jobs has configuration <eh. name>_forsider

    • Check that the 'Start time' and 'End time' columns approximately corresponds to time of test with <eh. name>

    • Check that one of the jobs has configuration <eh. name>_forsider_plus_2niveauer

    • Check that the 'Start time' and 'End time' approximately corresponds to time of test with <eh. name> harvest

    • Check that 'Run number' and 'Job ID' columns contains positive numbers
    • Check that the 'Bytes Harvested' and 'Documents Harvested' columns contains positive numbers
    • Check that the 'Stopped due to' columns contains "Domain Completed"
    • Check the following for the domain =kaarefc.dk=: (Using page Harvest Status -> All jobs per domain)

      • Check that the domain has been harvested by 1 job of the name <eh. name>

      • Check that the job has configuration <eh. name>_forsider_plus_2niveauer

      • Check that the 'Start time' and 'End time' approximately corresponds to time of test with <eh. name> harvest

      • Check that 'Run number' and 'Job ID' columns contains positive numbers
      • Check that the 'Bytes Harvested' and 'Documents Harvested' columns contains positive numbers
      • Check that the 'Stopped due to' columns contains "Domain Completed"

It15VerifyNormalOps (last edited 2013-01-04 16:46:17 by SoerenCarlsen)