Revision 8 as of 2010-08-16 10:24:31
Verify that no alias domains are harvested
Start Program
Go to http://$GUIadminserver:$http-port/HarvestDefinition/
- where GUIadminserver and http-port are specified in the deploy configuration file under the application named dk.netarkivet.common.webinterface.GUIApplication
In the one-machine setup (deploy_example_one_machine.xml), the link will be: http://localhost:8074
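As a small illustration of how the two deploy-configuration values combine into the address above, the following sketch builds the URL. The function name and parameter names are hypothetical and not part of NetarchiveSuite; reading the values out of the deploy configuration file is left to the tester.

```python
def harvest_definition_url(gui_admin_server, http_port):
    """Build the HarvestDefinition URL from the two deploy-configuration values.

    Both arguments are assumed to have been looked up by hand in the deploy
    configuration file (application dk.netarkivet.common.webinterface.GUIApplication).
    """
    return f"http://{gui_admin_server}:{http_port}/HarvestDefinition/"


if __name__ == "__main__":
    # Values for the one-machine setup mentioned above.
    print(harvest_definition_url("localhost", 8074))
```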
Wait for the job to be generated
- Choose 'Systemstate'
- Click on Application GUIWebServer
- Click 'Show all' in the header 'Index' to show all index lines for the application
- Check that a line with text "Created XXX jobs for harvest definition 'YYY'" is logged
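If the log lines are copied out of the GUI as text, the check for the job-creation message could be sketched as below. This is only an illustration: the function name is hypothetical, XXX is assumed to be an integer count, YYY is assumed to contain no single quote, and accepting the singular "job" is an extra assumption not stated in the procedure.

```python
import re

# Pattern for the log line quoted in the step above.
# Assumptions: XXX is an integer, YYY contains no single quote,
# and a singular "job" is tolerated as well as "jobs".
JOB_LINE = re.compile(r"Created (\d+) jobs? for harvest definition '([^']+)'")


def find_job_creation(log_lines):
    """Return a (job_count, harvest_name) tuple for every matching log line."""
    results = []
    for line in log_lines:
        match = JOB_LINE.search(line)
        if match:
            results.append((int(match.group(1)), match.group(2)))
    return results
```

An empty result means the job has not been generated yet (or the log text was not copied completely), so the tester should wait and check again.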
Verify that alias domain =netarkivet.dk= is not harvested
Click 'Definitions' -> 'Snapshot Harvests' in the left menu
Click 'History' in column 6 on the line with snapshot harvest <snh2. name>
- Click 'Show jobs' in column 'Total number of jobs'
- For all numbers under the 'Job ID' column do:
- Look for =netarkivet.dk= on the resulting page. It should not be there
- Click on the link "Show harvest template". Check that the string =netarkivet.dk= appears *only* in the following context:
<map name="http-headers">
<string name="user-agent">Mozilla/5.0 (compatible; heritrix/1.5.0-200506132127+http://netarkivet.dk/website/info.html)</string>
<string name="from">netarkivet-svar@netarkivet.dk</string>
</map>
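For a template saved as text, the "only in this context" check above could be approximated as follows. This is a minimal sketch under stated assumptions: the function name is hypothetical, the template is assumed to contain the http-headers map without nested map elements, and a plain regular expression is used instead of a real XML parser.

```python
import re

# Matches the whole http-headers map, assuming it contains no nested <map> elements.
HTTP_HEADERS_MAP = re.compile(r'<map name="http-headers">.*?</map>', re.DOTALL)


def appears_only_in_http_headers(template_xml, domain="netarkivet.dk"):
    """Return True if `domain` does not occur outside the http-headers map.

    Note that this also returns True when the domain does not occur at all.
    """
    remainder = HTTP_HEADERS_MAP.sub("", template_xml)
    return domain not in remainder
```

A False result indicates that the domain shows up somewhere else in the template, for example among the seeds, which is what this test case is meant to rule out.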
Verify that alias domain =sulnudu.dk= is not harvested
Click 'Definitions' -> 'Snapshot Harvests' in the left menu
Click 'History' in column 6 on the line with snapshot harvest <snh2. name>
- Click 'Show jobs' in column 'Total number of jobs'
- For all numbers under the 'Job ID' column do:
- Look for =sulnudu.dk= on the resulting page.
- Check that =sulnudu.dk= does *not* appear at all
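The per-job check can be sketched as a simple text search once the text of each job page has been saved. The function name and the dictionary layout (job ID mapped to page text) are hypothetical; how the pages are saved is left to the tester. The comparison is a case-insensitive substring match, so a longer name that contains the domain would also be reported.

```python
def jobs_mentioning(domain, pages_by_job_id):
    """Return the sorted job IDs whose page text contains `domain`.

    `pages_by_job_id` is assumed to map each job ID to the text of its job page.
    The match is a case-insensitive substring search.
    """
    needle = domain.lower()
    return sorted(
        job_id
        for job_id, text in pages_by_job_id.items()
        if needle in text.lower()
    )
```

For this test case the expected result is an empty list for =sulnudu.dk=; any job ID returned points to a page that needs to be inspected by hand.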