Differences between revisions 8 and 9
Revision 8 as of 2009-10-05 13:13:20
Size: 3087
Editor: TueLarsen
Comment:
Revision 9 as of 2009-11-09 08:42:00
Size: 3056
Comment:
Deletions are marked like this. Additions are marked like this.
Line 1: Line 1:
Start a web browser by e.g. $ mozilla Note that it is important that the browser is started on the same machine as the RunNetarchiveSuite.sh is run on
Line 2: Line 3:
Start a web browser by e.g. $ mozilla Note that it is important that the browser is started on the same machine as the RunNetarchiveSuite.sh is run on setup the browser to proxy on port 8070 and exclude localhost e.g. in mozilla:
Line 4: Line 5:
setup the browser to proxy on port 8070 and exclude localhost e.g. in mozilla:

Choose in the mozilla toolbar:
Edit->Preferences->Advanced->Proxies
Checkmark:
Manual Proxy Configuration
and add:
Proxy: localhost
Port: 8070
No Proxy for: localhost
Write following url in the started browser http://localhost:8074/HarvestDefinition
Choose in the mozilla toolbar: Edit->Preferences->Advanced->Proxies Checkmark: Manual Proxy Configuration and add: Proxy: localhost Port: 8070 No Proxy for: localhost Write following url in the started browser http://localhost:8074/HarvestDefinition
Line 19: Line 9:
Forward the access port and set browser to be local forward port '''Forward the access port and set browser to be local forward port: '''
Line 23: Line 13:
Write the following in a prompt
<verbatim>
ssh -g -N -L$PORT:kb-test-acs-001.kb.dk:$PORT kb-test-acs-001.kb.dk
</verbatim>
Note that the shell will hang after this command.
The ssh must be killed after the test (by &lt;CTRL&gt;C) 
Write the following in a prompt  <verbatim> ssh -g -N -L$PORT:kb-test-acs-001.kb.dk:$PORT kb-test-acs-001.kb.dk </verbatim> Note that the shell will hang after this command. The ssh must be killed after the test (by <CTRL>C)
Line 34: Line 19:
Internet Explorer: Go to =Tools -> Internet options= choose =Connections= and click =LAN settings=. Mark "Use proxy with …" and insert "kb-prod-udv-001.kb.dk" in 'Address' with 'Port:' and set the test port (807?). Click 'Advanced' and insert =kb-test-adm-001.kb.dk= under 'Exception' ('Do not use proxy for …')  Internet Explorer: Go to =Tools -> Internet options= choose =Connections= and click =LAN settings=. Mark "Use proxy with …" and insert "kb-prod-udv-001.kb.dk" in 'Address' with 'Port:' and set the test port (807?). Click 'Advanced' and insert =kb-test-adm-001.kb.dk= under 'Exception' ('Do not use proxy for …')
Line 36: Line 21:
<nop>FireFox 2: Go to =General->Settings=, choose =Connection= and click =Settings=. Mark 'Manual proxy configuration:' and insert "kb-prod-udv-001.kb.dk" in 'HTTP Proxy' and set 'Port:' to the test port (807?). Mark ‘Use for all protocols’ and insert text “=kb-test-adm-001.kb.dk=” in 'No proxy for'  <nop>FireFox 2: Go to =General->Settings=, choose =Connection= and click =Settings=. Mark 'Manual proxy configuration:' and insert "kb-prod-udv-001.kb.dk" in 'HTTP Proxy' and set 'Port:' to the test port (807?). Mark ‘Use for all protocols’ and insert text “=kb-test-adm-001.kb.dk=” in 'No proxy for'
Line 38: Line 23:
<nop>FireFox 3: Go to =Tools->Settings=, choose =Advanced->Network= and click =Settings...=. Mark 'Manual proxy configuration:' and insert "kb-prod-udv-001.kb.dk" in 'HTTP Proxy' and set 'Port:' to the test port (807?). Mark ‘Use this profor all protocols’ and insert text “=kb-test-adm-001.kb.dk, kb-prod-udv-001.kb.dk=” in 'No proxy for'  <nop>FireFox 3: Go to =Tools->Settings=, choose =Advanced->Network= and click =Settings...=. Mark 'Manual proxy configuration:' and insert "kb-prod-udv-001.kb.dk" in 'HTTP Proxy' and set 'Port:' to the test port (807?). Mark ‘Use this profor all protocols’ and insert text “=kb-test-adm-001.kb.dk, kb-prod-udv-001.kb.dk=” in 'No proxy for'
Line 43: Line 28:
Line 53: Line 37:
Total handled: 88 
Duplicates found: 0 0.0% 
Bytes total: 6391852 (6.1 MB) 
Bytes discarded: 0 (0 0.0% 
New (no hits): 88 
Exact hits: 0 
Equivalent hits: 0 
Total handled: 88
Duplicates found: 0 0.0%
Bytes total: 6391852 (6.1 MB)
Bytes discarded: 0 (0 0.0%
New (no hits): 88
Exact hits: 0
Equivalent hits: 0

Start a web browser by e.g. $ mozilla Note that it is important that the browser is started on the same machine as the RunNetarchiveSuite.sh is run on

setup the browser to proxy on port 8070 and exclude localhost e.g. in mozilla:

Choose in the mozilla toolbar: Edit->Preferences->Advanced->Proxies Checkmark: Manual Proxy Configuration and add: Proxy: localhost Port: 8070 No Proxy for: localhost Write following url in the started browser http://localhost:8074/HarvestDefinition

If you are a netarchive tester:

Forward the access port and set browser to be local forward port:

Do the following on kb-prod-udv-001.kb.dk as a test user:

Write the following in a prompt <verbatim> ssh -g -N -L$PORT:kb-test-acs-001.kb.dk:$PORT kb-test-acs-001.kb.dk </verbatim> Note that the shell will hang after this command. The ssh must be killed after the test (by <CTRL>C)

Set browser (Mozilla, Internet Explorer or <nop>FireFox) to be local forward port.

Mozilla: Go to =preferences -> advanced.proxies= and activate 'manual proxy-settings', 'HTTP-proxy:' is set to "kb-prod-udv-001.kb.dk" and the belonging 'Port:' is set to the test port (807?). 'No proxy for' is set to "kb-test-adm-001.kb.dk"

Internet Explorer: Go to =Tools -> Internet options= choose =Connections= and click =LAN settings=. Mark "Use proxy with …" and insert "kb-prod-udv-001.kb.dk" in 'Address' with 'Port:' and set the test port (807?). Click 'Advanced' and insert =kb-test-adm-001.kb.dk= under 'Exception' ('Do not use proxy for …')

<nop>FireFox 2: Go to =General->Settings=, choose =Connection= and click =Settings=. Mark 'Manual proxy configuration:' and insert "kb-prod-udv-001.kb.dk" in 'HTTP Proxy' and set 'Port:' to the test port (807?). Mark ‘Use for all protocols’ and insert text “=kb-test-adm-001.kb.dk=” in 'No proxy for'

<nop>FireFox 3: Go to =Tools->Settings=, choose =Advanced->Network= and click =Settings...=. Mark 'Manual proxy configuration:' and insert "kb-prod-udv-001.kb.dk" in 'HTTP Proxy' and set 'Port:' to the test port (807?). Mark ‘Use this profor all protocols’ and insert text “=kb-test-adm-001.kb.dk, kb-prod-udv-001.kb.dk=” in 'No proxy for'

Click on the JobID for your finished snapshot harvest in the Job status overview

Click on "Browse reports for jobs"

Click on the "processors-report" e.g. "metadata://netarkivet.dk/crawl/reports/processors-report.txt?heritrixVersion=1.14.3&harvestid=1&jobid=1"

Check that there is no deduplicator processors-report like this one:

Total handled: 88
Duplicates found: 0 0.0%
Bytes total: 6391852 (6.1 MB)
Bytes discarded: 0 (0  0.0%
New (no hits): 88
Exact hits: 0
Equivalent hits: 0
......

It38CheckHarvestNotDeduplicated (last edited 2010-08-16 10:24:54 by localhost)