⇤ ← Revision 1 as of 2010-06-14 12:40:08
1008
Comment:
|
1197
|
Deletions are marked like this. | Additions are marked like this. |
Line 3: | Line 3: |
Step 1: Install system Step 2: upload heritrix template downloaded from https://gforge.statsbiblioteket.dk/plugins/scmsvn/viewcvs.php/*checkout*/trunk/harvestdefinitionbasedir/order_templates_dist/default_withftp.xml?content-type=text%2Fplain&rev=1396&root=netarchivesuite with name default_orderxml_withftp |
Step 1: Install a QUICSTART system |
Line 8: | Line 5: |
After having replaced USERNAME and PASSWORD the paragraph |
Step 2: Go to Definitions->Edit harvest template Step 3: Download https://gforge.statsbiblioteket.dk/plugins/scmsvn/viewcvs.php/*checkout*/trunk/harvestdefinitionbasedir/order_templates_dist/default_withftp.xml?content-type=text%2Fplain&rev=1396&root=netarchivesuite with name default_orderxml_withftp to your desktop Step 4: Change USERNAME and PASSWORD in the downloaded file |
Line 13: | Line 14: |
replaced USERNAME and PASSWORD with anonymous and your email-adress save the file |
|
Line 14: | Line 17: |
with username and password to the ftp.server sbftp.statsbiblioteket.dk [according to information given by BJA in a mail from March 24 2010] |
Step 4: Upload the new heritrix template with name default_orderxml_withftp_klid |
Line 17: | Line 19: |
Step 5: Create a new harvest e.g. ftp_klid using the domain klid.dk and save | |
Line 18: | Line 21: |
Add seedlist to domain statsbiblioteket.dk with content: | Step 6: Edit the harvest by clicking on the domain klid.dk and save |
Line 20: | Line 23: |
ftp://sbftp.statsbiblioteket.dk/PDF | Step 7: Edit the defaultconfig and change the harvest template to default_orderxml_withftp_klid |
Line 22: | Line 25: |
Change the default configuration to use the newly uploaded heritrix template. | Step 8: Edit the seed list and replace the klid.dk with ftp://ftp.klid.dk/OpenOffice/haandbog and save |
Line 24: | Line 27: |
Step 9: Activate the harvest | |
Line 25: | Line 29: |
Step 3: make selective harvest with the domain statsbiblioteket.dk Step 4: Activate this harvest Step 5: Verify, that 14 PDF-files have been harvested, and can be accessed. |
Step 10: Verify, that 1 PDF-file has been harvested by accesinng it using viewer-proxy |
Releasetest af FTP-harvesting
Step 1: Install a QUICSTART system
Step 2: Go to Definitions->Edit harvest template
Step 3: Download https://gforge.statsbiblioteket.dk/plugins/scmsvn/viewcvs.php/*checkout*/trunk/harvestdefinitionbasedir/order_templates_dist/default_withftp.xml?content-type=text%2Fplain&rev=1396&root=netarchivesuite with name default_orderxml_withftp to your desktop
Step 4: Change USERNAME and PASSWORD in the downloaded file
<string name="username">USERNAME</string> <string name="password">PASSWORD</string> replaced USERNAME and PASSWORD with anonymous and your email-adress save the file
Step 4: Upload the new heritrix template with name default_orderxml_withftp_klid
Step 5: Create a new harvest e.g. ftp_klid using the domain klid.dk and save
Step 6: Edit the harvest by clicking on the domain klid.dk and save
Step 7: Edit the defaultconfig and change the harvest template to default_orderxml_withftp_klid
Step 8: Edit the seed list and replace the klid.dk with ftp://ftp.klid.dk/OpenOffice/haandbog and save
Step 9: Activate the harvest
Step 10: Verify, that 1 PDF-file has been harvested by accesinng it using viewer-proxy