Release test of FTP harvesting
Step 1: Install a QUICKSTART system with the setting "settings.viewerproxy.tryLookupUriAsFtp" set to true:
<viewerproxy>
    <baseDir>viewerproxy</baseDir>
    <tryLookupUriAsFtp>true</tryLookupUriAsFtp>
</viewerproxy>
Step 2: Go to Definitions->Edit harvest template
Step 3: Download https://gforge.statsbiblioteket.dk/plugins/scmsvn/viewcvs.php/*checkout*/trunk/harvestdefinitionbasedir/order_templates_dist/default_withftp.xml?content-type=text%2Fplain&rev=1396&root=netarchivesuite to your desktop and save it under the name default_orderxml_withftp
Step 4a: Change USERNAME and PASSWORD in the downloaded file:
<string name="username">USERNAME</string> <string name="password">PASSWORD</string>
Replace USERNAME with anonymous and PASSWORD with your email address, then save the file.
Step 4b: Upload the new Heritrix template with the name default_orderxml_withftp
Step 5: Create a new harvest, e.g. ftp_klid, using the domain klid.dk, and save it
Step 6: Edit the harvest by clicking on the domain klid.dk and save
Step 7: Edit the defaultconfig and change the harvest template to default_orderxml_withftp, and set maximum objects to 200
Step 8: Edit the seed list, replace klid.dk with ftp://ftp.klid.dk/OpenOffice/haandbog, and save
Step 9: Activate the harvest
Step 10: Verify that one PDF file has been harvested by accessing it through the viewer proxy: ftp://ftp.klid.dk/OpenOffice/haandbog/Haandbog-2-2.pdf
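
The two manual XML edits in the steps above (the viewerproxy setting and the USERNAME/PASSWORD placeholders in the order template) are easy to get wrong by hand. The following is a minimal Python sketch of how they could be checked and applied. The function names ftp_lookup_enabled and fill_credentials are hypothetical helpers, not part of NetarchiveSuite, and the sketch assumes the settings fragment carries no XML namespace and that the placeholders appear exactly as shown in the procedure.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape


def ftp_lookup_enabled(settings_xml: str) -> bool:
    """Return True if viewerproxy/tryLookupUriAsFtp is 'true' in the fragment.

    Hypothetical helper; assumes the fragment has no XML namespace.
    """
    root = ET.fromstring(settings_xml)
    # The fragment's root may be <viewerproxy> itself or an enclosing element.
    node = root if root.tag == "viewerproxy" else root.find(".//viewerproxy")
    if node is None:
        return False
    value = node.findtext("tryLookupUriAsFtp", default="")
    return value.strip().lower() == "true"


def fill_credentials(template_xml: str, username: str, password: str) -> str:
    """Replace the USERNAME and PASSWORD placeholders in the order template.

    Hypothetical helper; assumes the placeholders appear exactly as in the
    <string name="username"> and <string name="password"> elements.
    """
    return (
        template_xml
        .replace('<string name="username">USERNAME</string>',
                 f'<string name="username">{escape(username)}</string>')
        .replace('<string name="password">PASSWORD</string>',
                 f'<string name="password">{escape(password)}</string>')
    )
```

For anonymous FTP, fill_credentials would be called with "anonymous" as the user name and your email address as the password, and the result saved as default_orderxml_withftp before uploading it.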