Release test of FTP harvesting

Step 1: Install a QUICKSTART system with the setting "settings.viewerproxy.tryLookupUriAsFtp" set to true:

    <viewerproxy>
        <baseDir>viewerproxy</baseDir>
        <tryLookupUriAsFtp>true</tryLookupUriAsFtp>
    </viewerproxy>
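
Before starting the system, the setting can be checked with a short script. The following is a minimal sketch and not part of NetarchiveSuite: the function name ftp_lookup_enabled, the helper local_name and the sample document (including its <settings> root element) are assumptions made for illustration. Tag matching ignores any XML namespace, so it should work whether or not the settings file declares one.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample; a real settings file contains many more elements.
SAMPLE_SETTINGS = """\
<settings>
    <viewerproxy>
        <baseDir>viewerproxy</baseDir>
        <tryLookupUriAsFtp>true</tryLookupUriAsFtp>
    </viewerproxy>
</settings>
"""


def local_name(tag):
    """Strip a '{namespace}' prefix from an ElementTree tag, if present."""
    return tag.rsplit("}", 1)[-1]


def ftp_lookup_enabled(settings_xml):
    """Return True if viewerproxy/tryLookupUriAsFtp is set to 'true'."""
    root = ET.fromstring(settings_xml)
    for element in root.iter():
        if local_name(element.tag) != "viewerproxy":
            continue
        for child in element:
            if local_name(child.tag) == "tryLookupUriAsFtp":
                return (child.text or "").strip().lower() == "true"
    # Setting not found: treat it as disabled.
    return False


if __name__ == "__main__":
    print(ftp_lookup_enabled(SAMPLE_SETTINGS))
```

To check a real installation, read the contents of your settings file into a string and pass it to ftp_lookup_enabled.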

Step 1a: When setting up the proxy in the browser, remember to choose the same port for all protocols. Otherwise the browser will still use the live internet for the FTP protocol.

Step 2: Go to Definitions->Edit harvest template

Step 3: Download the following file to your desktop, saving it with the name default_orderxml_withftp: https://gforge.statsbiblioteket.dk/plugins/scmsvn/viewcvs.php/*checkout*/trunk/harvestdefinitionbasedir/order_templates_dist/default_withftp.xml?content-type=text%2Fplain&rev=1396&root=netarchivesuite

Step 4: Change USERNAME and PASSWORD in the downloaded file. In the lines

    <string name="username">USERNAME</string>
    <string name="password">PASSWORD</string>

replace USERNAME with anonymous and PASSWORD with your email address, then save the file.
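
The substitution in Step 4 can also be scripted. This is a minimal sketch under stated assumptions: the function name set_ftp_credentials and the email address tester@example.org are hypothetical, and the sample string contains only the two placeholder lines rather than the full downloaded template.

```python
from xml.sax.saxutils import escape

# Only the two placeholder lines from the template; the real file is longer.
SAMPLE_TEMPLATE = (
    '<string name="username">USERNAME</string>\n'
    '<string name="password">PASSWORD</string>\n'
)


def set_ftp_credentials(template, username, password):
    """Replace the USERNAME and PASSWORD placeholders in an order template."""
    filled = template.replace(
        '<string name="username">USERNAME</string>',
        '<string name="username">%s</string>' % escape(username),
    )
    filled = filled.replace(
        '<string name="password">PASSWORD</string>',
        '<string name="password">%s</string>' % escape(password),
    )
    return filled


if __name__ == "__main__":
    # Anonymous FTP: user "anonymous", email address as password.
    print(set_ftp_credentials(SAMPLE_TEMPLATE, "anonymous", "tester@example.org"))
```

The values are XML-escaped before insertion, so characters such as & in a password would not break the template.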

Step 4a: Upload the new Heritrix template with the name default_orderxml_withftp

Step 5: Create a new harvest, e.g. ftp_klid, using the domain klid.dk, and save it

Step 6: Edit the harvest by clicking on the domain klid.dk and save

Step 7: Edit the defaultconfig and change the harvest template to default_orderxml_withftp, and set maximum objects to 200

Step 8: Edit the seed list, replace klid.dk with ftp://ftp.klid.dk/OpenOffice/haandbog, and save

Step 9: Activate the harvest

Step 10: Verify that one PDF file has been harvested by accessing it through the viewer proxy: ftp://ftp.klid.dk/OpenOffice/haandbog/Haandbog-2-2.pdf
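
The check in Step 10 can be approximated outside the browser. The sketch below is an illustration only, using Python's standard urllib: the function names same_port_proxies, looks_like_pdf and fetch_via_proxy are hypothetical, and the host and port of the viewer proxy must be supplied by you. It routes http, https and ftp requests through the same proxy port, mirroring Step 1a, and then tests whether the returned bytes begin with the PDF signature.

```python
import urllib.request

PDF_URL = "ftp://ftp.klid.dk/OpenOffice/haandbog/Haandbog-2-2.pdf"


def same_port_proxies(host, port):
    """Map every protocol to the same proxy host and port (see Step 1a)."""
    proxy = "http://%s:%d" % (host, port)
    return {"http": proxy, "https": proxy, "ftp": proxy}


def looks_like_pdf(data):
    """Return True if the bytes start with the PDF file signature."""
    return data.startswith(b"%PDF-")


def fetch_via_proxy(url, host, port, timeout=30):
    """Fetch a URL through the given proxy and return the response body."""
    handler = urllib.request.ProxyHandler(same_port_proxies(host, port))
    opener = urllib.request.build_opener(handler)
    with opener.open(url, timeout=timeout) as response:
        return response.read()
```

With the viewer proxy running, a call such as looks_like_pdf(fetch_via_proxy(PDF_URL, "localhost", 8076)) would be expected to return True after a successful harvest; localhost and 8076 are placeholders for your own proxy host and port.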

TEST14 (last edited 2011-02-22 13:36:37 by SoerenCarlsen)