Differences between revisions 1 and 2
Revision 1 as of 2010-06-14 12:40:08
Size: 1008
Editor: TueLarsen
Comment:
Revision 2 as of 2010-06-14 14:26:12
Size: 1197
Editor: TueLarsen
Comment:
Line 3: Line 3:
Step 1: Install system
Step 2: upload heritrix template downloaded from
https://gforge.statsbiblioteket.dk/plugins/scmsvn/viewcvs.php/*checkout*/trunk/harvestdefinitionbasedir/order_templates_dist/default_withftp.xml?content-type=text%2Fplain&rev=1396&root=netarchivesuite
with name default_orderxml_withftp
Step 1: Install a QUICSTART system
Line 8: Line 5:
After having replaced USERNAME and PASSWORD
the paragraph
Step 2: Go to Definitions->Edit harvest template

Step 3: Download https://gforge.statsbiblioteket.dk/plugins/scmsvn/viewcvs.php/*checkout*/trunk/harvestdefinitionbasedir/order_templates_dist/default_withftp.xml?content-type=text%2Fplain&rev=1396&root=netarchivesuite
with name default_orderxml_withftp to your desktop

Step 4: Change USERNAME and PASSWORD in the downloaded file
Line 13: Line 14:
replaced USERNAME and PASSWORD with anonymous and your email-adress
save the file
Line 14: Line 17:
with username and password to the ftp.server sbftp.statsbiblioteket.dk
[according to information given by BJA in a mail from March 24 2010]
Step 4: Upload the new heritrix template with name default_orderxml_withftp_klid
Line 17: Line 19:
Step 5: Create a new harvest e.g. ftp_klid using the domain klid.dk and save
Line 18: Line 21:
Add seedlist to domain statsbiblioteket.dk with content: Step 6: Edit the harvest by clicking on the domain klid.dk and save
Line 20: Line 23:
ftp://sbftp.statsbiblioteket.dk/PDF Step 7: Edit the defaultconfig and change the harvest template to default_orderxml_withftp_klid
Line 22: Line 25:
Change the default configuration to use the newly uploaded heritrix template. Step 8: Edit the seed list and replace the klid.dk with ftp://ftp.klid.dk/OpenOffice/haandbog and save
Line 24: Line 27:
Step 9: Activate the harvest
Line 25: Line 29:
Step 3: make selective harvest with the domain statsbiblioteket.dk

Step 4: Activate this harvest
Step 5: Verify, that 14 PDF-files have been harvested, and can be accessed.
Step 10: Verify, that 1 PDF-file has been harvested by accesinng it using viewer-proxy

Release test of FTP harvesting

Step 1: Install a QUICKSTART system

Step 2: Go to Definitions->Edit harvest template

Step 3: Download https://gforge.statsbiblioteket.dk/plugins/scmsvn/viewcvs.php/*checkout*/trunk/harvestdefinitionbasedir/order_templates_dist/default_withftp.xml?content-type=text%2Fplain&rev=1396&root=netarchivesuite and save it to your desktop with the name default_orderxml_withftp

Step 4: Change USERNAME and PASSWORD in the downloaded file

In the lines <string name="username">USERNAME</string> <string name="password">PASSWORD</string> replace USERNAME with anonymous and PASSWORD with your email address, then save the file.
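The manual edit in Step 4 can also be scripted. The following is a minimal sketch, assuming the two placeholder elements appear in the template exactly as quoted above; the function name fill_ftp_credentials and the example email address are hypothetical and not part of NetarchiveSuite.

```python
from xml.sax.saxutils import escape


def fill_ftp_credentials(template_xml: str, username: str, password: str) -> str:
    """Return the template text with the USERNAME/PASSWORD placeholders filled in.

    Assumes the placeholders are written exactly as
    <string name="username">USERNAME</string> and
    <string name="password">PASSWORD</string>.
    """
    placeholders = {
        "username": ("USERNAME", username),
        "password": ("PASSWORD", password),
    }
    result = template_xml
    for name, (placeholder, value) in placeholders.items():
        old = f'<string name="{name}">{placeholder}</string>'
        if old not in result:
            # Fail loudly instead of silently saving an unchanged template.
            raise ValueError(f"placeholder element for {name!r} not found")
        # Escape &, < and > so the edited file stays well-formed XML.
        new = f'<string name="{name}">{escape(value)}</string>'
        result = result.replace(old, new)
    return result


# Example (hypothetical address): anonymous login with an email as password.
# edited = fill_ftp_credentials(original_text, "anonymous", "me@example.org")
```

Reading the downloaded file, passing its text through this function, and writing the result back gives the same file as editing it by hand.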

Step 5: Upload the new heritrix template with the name default_orderxml_withftp_klid

Step 6: Create a new harvest, e.g. ftp_klid, using the domain klid.dk, and save it

Step 7: Edit the harvest by clicking on the domain klid.dk, and save

Step 8: Edit the defaultconfig and change the harvest template to default_orderxml_withftp_klid

Step 9: Edit the seed list, replace klid.dk with ftp://ftp.klid.dk/OpenOffice/haandbog, and save

Step 10: Activate the harvest

Step 11: Verify that 1 PDF file has been harvested by accessing it through the viewer proxy
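Before comparing against the harvest result, it can help to check independently which PDF files the seed directory offers. The sketch below uses Python's standard ftplib for this; it assumes the server still accepts anonymous logins with an email address as password, and the helper names (split_ftp_seed, pdf_names, list_seed_pdfs) are hypothetical. It is only a pre-check and does not reflect how NetarchiveSuite or Heritrix fetch the files.

```python
from ftplib import FTP
from urllib.parse import urlparse


def split_ftp_seed(seed_url: str) -> tuple[str, str]:
    """Split an ftp:// seed URL into (host, path)."""
    parts = urlparse(seed_url)
    if parts.scheme != "ftp" or not parts.hostname:
        raise ValueError(f"not an FTP seed URL: {seed_url!r}")
    return parts.hostname, parts.path or "/"


def pdf_names(entries: list[str]) -> list[str]:
    """Keep only entries ending in .pdf (case-insensitive), sorted."""
    return sorted(e for e in entries if e.lower().endswith(".pdf"))


def list_seed_pdfs(seed_url: str, email: str, timeout: float = 30.0) -> list[str]:
    """Log in anonymously and list the PDF files in the seed directory."""
    host, path = split_ftp_seed(seed_url)
    with FTP(host, timeout=timeout) as ftp:
        ftp.login("anonymous", email)  # email address serves as the password
        return pdf_names(ftp.nlst(path))


# Example (needs network access; the email address is a placeholder):
# print(list_seed_pdfs("ftp://ftp.klid.dk/OpenOffice/haandbog", "me@example.org"))
```

If the listing shows a single PDF file, that is the file the verification step expects to find through the viewer proxy.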

TEST14 (last edited 2011-02-22 13:36:37 by SoerenCarlsen)