Download and installation

For a quick start, we have prepared a bash shell script that starts all the necessary components on one machine. We will use this script throughout this quickstart manual to allow you to get a feel for what the system can do and how it works without having to deal with issues of distributing to other servers.

Base system required

For the quick startup, NetarchiveSuite requires

To check that you have the right version of Java do the following

Downloading

Download of the newest release is described here

Note: Instead of downloading a NetarchiveSuite.zip, you can also build it yourself from the SVN trunk:

JMS

NetarchiveSuite uses JMS for inter-process communication. JMS is the Java Messaging Service, which provides asynchronous communication between processes. You do not need any knowledge of JMS to use NetarchiveSuite. However, you need to make sure that no JMS broker is already running on your system on port 8100.
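One way to check the port before starting the broker is sketched below. The function name `port_in_use` is a hypothetical helper, not part of NetarchiveSuite, and the check relies on bash's `/dev/tcp` pseudo-device, so it assumes the script is run with bash. It only detects a listener that accepts connections on 127.0.0.1.

```shell
#!/usr/bin/env bash
# Sketch: report whether anything is listening on the JMS broker port.

# Hypothetical helper. Returns 0 if a TCP connection to 127.0.0.1:<port>
# succeeds, non-zero otherwise.
port_in_use() {
    local port="$1"
    # The subshell opens the socket on fd 3 and closes it again on exit.
    (exec 3<>"/dev/tcp/127.0.0.1/${port}") 2>/dev/null
}

if port_in_use 8100; then
    echo "Port 8100 is in use: stop the existing broker before continuing."
else
    echo "Port 8100 is free."
fi
```

A listener bound only to another network interface would not be found by this check.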

Currently only the open-source version of Sun's JMS implementation is supported, since some functionality of other implementations does not match our assumptions well.

To download and install it, do the following:

To start it, do the following:

Configuration

Assuming a release zip file of NetarchiveSuite (NetarchiveSuite*.zip) is available in the directory ~/netarchive, you must do the following to configure NetarchiveSuite for your system:

RunNetarchiveSuite.sh

deploy_standalone_example.xml

The first file is a simple script that performs all the deployment steps. It takes a NetarchiveSuite package ('.zip'), a configuration file (the second file), and a temporary installation directory as arguments, in that order.

In the configuration file, all the applications are placed on one machine (the current machine, localhost). When the script is run with the installation name USER, it unpacks the zip file to ~/netarchive/USER and then deploys it to /home/test/USER on localhost. If you would rather deploy to an existing username on localhost, edit deploy_standalone_example.xml and change "test" in the following two lines to, for example, your own login name.

       <deployInstallDir>/home/test</deployInstallDir>
       <deployMachineUserName>test</deployMachineUserName>
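The change to these two lines can also be scripted. The sketch below uses `sed`; the function name `retarget_deploy_config` and the output file name `deploy_standalone_mine.xml` are hypothetical, and it assumes that the new user's home directory is /home/<login> and that the two lines appear exactly as shown above.

```shell
#!/usr/bin/env bash
# Sketch: rewrite the two deploy settings for another login name.

# Hypothetical helper. Reads a deploy configuration on stdin and writes the
# adjusted copy to stdout. Assumes the home directory is /home/<login>.
retarget_deploy_config() {
    local login="$1"
    sed -e "s|<deployInstallDir>/home/test</deployInstallDir>|<deployInstallDir>/home/${login}</deployInstallDir>|" \
        -e "s|<deployMachineUserName>test</deployMachineUserName>|<deployMachineUserName>${login}</deployMachineUserName>|"
}

# Example: write an adjusted copy next to the original, using the current login.
if [ -f deploy_standalone_example.xml ]; then
    retarget_deploy_config "$(id -un)" \
        < deploy_standalone_example.xml > deploy_standalone_mine.xml
fi
```

Writing to a copy keeps the original example file available for comparison; pass the copy to RunNetarchiveSuite.sh instead of the original.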

If the installation user is different from the 'test' user, check that a Sun JVM is in the path for both users (rather than the GNU Java compiler, which is the default on some Linux distributions). If you already have a USER installation, note that the existing bitarchive, database and admin.data files will be left untouched. You must explicitly remove any previous USER installation if you want a clean, empty installation.
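A rough way to spot a GNU Java on the path is sketched below; run it once as each user. The helper `looks_like_gnu_java` is hypothetical, and the check rests on the assumption that the GNU implementation identifies itself with "gij" or "libgcj" in its version banner. Treat a negative result as a hint, not proof that a Sun JVM is in use.

```shell
#!/usr/bin/env bash
# Sketch: warn if the "java" on the PATH looks like GNU's gij.

# Hypothetical helper. Returns 0 if the given version banner mentions
# gij or libgcj (an assumption about how GNU Java identifies itself).
looks_like_gnu_java() {
    printf '%s\n' "$1" | grep -qiE 'gij|libgcj'
}

if command -v java >/dev/null 2>&1; then
    banner="$(java -version 2>&1)"   # java prints its version banner on stderr
    if looks_like_gnu_java "$banner"; then
        echo "java on the PATH looks like GNU gij; put a Sun JVM first in the PATH."
    else
        echo "java on the PATH does not look like GNU gij:"
        echo "$banner"
    fi
else
    echo "No java found on the PATH."
fi
```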

For example (use "USER" as the installation name to keep things simple):

cd /home/test/netarchive
bash RunNetarchiveSuite.sh NetarchiveSuite.zip deploy_standalone_example.xml USER/
# If your ssh keys are not set up correctly, you will need to log in several times before the installation finishes successfully. You must also have permission to ssh and scp to localhost (try e.g. "ssh localhost" and "scp somefile localhost:").
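Whether ssh to localhost works without a prompt can be checked before running the script. The sketch below uses the OpenSSH client options `BatchMode` and `ConnectTimeout`; the function name `can_ssh_noninteractively` is a hypothetical helper, and the check covers ssh only, not scp.

```shell
#!/usr/bin/env bash
# Sketch: check that ssh to a host works without a password prompt.

# Hypothetical helper. BatchMode=yes makes ssh fail instead of asking for a
# password or passphrase; ConnectTimeout keeps the check from hanging.
can_ssh_noninteractively() {
    ssh -o BatchMode=yes -o ConnectTimeout=5 "$1" true >/dev/null 2>&1
}

if can_ssh_noninteractively localhost; then
    echo "ssh to localhost works without prompting."
else
    echo "ssh to localhost needs attention: try 'ssh localhost' by hand first."
fi
```

If the check fails, logging in once by hand usually shows whether the cause is a missing key, an unconfirmed host key, or a permission problem.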

The script creates a deployment folder named "USER" in e.g. /home/test/netarchive, which contains scripts for starting and stopping NetarchiveSuite, and then starts the whole NetarchiveSuite. It deploys the installation locally to /home/test/USER.

  • Start a web browser, e.g. with  $ firefox  Note that the browser must be started on the same machine that the simple harvest script is run on.

  • Set up the browser to use a proxy on port 8070 and to exclude localhost and the hostname (used by the Heritrix GUI), e.g. in Firefox:

Choose in the firefox toolbar:
Edit->Preferences->Advanced->Network->Settings
Checkmark:
Manual Proxy Configuration
and add:
Proxy: localhost
Port: 8070
No Proxy for: localhost, kb-test-way-001.kb.dk
  • Enter the following URL in the started browser: http://localhost:8074/HarvestDefinition

  • You can now see the web interface in the browser.
  • You can now create, run and browse as described in the following sections or in the User Manual.
  • If you want to stop and start the entire NetarchiveSuite system, run:

cd /home/test/netarchive/USER
./killall_NATIONAL_LIBRARY.sh
./startall_NATIONAL_LIBRARY.sh
  • If you want to try other deploy examples, see "Examples of deploy configuration files" in the Installation Manual.