Differences between revisions 22 and 23
Revision 22 as of 2009-09-02 13:05:25
Size: 4640
Editor: TueLarsen
Comment:
Revision 23 as of 2009-09-02 13:07:59
Size: 4083
Editor: TueLarsen
Comment:
Deletions are marked like this. Additions are marked like this.
Line 4: Line 4:
  * Verify that you have all the needed software installed by installing the QuickStart according to https://netarchive.dk/suite/Quick_Start_Manual e.g. in /home/test/netarchive by starting the Quickstart
  * Shutdown the QuickStart according to the QuickStart Manual

 
* Download following attached files to e.g. /home/test/netarchive:
 * Verify that you have all the needed software installed by installing the QuickStart according to https://netarchive.dk/suite/Quick_Start_Manual e.g. in /home/test/netarchive by starting the Quickstart
 * Shutdown the QuickStart according to the QuickStart Manual
 * Download following attached files to e.g. /home/test/netarchive:
Line 13: Line 11:
The first script is a simple script for doing all the steps during deployment.
It takes a !NetarchiveSuite package ('.zip'), a configuration file (the second file), and a temporary installation directory as arguments (in the given order).
The first script is a simple script for doing all the steps during deployment. It takes a !NetarchiveSuite package ('.zip'), a configuration file (the second file), and a temporary installation directory as arguments (in the given order).
Line 16: Line 13:
In the configuration file all the applications are placed on one machine (e.g. the current machine, ~+{{{localhost}}}+~).
This gives the same kind of instance as the !QuickStart, though this is configured and installed through the Deploy module.
If run directly it is installed and run from the directory ~+{{{/home/test/USER}}}+~.
In the configuration file all the applications are placed on one machine (e.g. the current machine, ~+{{{localhost}}}+~). This gives the same kind of instance as the !QuickStart, though this is configured and installed through the Deploy module. If run directly it is installed and run from the directory ~+{{{/home/test/USER}}}+~.
Line 31: Line 26:
Line 34: Line 28:
  * Set your browser to proxy according to the QuickStart Manual - now on port 8070
  * Choose the URL e.g. http://dia-test-int-01.kb.dk:8074/HarvestDefinition/
  * You can now create, run and browse  according to the QuickStart - or User Manual
 * Set your browser to proxy according to the QuickStart Manual - now on port 8070
 * Choose the URL e.g. http://dia-test-int-01.kb.dk:8074/HarvestDefinition/
 * You can now create, run and browse according to the QuickStart - or User Manual
Line 39: Line 32:

In the following are two examples of configuration files for deploy.
The first two requires adaptation to your own system before use.
In the following are two examples of configuration files for deploy. The first two requires adaptation to your own system before use.
Line 45: Line 36:
The instance with two replicas divided over two physical locations.
Each physical locations contain several machines. Bitarchive machines, harvester machine and viewerproxy machine.
Only one physical location has an administator machine, which contains the GUI application, the Bitarchive monitors and the arc repository.
The instance with two replicas divided over two physical locations. Each physical locations contain several machines. Bitarchive machines, harvester machine and viewerproxy machine. Only one physical location has an administator machine, which contains the GUI application, the Bitarchive monitors and the arc repository.
Line 50: Line 39:
Line 53: Line 41:
This is the instance with only one replica and one physical location.
It is very close to the first example, just with one replica removed.
This is the instance with only one replica and one physical location. It is very close to the first example, just with one replica removed.
Line 59: Line 46:
Line 61: Line 47:
Line 68: Line 53:
Duplicate the existing harvester <applicationName> definition within <deployMachine>
Line 69: Line 55:
Duplicate the existing harvester <applicationName> definition within
<deployMachine>
In the new duplicate harvester config, change all following duplicate values to new unique values within <deployMachine>:
Line 72: Line 57:
In the new duplicate harvester config, change all following duplicate values to new unique values
within <deployMachine>:
<applicationInstanceId> <common><jmx><port> and <rmiPort> <heritrix><guiport> and <jmxPort> <serverDir>harvester_high_2</serverDir>
Line 75: Line 59:
<applicationInstanceId>
<common><jmx><port> and <rmiPort>
<heritrix><guiport> and <jmxPort>
<serverDir>harvester_high_2</serverDir>


and set
<queuePriority>HIGHPRIORITY</queuePriority>
and set <queuePriority>HIGHPRIORITY</queuePriority>
Line 86: Line 63:
            <applicationName name="dk.netarkivet.harvester.harvesting.HarvestControllerApplication">
                <settings>
                    <common>
                       
<applicationInstanceId>high2</applicationInstanceId>
                        <jmx>
                           
<port>8112</port>
                           
<rmiPort>8212</rmiPort>
                        </jmx>
                    </common>
                    <harvester>
                        <harvesting>
                           
<queuePriority>HIGHPRIORITY</queuePriority>
                            <heritrix>
                               
<guiPort>8192</guiPort>
                               
<!-- T: jmxPort to be modified by test (was 8093) -->
                               
<jmxPort>8193</jmxPort>
                                 <jmxUsername>controlRole</jmxUsername>
                                
<jmxPassword>R_D</jmxPassword>
                            </heritrix>
                           
<serverDir>harvester_high_2</serverDir>
                        </harvesting>
                    </harvester>
                </settings>
           
</applicationName>
 . <applicationName name="dk.netarkivet.harvester.harvesting.HarvestControllerApplication">
  . <settings>
   . <common>
    .
<applicationInstanceId>high2</applicationInstanceId> <jmx>
     .
<port>8112</port> <rmiPort>8212</rmiPort>
    </jmx>
   </common> <harvester>
    . <harvesting>
     .
<queuePriority>HIGHPRIORITY</queuePriority> <heritrix>
      .
<guiPort>8192</guiPort> <!-- T: jmxPort to be modified by test (was 8093) --> <jmxPort>8193</jmxPort>
       . <jmxUsername>controlRole</jmxUsername> <jmxPassword>R_D</jmxPassword>
     </heritrix> <serverDir>harvester_high_2</serverDir>
    </harvesting>
   </harvester>
  </settings>
</applicationName>

Easy Installation of NetarchiveSuite

Action(edit)

attachment:RunNetarchiveSuite.sh

attachment:deploy_example_one_machine.xml

The first script is a simple script for doing all the steps during deployment. It takes a NetarchiveSuite package ('.zip'), a configuration file (the second file), and a temporary installation directory as arguments (in the given order).

In the configuration file all the applications are placed on one machine (e.g. the current machine, localhost). This gives the same kind of instance as the QuickStart, though this is configured and installed through the Deploy module. If run directly it is installed and run from the directory /home/test/USER.

Below, you find other deploy examples. ( They have to be modfied to your environment)

E.g.

#stop the Quickstart
cd /home/test/netarchive/scripts/simple_harvest/killhard.sh
cd /home/test/netarchive
bash RunNetarchiveSuite.sh NetarchiveSuite.zip deploy_example_one_machine.xml USER/
#if you have not setup your ssh keygen correctly, you need to login some times before the installation finish successfully

The script creates a "USER" folder in e.g. /home/test , which contains e.g. methods for starting and stopping NetarchiveSuite and starts the whole NetarchiveSuite.

Examples of deploy configuration files

In the following are two examples of configuration files for deploy. The first two requires adaptation to your own system before use.

attachment:deploy_example.xml

The instance with two replicas divided over two physical locations. Each physical locations contain several machines. Bitarchive machines, harvester machine and viewerproxy machine. Only one physical location has an administator machine, which contains the GUI application, the Bitarchive monitors and the arc repository.


attachment:deploy_example_single.xml

This is the instance with only one replica and one physical location. It is very close to the first example, just with one replica removed.


A running HW/SW setup example from June 2009 for Netarkivet.dk


http://netarchive.dk/suite/Installation_Manual_3.10?action=AttachFile&do=view&target=HW_SW_production_example.txt


How to add a harvester more on the same harvester machine and set all to only support HIGHPRIORITY selective harvesting in e.g. deploy_example.xml

Duplicate the existing harvester <applicationName> definition within <deployMachine>

In the new duplicate harvester config, change all following duplicate values to new unique values within <deployMachine>:

<applicationInstanceId> <common><jmx><port> and <rmiPort> <heritrix><guiport> and <jmxPort> <serverDir>harvester_high_2</serverDir>

and set <queuePriority>HIGHPRIORITY</queuePriority>

e.g.:

  • <applicationName name="dk.netarkivet.harvester.harvesting.HarvestControllerApplication">

    • <settings>

      • <common>

        • <applicationInstanceId>high2</applicationInstanceId> <jmx>

          • <port>8112</port> <rmiPort>8212</rmiPort>

          </jmx>

        </common> <harvester>

        • <harvesting>

          • <queuePriority>HIGHPRIORITY</queuePriority> <heritrix>

            • <guiPort>8192</guiPort> <!-- T: jmxPort to be modified by test (was 8093) --> <jmxPort>8193</jmxPort>

              • <jmxUsername>controlRole</jmxUsername> <jmxPassword>R_D</jmxPassword>

            </heritrix> <serverDir>harvester_high_2</serverDir>

          </harvesting>

        </harvester>

      </settings>

    </applicationName>

Installation Manual 3.10/AppendixC (last edited 2010-08-16 10:24:05 by localhost)