1395
Comment:
|
1538
|
Deletions are marked like this. | Additions are marked like this. |
Line 2: | Line 2: |
The !NetarchiveSuite software is developed by the two national deposit libraries in Denmark, [[http://www.kb.dk/|The Royal Library]] and [[http://www.statsbiblioteket.dk|The State and University Library]], and has been running in production, harvesting the Danish world wide web since 2005. The Danish netarchive currently contains over 160 TB of data that are mirrored on two different geographical locations. | |
Line 3: | Line 4: |
The !NetarchiveSuite is the complete web archiving software package developed within the netarchive.dk project from 2004 and onwards. The primary function of the !NetarchiveSuite is to plan, schedule and run web harvests of parts of the Internet. The !NetarchiveSuite is built around the Heritrix web crawler and is scalable to national domain level crawls as well as built for small selective and thematic harvests. The software has built-in bit preservation functionality as well as the overall architecture is distributed among machines and geographical locations. For more information please refer to ["Overview"] |
The !NetarchiveSuite is the complete web archiving software package developed within the netarchive.dk project from 2004 and onwards. The primary function of the !NetarchiveSuite is to plan, schedule and run web harvests of parts of the Internet. It scales to a wide range of tasks, from small, thematic harvests (e.g. related to special events, or special domains) to harvesting and archiving the content of an entire national domain. The software has built-in bit preservation functionality. The systems architecture allows for the software to be distributed among several machines, possibly on more than one geographical location. The !NetarchiveSuite is built around the Heritrix web crawler, which it uses to harvest the web. You find more information in the [[Overview 3.10|overview]]. |
Line 9: | Line 6: |
[[Include(News)]] | <<Include(News)>> |
Line 11: | Line 8: |
To get started trying out the software in a simple setup. This should contain all needed information to get a simple test system up and running on a standard linux machine. Please refer to ["Get NetarchiveSuite"] and use the [:QuickStart Manual:Quick Start Manual] for the installation. Everything is released with full source under the LGPL license. |
To get started with !NetarchiveSuite, [[Get NetarchiveSuite|download]] it and try it out with our [[Quick Start Manual|Quick Start]] installation setup, which only requires one standard Linux machine. |
Line 14: | Line 10: |
The !NetarchiveSuite software was developed by the two national deposit libraries in Denmark, [http://www.kb.dk/ The Royal Library] and [http://www.statsbiblioteket.dk The State and University Library], and has been running in production, harvesting the Danish world wide web for two years. The Danish netarchive currently contains over 30 TB of data. | The software is released with full source under the LGPL license. |
Welcome to the NetarchiveSuite
The NetarchiveSuite software is developed by the two national deposit libraries in Denmark, The Royal Library and The State and University Library, and has been running in production, harvesting the Danish world wide web since 2005. The Danish netarchive currently contains over 160 TB of data that are mirrored on two different geographical locations.
The NetarchiveSuite is the complete web archiving software package developed within the netarchive.dk project from 2004 and onwards. The primary function of the NetarchiveSuite is to plan, schedule and run web harvests of parts of the Internet. It scales to a wide range of tasks, from small, thematic harvests (e.g. related to special events, or special domains) to harvesting and archiving the content of an entire national domain. The software has built-in bit preservation functionality. The systems architecture allows for the software to be distributed among several machines, possibly on more than one geographical location. The NetarchiveSuite is built around the Heritrix web crawler, which it uses to harvest the web. You find more information in the overview.
News about issues related to the NetarchiveSuite and this web is given below (To see earlier news, please refer to Old News)
Date |
News |
14/12 2011 |
Stable release 3.18.0 has been released. See release notes and download page |
08/09 2011 |
Development release 3.17.0 has been released. See release notes and download page |
28/06 2011 |
Stable release 3.16.1 has been released. See release notes and download page |
11/05 2011 |
Stable release 3.16.0 has been released. See release notes and download page |
01/03 2011 |
Development release 3.15.0 has been released. See release notes and download page |
16/02 2011 |
Stable release 3.14.1 has been released. See release notes and download page |
12/11 2010 |
Stable release 3.14.0 has been released. See release notes and download page |
15/09 2010 |
Stable release 3.12.2 has been released. See release notes and download page |
08/09 2010 |
Development release 3.13.1 has been released. See release notes and download page |
06/07 2010 |
Stable release 3.12.1 has been released. See release notes and download page |
15/06 2010 |
Development release 3.13.0 has been released. See release notes and download page |
03/05 2010 |
Stable release 3.12.0 has been released. See release notes and download page |
22/12 2009 |
Development release 3.11.0 has been released. See release notes and download page |
16/11 2009 |
Stable release 3.10.0 has been released. See release notes and download page |
10/9 2009 |
Stable release 3.8.2 has been released. See release notes and download page |
10/8 2009 |
Development release 3.9.0 has been released. See release notes and download page |
15/7 2009 |
Stable release 3.8.1 has been released. This is a patch release. See release notes and download page |
To get started with NetarchiveSuite, download it and try it out with our Quick Start installation setup, which only requires one standard Linux machine.
The software is released with full source under the LGPL license.