Differences between revisions 4 and 5
Revision 4 as of 2009-07-24 06:21:44
Size: 48877
Comment:
Revision 5 as of 2009-07-27 08:43:17
Size: 48675
Comment:
Line 39: Line 39:
||<tablewidth="100%"bgcolor="#cccccc" style="TEXT-ALIGN: center">'''Tasks for iteration 38. Updated 24. July 2009''' ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''Estimate md''' ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''Main responsible''' ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''Reviewer ''' ||<10% bgcolor="#cccccc" style="TEXT-ALIGN: center">'''Remaining md at 10. June 2009''' ||<20% bgcolor="#cccccc" style="TEXT-ALIGN: center">'''Comments''' ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''Status''' || ||<tablewidth="100%"bgcolor="#cccccc" style="TEXT-ALIGN: center">'''Tasks for iteration 38. Updated 27. July 2009''' ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''Estimate md''' ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''Main responsible''' ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''Reviewer ''' ||<10% bgcolor="#cccccc" style="TEXT-ALIGN: center">'''Remaining md at 27. July 2009''' ||<20% bgcolor="#cccccc" style="TEXT-ALIGN: center">'''Comments''' ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''Status''' ||
Line 51: Line 51:
||'''''Module Common:''''' [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=555 Bug 555] JMS connections cannot reconnect. [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1218 Bug 1218]'' ''Exception while adding listeners to JMSConnection. [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1275 Bug 1299]'' ''Network I/O errors shuts down JMSConnection. [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1645 Bug 1645]'' ''JMS connections very unstable''. ''[https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1275 Bug 1275]'' ''The message limit (maxNumMsgs) of 100000 has been reached. ||<style="TEXT-ALIGN: center">? ||<style="TEXT-ALIGN: center">KFC ||<style="TEXT-ALIGN: center">SVC ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center"> ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">Wait for code review ||
||'''''Module Archive:''''' [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1566 Bug 1566] Several Deduplicating processes started by error - only one should be possible ||<style="TEXT-ALIGN: center">0 ||<style="TEXT-ALIGN: center">SVC ||<style="TEXT-ALIGN: center">KFC ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center">We consider this bug fixed, as there now is a method to avoid this bug in the future by using the dk.netarkivet.archive.tools.CreateIndex script ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">Fixed ||
||'''''Module Common:''''' [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=555 Bug 555] JMS connections cannot reconnect. [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1218 Bug 1218]'' ''Exception while adding listeners to JMSConnection. [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1275 Bug 1299]'' ''Network I/O errors shuts down JMSConnection. [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1645 Bug 1645]'' ''JMS connections very unstable''. ''[https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1275 Bug 1275]'' ''The message limit (maxNumMsgs) of 100000 has been reached. ||<style="TEXT-ALIGN: center">? ||<style="TEXT-ALIGN: center">KFC ||<style="TEXT-ALIGN: center">SVC ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center"> ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">-- ||
||'''''Module Archive:''''' [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1566 Bug 1566] Several Deduplicating processes started by error - only one should be possible ||<style="TEXT-ALIGN: center">0 ||<style="TEXT-ALIGN: center">SVC ||<style="TEXT-ALIGN: center">KFC ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center">We consider this bug fixed, as there now is a method to avoid this bug in the future by using the dk.netarkivet.archive.tools.CreateIndex script ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">-- ||
Line 54: Line 54:
||<style="VERTICAL-ALIGN: top">'''''Module Harvester:''' ''[https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1172 Bug 1172]'' ''password protected domain was not harvested ||<style="TEXT-ALIGN: center">1,5 ||<style="TEXT-ALIGN: center">CSR ||<style="TEXT-ALIGN: center">JOLF ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center"> ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''Sanity Tested, awaiting QA ''' ||
||<style="VERTICAL-ALIGN: top">'''''Module Harvester:''' [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1336 Bug 1336] Harvester job dies suddenly '' ||<style="TEXT-ALIGN: center">2 ||<style="TEXT-ALIGN: center">HBK ||<style="TEXT-ALIGN: center">SVC ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center">Waiting for code review #NS-51 for revision #862 ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''Wait for code review''' ||
||<style="VERTICAL-ALIGN: top">'''''Module Harvester:''' ''[https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=928 Bug 928]'' ''The guess of initial size of unharvested domains is very bad on harvests with a large object limit ||<style="TEXT-ALIGN: center">1 ||<style="TEXT-ALIGN: center">HBK ||<style="TEXT-ALIGN: center">SVC ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center">Waiting for code review #NS-55+NS-58 for revision #866 ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''Wait for code review''' ||
||<style="VERTICAL-ALIGN: top">'''''Module Harvester:''': ''[https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1656 Bug 1656]'' ''WARNING: Aborting crawl because og inactivity. URLS's in queue:19''". '' ||<style="TEXT-ALIGN: center">2 ||<style="TEXT-ALIGN: center">HBK ||<style="TEXT-ALIGN: center">SVC ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center">This bug cannot be reproduced. Therefore closed! ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''Wait for code review''' ||
||<style="VERTICAL-ALIGN: top">'''''Module Harvester:''': ''[https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1174 Bug 1174]'' ''Poor error message on dead job ||<style="TEXT-ALIGN: center">0 ||<style="TEXT-ALIGN: center">CSR ||<style="TEXT-ALIGN: center">JOLF ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center">This should be fixed by fixing bug 1188. No further work is required ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''Wait for code review''' ||
||<style="VERTICAL-ALIGN: top">'''''Module Harvester:''': ''[https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1188 Bug 1188]'' ''Heritrix side exceptions on JMX calls are ignored ||<style="TEXT-ALIGN: center">3 ||<style="TEXT-ALIGN: center">CSR ||<style="TEXT-ALIGN: center">JOLF ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center"> ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''Wait for code review''' ||
||<style="VERTICAL-ALIGN: top">'''''Module Harvester:''' ''[https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1172 Bug 1172]'' ''password protected domain was not harvested ||<style="TEXT-ALIGN: center">1,5 ||<style="TEXT-ALIGN: center">CSR ||<style="TEXT-ALIGN: center">JOLF ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center"> ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''--''' ||
||<style="VERTICAL-ALIGN: top">'''''Module Harvester:''' [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1336 Bug 1336] Harvester job dies suddenly '' ||<style="TEXT-ALIGN: center">2 ||<style="TEXT-ALIGN: center">HBK ||<style="TEXT-ALIGN: center">SVC ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center">Waiting for code review #NS-51 for revision #862 ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''--''' ||
||<style="VERTICAL-ALIGN: top">'''''Module Harvester:''' ''[https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=928 Bug 928]'' ''The guess of initial size of unharvested domains is very bad on harvests with a large object limit ||<style="TEXT-ALIGN: center">1 ||<style="TEXT-ALIGN: center">HBK ||<style="TEXT-ALIGN: center">SVC ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center">Waiting for code review #NS-55+NS-58 for revision #866 ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''--''' ||
||<style="VERTICAL-ALIGN: top">'''''Module Harvester:''': ''[https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1656 Bug 1656]'' ''WARNING: Aborting crawl because og inactivity. URLS's in queue:19''". '' ||<style="TEXT-ALIGN: center">2 ||<style="TEXT-ALIGN: center">HBK ||<style="TEXT-ALIGN: center">SVC ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center">This bug cannot be reproduced. Therefore closed! ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''--''' ||
||<style="VERTICAL-ALIGN: top">'''''Module Harvester:''': ''[https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1174 Bug 1174]'' ''Poor error message on dead job ||<style="TEXT-ALIGN: center">0 ||<style="TEXT-ALIGN: center">CSR ||<style="TEXT-ALIGN: center">JOLF ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center">This should be fixed by fixing bug 1188. No further work is required ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''--''' ||
||<style="VERTICAL-ALIGN: top">'''''Module Harvester:''': ''[https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1188 Bug 1188]'' ''Heritrix side exceptions on JMX calls are ignored ||<style="TEXT-ALIGN: center">3 ||<style="TEXT-ALIGN: center">CSR ||<style="TEXT-ALIGN: center">JOLF ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center"> ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''--''' ||
Line 61: Line 61:
||<style="VERTICAL-ALIGN: top">'''''Module Harvester:''': ''[https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1650 Bug 1650]'' ''It is not checked when creating the Heritrix process, that the JMX password file assigned to Heritrix exists ||<style="TEXT-ALIGN: center">? ||<style="TEXT-ALIGN: center">Eleonora ||<style="TEXT-ALIGN: center">SVC ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center">Code to fix this bug implemented, but not yet committed. Unittesting remains ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''Wait for code review''' || ||<style="VERTICAL-ALIGN: top">'''''Module Harvester:''': ''[https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1650 Bug 1650]'' ''It is not checked when creating the Heritrix process, that the JMX password file assigned to Heritrix exists ||<style="TEXT-ALIGN: center">? ||<style="TEXT-ALIGN: center">Eleonora ||<style="TEXT-ALIGN: center">SVC ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center">Code to fix this bug implemented, but not yet committed. Unittesting remains ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''--''' ||
Line 64: Line 64:
||<style="VERTICAL-ALIGN: top">'''''Module Harvester:''': ''[https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1644 Bug 1644]'' ''On Edit Domain page, the text field only shows 21 characters of the domainname ||<style="TEXT-ALIGN: center">? ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center"> ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''Wait for code review ''' || ||<style="VERTICAL-ALIGN: top">'''''Module Harvester:''': ''[https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1644 Bug 1644]'' ''On Edit Domain page, the text field only shows 21 characters of the domainname ||<style="TEXT-ALIGN: center">? ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center"> ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''--''' ||
Line 72: Line 72:
||<style="VERTICAL-ALIGN: top">'''''Module Common''': ''[https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1654 Feature request 1654]'' . ''second-level domains for .at in settings.xml ||<style="TEXT-ALIGN: center">? ||<style="TEXT-ALIGN: center">Andreas ||<style="TEXT-ALIGN: center">SVC ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center"> ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''Fixed ''' || ||<style="VERTICAL-ALIGN: top">'''''Module Common''': ''[https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1654 Feature request 1654]'' . ''second-level domains for .at in settings.xml ||<style="TEXT-ALIGN: center">? ||<style="TEXT-ALIGN: center">Andreas ||<style="TEXT-ALIGN: center">SVC ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center"> ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''--''' ||
Line 74: Line 74:
||<style="VERTICAL-ALIGN: top">'''''Module Harvester''':''''' '''[https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1014 Feature request 1014] No good way to mark a non-reported-stopped job as FAILED or DONE'''.''' ||<style="TEXT-ALIGN: center">2 ||<style="TEXT-ALIGN: center">HBK ||<style="TEXT-ALIGN: center">SVC ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center"> ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''Waiting for Code Review ''' ||
||<style="VERTICAL-ALIGN: top">'''''Module Harvester''':''''' '''[https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1675 Feature request 1675] List of all Seeds of a selective Harvests'''.''' ||<style="TEXT-ALIGN: center">? ||<style="TEXT-ALIGN: center">Andreas ||<style="TEXT-ALIGN: center">SVC ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center"> ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''Fixed ''' ||
||<style="VERTICAL-ALIGN: top">'''''Module Harvester''':''''' '''[https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1014 Feature request 1014] No good way to mark a non-reported-stopped job as FAILED or DONE'''.''' ||<style="TEXT-ALIGN: center">2 ||<style="TEXT-ALIGN: center">HBK ||<style="TEXT-ALIGN: center">SVC ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center"> ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''--''' ||
||<style="VERTICAL-ALIGN: top">'''''Module Harvester''':''''' '''[https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1675 Feature request 1675] List of all Seeds of a selective Harvests'''.''' ||<style="TEXT-ALIGN: center">? ||<style="TEXT-ALIGN: center">Andreas ||<style="TEXT-ALIGN: center">SVC ||<style="TEXT-ALIGN: center"> ||<style="TEXT-ALIGN: center"> ||<bgcolor="#cccccc" style="TEXT-ALIGN: center">'''-- ''' ||

Task list and timetable for iteration 38

Status

OK/Not OK

1. Highlights approved

2. Assignment of tasks

3. Task list and timetable approved

4. Implementation phase started

5. Release test phase started

6. Assignment phase for next iteration started

7. Iteration 38 completed

Highlights for Iteration

  • [http://kb-prod-udv-001.kb.dk/twiki/bin/edit/Netarkiv/SupportNetarchiveSuite Support] of released NetarchiveSuite (http://netarchive.dk/suite).

  • Enhance NetarchiveSuite wiki according to [:UpdateNetarchiveSuiteWiki:decided structure].

  • Fix prioritized bugs according to the [https://gforge.statsbiblioteket.dk/tracker/index.php?group_id=7&atid=105 list] of priority 4 and priority 3 tasks

  • Enhancement of QA
  • Enhancement of Batch support
  • Finalize the support of Wayback in the Netarchive.dk production site. See [:IntegrationOfWaybck:List of tasks] and [:AssignmentWaybackIntegration:Assignment] for Wayback Integration

  • Migration of old Web materials to Netarchive.dk
  • Start of tasks according to roadmap
    • Module Archive

      • Enhanced support for Batch
    • Module Harvester

      • ...
    • Module Access

      • Support for Wayback
      • Test of Nutchwax
    • Module Common

      • ...
  • Bug fixes according to updated prioritized bug list
  • Iteration 38 is planned as a development release candidate.

Development procedure

Table of tasks

Tasks for iteration 38. Updated 27. July 2009

Estimate md

Main responsible

Reviewer

Remaining md at 27. July 2009

Comments

Status

Implementation phase (task x-n)

Open Source release + bugs and feature requests

Total ?

-

-

Total x

-

Support of Open Source Release

1. [http://kb-prod-udv-001.kb.dk/twiki/bin/view/Netarkiv/SupportNetarchiveSuite Support] of released NetarchiveSuite

2

All (Google calendar)

Ongoing

2. Implement translation process. Adjustment to Open Source partners.

1

KFC

ELZI

..

Bugs and feature requests

Prioritized bugs according to [https://gforge.statsbiblioteket.dk/tracker/index.php?group_id=7&atid=105 list] of priority 4 and priority 3 tasks.

Total 5,5

-

-

SubTotal x

..

-

Priority 4 bugs

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1254 Bug 1254] Database connections to MySQL close down intermittently?

2

Nicolas

SVC

..

--

Module Archive: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1254 Bug 1694] LocalArcRepositoryClient is broken

1

Nicolas

SVC

--

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1254 Bug 1628/1695] Add custom JVM parameters to Heritrix subprocess

1

Nicolas

SVC

--

Module Common: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=555 Bug 555] JMS connections cannot reconnect. [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1218 Bug 1218] Exception while adding listeners to JMSConnection. [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1275 Bug 1299] Network I/O errors shuts down JMSConnection. [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1645 Bug 1645] JMS connections very unstable. [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1275 Bug 1275] The message limit (maxNumMsgs) of 100000 has been reached.

?

KFC

SVC

--

Module Archive: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1566 Bug 1566] Several Deduplicating processes started by error - only one should be possible

0

SVC

KFC

We consider this bug fixed, as there is now a way to avoid it in the future by using the dk.netarkivet.archive.tools.CreateIndex script.

--

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1690 Bug 1690] Keep track of order XML changes

?

KFC

SVC

..

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1172 Bug 1172] password protected domain was not harvested

1,5

CSR

JOLF

--

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1336 Bug 1336] Harvester job dies suddenly

2

HBK

SVC

Waiting for code review #NS-51 for revision #862

--

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=928 Bug 928] The guess of initial size of unharvested domains is very bad on harvests with a large object limit

1

HBK

SVC

Waiting for code review #NS-55+NS-58 for revision #866

--

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1656 Bug 1656] "WARNING: Aborting crawl because og inactivity. URLS's in queue:19".

2

HBK

SVC

This bug cannot be reproduced. Therefore closed!

--

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1174 Bug 1174] Poor error message on dead job

0

CSR

JOLF

This should be fixed by fixing bug 1188. No further work is required

--

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1188 Bug 1188] Heritrix side exceptions on JMX calls are ignored

3

CSR

JOLF

--

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1680 Bug 1680] Broad harvest stability (Job fail)

?

Andreas

SVC

..

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1650 Bug 1650] It is not checked when creating the Heritrix process, that the JMX password file assigned to Heritrix exists

?

Eleonora

SVC

Code to fix this bug implemented, but not yet committed. Unit testing remains.

--

Priority 3 bugs

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=688 Bug 688] hosts-report should be IDNA decoded when writing harvestInfo to the DB

2

We will need a domain name normalizer that both decodes IDNA names and lowercases them. This will take more than 1 md. This bug and bug 596 must be solved together.

..

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1644 Bug 1644] On Edit Domain page, the text field only shows 21 characters of the domainname

?

--

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1670 Bug 1670] Default timeout settings are set way too low in the default settings

?

..

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1069 Bug 1069] How to setup an apache proxy used to control access to the GUI and viewerproxy servers is missing from the Installation manual

?

..

Module Archive: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1260 Bug 1260] Too much and wrong feedback information on "Missing pages"

1,5

This bug will automatically be solved if we choose to implement feature request #1380 "Avoid double initiations of commands by doubble click".

..

Module Archive: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1193 Bug 1193] Exceptions from FileBatchJob stop batch job processing

?

..

Prioritized Feature Requests according to [:TaskTableFromMay2009Workshop:list] of priority 4 and priority 3 tasks

Total 21,5

-

-

SubTotal x

-

Priority 4 feature requests

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1298 Feature request 1298] Set JMXConnection timeout, if possible

2

CSR

SVC

...

..

Module Common: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1654 Feature request 1654] Second-level domains for .at in settings.xml

?

Andreas

SVC

--

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1678 Feature request 1678] Make CDX-entries for the deduplicate entries in the crawl.log, and append to the other CDX-entries.

?

CSR

SVC

..

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1014 Feature request 1014] No good way to mark a non-reported-stopped job as FAILED or DONE.

2

HBK

SVC

--

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1675 Feature request 1675] List of all Seeds of a selective Harvests.

?

Andreas

SVC

--

Module Common: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1687 Feature request 1687] French translation.

2

Sara

KFC

BnF: Will be part of next iteration.

..

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1678 Feature request 1678] Make CDX-entries for the deduplicate entries in the crawl.log, and append to the other CDX-entries.

?

CSR

JOLF

..

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1688 Feature request 1688] Monitoring broad crawls.

5

Sara

EZI TbC?

BnF: This is just in the assignment phase.

..

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1689 Feature request 1689] Managing crawls using object number.

?

Nicolas

KFC

BnF: Nicolas will work on it in August

..

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1641 Feature request 1641] It should be possible to turn off deduplication completely.

?

SVC

Nicolas?

BnF: Nicolas will be back in August

..

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1691 Feature request 1691] Configure which Heritrix reports to include in metadata ARC file.

?

Nicolas

KFC

Nicolas will work on it in August

..

Priority 3 feature requests

Module Access: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=623 Feature request 623] We need to normalize URLs when browsing data

5

Lighter solution

..

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=680 Feature request 680] Cannot browse harvested password protected material

10

At least partly solved by Wayback. Investigations by the collections sections are ongoing.

..

Module Documentation: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1288 Feature request 1288] Batch and use of Tools must be described

?

..

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1066 Feature request 1066] Show whether seed URL existed

2,5

..

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1112 Feature request 1112] Automatic checks of seeds when entered in the harvest definition interface

?

..

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1120 Feature request 1120] Crawlertrap info should be shareable between institutions

?

..

Module Archive: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1285 Feature request 1285] Storage of processed batch classes

?

..

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1482 Feature request 1482] Harvest information for job must report if there are problems in getting information

?

..

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1511 Feature request 1511] Thousand separators requested in user interface

?

..

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1681 Feature request 1681] Add seed to DB via webservice (via Browser Extension/Rich Client)

?

Andreas

..

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1682 Feature request 1682] Statistics (DB access, scripts, batch jobs ....)

?

Andreas

..

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1683 Feature request 1683] Util for regenerate admin.data file

?

Andreas

..

Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1684 Feature request 1684] Activity when domain is to be crawled. One table for seed

?

Andreas

..

Module None [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1677 Feature request 1677] Enable WARC file writing and handling in the NetarchiveSuite

?

Soeren

..

Module None [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1116 Feature request 1116] Global crawlertraps

?

Soeren

..

..

..

..

Roadmap tasks

Total 52?

-

-

Total x

-

Tasks from ...

[:AssignmentWaybackIntegration:Task Access 2.1] Wayback Into Version Control

1,5

CSR

KFC

OK

[:AssignmentWaybackIntegration:Task Access 2.2] Ant target for deployable wayback

2

CSR

JOLF

In Production Awaiting QA ?

[:AssignmentWaybackIntegration:Task Access 2.3] Create a PROPER version of NetarchiveResourceStore

5

HBK

CSR

Committed but not tested

Started

Assignment for enhanced QA tools

2

KFC

SVC

..

Finalize [:AssignmentHarvester2:Assignment] for Harvester for support of WARC format

?

SVC

KFC

..

Finalize assignment for [:AssignmentGroupB2:Assignment group B.2.2]

0,5

JOLF

KFC

..

Implement [:AssignmentGroupB2:Assignment B.2.2a] - Generalise replica to include all checksum voters

14?

JOLF

KFC

Started

Implement [:AssignmentGroupB2:Assignment B.2.2b] - Store bit preservation information in a database

8

JOLF

KFC

Started

Implement [:AssignmentGroupB2:Assignment B.2.3] - Use segments in bitarchives

6

..

Implement [:AssignmentGroupB2:Assignment B.2.4] - Write BitPreservation scheduler

5

..

Implement [:AssignmentGroupB2:Assignment B.2.5] - Write BitPreservation webinterface

6

..

Finalize assignment for [:AssignmentGroupB4:Assignment group B.4.4] - Yet more better infrastructure

2

..

..

..

Wayback/Nutchwax tasks independent of NetarchiveSuite code-freeze.

Total x

-

-

Total x

-

Tasks from ...

..

5

..

[:AssignmentWaybackIntegration:Task Access 2.4] Deduplicated CDX Indexing (Technical investigation)

1

..

Evaluation of NutchWax.

2?

..

Technical decision on type of production HW for Wayback and Nutchwax.

2?

..

..

Converting old Web collections to Netarchive.dk. See [http://udvikling.kb.dk/cvsshadow/digiliv/ProjektDokumenter/omkostninger%20ved%20indsamling%20af%20gammelt%20materiale-3.doc proposal]. These tasks will be independent of NetarchiveSuite code-freeze.

Total x

-

-

Total x

-

Tasks from ...

Investigation of data formats as well as methods

?

SVC

HBK

..

Generic converter prototype

?

SVC

HBK

..

Old KB Webarchive

?

SVC

HBK

..

Old Webarchive harvested with ARC-Httrack

?

HBK

SVC

..

Old Webarchive harvested with Wget

?

HBK

SVC

..

Old Webarchive harvested with NedLib

?

SVC

HBK

..

Old Webarchive from Niels Brugger in waf format

?

HBK

JOLF

..

Old Webarchive from Kurt Vest Nielsen (Ingeniøren from 1995)

?

JOLF

HBK

..

Webarchive from the library of The Danish Parliament

?

SVC

HBK

..

Old Webarchives from Net-papers

?

SVC

HBK

..

Digital publications of The Danish Law Gazette from the missing period

?

SVC

HBK

..

..

..

..

Common tasks calculated as implementation tasks

Total x

-

-

Total x

-

Others

Total x

-

-

SubTotal x

-

Setup of new KB test system

TLR

..

Setup open Crucible server

KFC

..

Prepare release test

Total x

-

-

SubTotal x

-

Prepare [http://kb-prod-udv-001.kb.dk/twiki/bin/view/Netarkiv/Iteration37ReleaseTest release test]

..

Available man-days for implementation phase

Total x

-

-

Total x

-

Release test phase (task ...)

Release test

Total x

-

-

Total 10

-

Execute [http://kb-prod-udv-001.kb.dk/twiki/bin/view/Netarkiv/Iteration37ReleaseTest release test].


..


..

Release notes

Total x

-

-

Total 0,5

-

-

Available man-days for release test phase

Total x

-

-

Total 10

-

Assignment phase for next iteration (task ...)

Component bug/feature fix/management

QA

..

Define goals for [http://netarchive.dk/suite/Iteration38TaskList Iteration 39 task list]

CHH

..

Presentation of goals and tasks for Iteration 39. Achieve a common understanding of the purpose of the iteration and of each task at the status meeting

SVC

..

Assignment of tasks, bugs and feature requests

QA

..

Update release test procedure

TLR

..

Available man-days for assignment phase

Total x

-

-

Total 22

-

Timetable

Timetable iteration 38. Updated 24. July 2009

Start time

End time

Responsible

Baseline 2. June 2009. Start time

Baseline 2. June 2009. End time

1. Implementation of decided tasks

10. August 2009

19. September 2009

10. August 2009

19. September 2009

2. Code freeze. Create the build for release test and notify when build is ready

21. September 2009

SVC

21. September 2009

3. Release test

21. September 2009

22. September 2009

TLR

21. September 2009

22. September 2009

4. Code unfreeze

23. September 2009

SVC

23. September 2009

5. Assignments, bug components and bug fixes

23. September 2009

25. September 2009

23. September 2009

25. September 2009

Iteration38TaskList (last edited 2010-08-16 10:25:16 by localhost)