Task list and timetable for iteration 38

Status

OK/Not Ok

1. Highlights approved

OK

2. Assignment of tasks

OK

3. Task list and time table approved

OK

4. Implementation phase started

OK

5. Release test phase started

OK

6. Assignment phase for next iteration started

OK

7. Iteration 38 completed

Highlights for Iteration

Development procedure

Table of tasks

Tasks for iteration 38. Updated 1. October 2009

Estimate md

Main responsible

Reviewer

Remaining md at 1. October 2009

Comments

Status

Implementation phase (task x-n)

Open Source release + bugs and feature request

Total ?

-

-

Total x

-

Support of Open Source Release

1. Support of released NetarchiveSuite

2

All (Google calender)

Ongoing

2. Implement translateprocess. Adjustment to Open Source partners.

1

KFC

ELZI

. Postponed .

Bugs and Features requests

Prioritized bugs according to list of priority 4 and priority 3 tasks.

Total 5,5

-

-

SubTotal 0

..

-

Priority 5 bugs

Module Archive: Bug 1547 Wrong synchronization in the IndexRequestServer and the FileBasedCache let two processes generate Index at the same time, and one of them fails

2

KFC

SVC

0

OK

Patch 3.8.2 release test

2

TLR

All

0

OK

Priority 4 bugs

Module Harvester: Bug 1073 resubmitting jobs redirects the browser to the list of all jobs

2

HBK

SVC

0

OK

Module Archive: Bug 1721 Batch timeout is not configurable

2

HBK

SVC

0

OK

Module Common: Bug 555 JMS connections cannot reconnect. Bug 1218 Exception while adding listeners to JMSConnection. Bug 1299 Network I/O errors shuts down JMSConnection. Bug 1645 JMS connections very unstable. Bug 1275 The message limit (maxNumMsgs) of 100000 has been reached.

2

KFC

JOLF

0

OK

Module Harvester:: Bug 1690 Keep track of order XML changes

2

KFC

SVC

0

OK

Module Harvester: Bug 1172 password protected domain was not harvested

1,5

CSR

JOLF

0

OK

Module Harvester: Bug 1336 Harvester job dies suddenly

2

HBK

SVC

0

Invalid

OK.

Module Harvester:: Bug 1174 Poor error message on dead job

0

CSR

JOLF

0

This should be fixed by fixing bug 1188. No further work is required

OK

Module Harvester:: Bug 1188 Heritrix side exceptions on JMX calls are ignored

3

CSR

JOLF

0

OK

Module Harvester:: Bug 1680 Broad harvest stability (Job fail)

?

Andreas

SVC

0

SVC will close this bug as it is a symptom on other bugs.

OK

Module Common:: Bug 1661 Too many warnings logged when looking up Heritrix running state

?

KFC

SVC

0

OK

Module Archive:: Bug 1719 Batch Job cannot instantiate loaded class

?

KFC

SVC

0

Invalid

OK

Priority 3 bugs

Module Harvester:: Bug 688 hosts-report should be IDNA decoded when writing harvestInfo to the DB

2

We will need a domain name normalizer that both unmangles IDNA names and lowercases. This will take more than 1 MD. This and 596 must be solved together

. Postponed .

Module Harvester:: Bug 1069 How to setup an apache proxy used to control access to the GUI and viewerproxy servers is missing from the Installation manual

?

. Postponed .

Module Archive:: Bug 1260 Too much and wrong feedback information on "Missing pages"

1,5

This bug will automatically be solved if we chose to implement feature request #1380 "Avoid double initiations of commands by doubble click"

. Postponed .

Module Archive:: Bug 1193 Exceptions from FileBatchJob stop batch job processing

?

. Postponed .

Module Harvester:: Bug 1729 Remove use of deprecated ARCWriter.write() method

?

KFC

JOLF

OK

Module Harvester:: Bug 1730 The prefix to the messages is thrown away

?

KFC

JOLF

OK

Prioritized Feature Requests according to list of priority 4 and priority 3 tasks

Total 21,5

-

-

SubTotal 0,5

-

Priority 4 Feature request

Module Harvester: Feature request 1298 Set JMXConnection timeout, if possible '

2

KFC

JOLF

0

...

OK

Module Harvester: Feature request 1678 Make CDX-entries for the deduplicate entries in the crawl.log, and append to the other CDX-entries. Analysis

8

CSR

SVC

0

..

OK

Module Common: Feature request 1687 French translation.

2

Sara

KFC

0

Postponed

Module Common: Feature request 1750 Italian translation.

2

Eleonora

SVC

0

OK

Module Harvester: Feature request 1688 Monitoring broad crawls.

5

Sara

SVC

0

OK. SVC reviewed assignment. Release test not dependent on this task

Module Harvester: Feature request 1689 Managing crawls using object number.

?

Nicolas

KFC

OK

Module Harvester:' Feature request 1641 It should be possible to turn off deduplication completely.

2

KFC

Nicolas

0

OK.

Module Harvester:' Feature request 1691 Configure which Heritrix reports to include in metadata ARC file.

?

Nicolas

SVC

0

OK

Priority 3 Feature request

Module Access:' Feature request 623 We need to normalize URLs when browsing data

5

Lighter solution

. Postponed .

Module Harvester:' Feature request 680 Cannot browse harvested password protected material

10

At least partly solved by wayback. Investigations by collections sections ongoing.

. Postponed .

Module Documentation:' Feature request 1288 Batch and and use of Tools must be described

?

Postponed ..

Module Harvester:' Feature request 1066 Show whether seed URL existed

2,5

. Postponed .

Module Harvester:' Feature request 1112 Automatic checks of seeds when entered in the harvest definition interface

?

. Postponed .

Module Harvester:' Feature request 1120 Crawlertrap info should be shareable between institutions

?

. Postponed .

Module Archive:' Feature request 1285 Storage of processed batch classes

?

. Postponed .

Module Harvester:' Feature request 1482 Harvest information for job must report if there are problems in getting information

?

. Postponed .

Module Harvester:' Feature request 1511 Thousand separators requested in user interface

1

HBK

KFC

OK

Module Harvester:' Feature request 1681 Add seed to DB via webservice (via Browser Extension/Rich Client)

?

Andreas

. Postponed .

Module Harvester:' Feature request 1682 Statistics (DB access, scripts, batch jobs ....)

?

Andreas

. Postponed .

Module Harvester:' Feature request 1683 Util for regenerate admin.data file

?

Andreas

. Postponed .

Module Harvester:' Feature request 1684 Activity when domain is to be crawled. One table for seed

?

Andreas

. Postponed .

Module None' Feature request 1677 Enable WARC file writing and handling in the NetarchiveSuite

?

Soeren

. Postponed .

Module None' Feature request 1116 Global crawlertraps

?

Soeren

. Postponed .

..

..

..

Roadmap tasks

Total 52?

-

-

Total 2

-

Tasks from ...

Task Access 2.2 Ant target for deployable wayback

2

CSR

JOLF

0

OK

Task Access 2.3 Create a PROPER version of NetarchiveResourceStore

2

HBK

CSR

1

Unit tested

Sanity test phase. Needs not to be completed before release test

Assignment for enhanced QA tools

2

KFC

SVC

0

Postponed

Finalize Assigment for Harvester for support of WARC format

?

SVC

KFC

0

. Postponed .

Finalize assignment for Assignment group B.2.2

0,5

JOLF

KFC

0

In progress

Implement Assignment B.2.2a - Generalise replica to include all checksum voters

14?

JOLF

KFC

0

OK

Implement Assignment B.2.2b - Store bit preservation information in a database

8

JOLF

KFC

0

OK

Implement Assignment B.2.3 - Use segments in bitarchives

6

JOLF

KFC

0

.Postponed .

Implement Assignment B.2.4 - Write BitPreservation scheduler

5

JOLF

KFC

0

.Postponed .

Implement Assignment B.2.5 - Write BitPreservation webinterface

6

JOLF

KFC

0

.Postponed .

Finalize assignment for Assignment group B.4.4 - Yet more better infrastructure

2

0

.Postponed .

..

..

Wayback/Nutchwax tasks independent of NetarchiveSuite code-freeze.

Total x

-

-

Total x

-

Tasks from ...

..

5

..

Task Access 2.4 Deduplicated CDX Indexing (Technical investigation)

1

CSR

SVC

Postponed

Evaluation of NutchWax.

2?

HBK

CSR

In progress

Technical decision on type of production HW for Wayback and Nutchwax.

2?

CSR

CLO

. In progress .

..

Converting old Web collections to Netarchive.dk. See proposal. These task will be independent of NetarchiveSuite code-freeze.

Total x

-

-

Total x

-

Tasks from ...

Investigation in dataformat as well as methods

?

SVC

HBK

..

Generic converter prototype

?

HBK

SVC

.. Done Committed in CVS needs code review

Old KB Webarchive

?

SVC

HBK

..

Old Webarchive harvested with ARC-Httrack

?

HBK

SVC

.. Under dev.

Old Webarchive harvested with Wget

?

HBK

SVC

..

Old Webarchive harvested with NedLib

?

SVC

HBK

..

Old Webarchive from Niels Brugger in waf format

?

HBK

JOLF

.. Done Waiting for Code Review

Old Webarchive from Kurt Vest Nielsen (Ingeniøren from 1995)

?

JOLF

HBK

..

Webarchive from the library of The Danish Parliament

?

SVC

HBK

..

Old Webarchives from Net-papers

?

SVC

HBK

..

Digital publications of The Danish Law Gazette from the missing period

?

SVC

HBK

..

..

..

..

Common tasks calculated as implementation tasks

Total x

-

-

Total 2

-

Others

Total x

-

-

SubTotal x

-

Setup of new KB test system

TLR

SVC

0

.In progress.

Setup open Crucible server

KFC

SVC

0

. Postpone .

Prepare release test

Total x

-

-

SubTotal 1

-

Prepare release test

6

TLR

0

OK

Available man-days for implementation phase

Total x

-

-

Total 6

-

Release test phase (task ...)

Release test

Total x

-

-

Total 6

-

Execute release test.

6

TLR

6

. In progress .

'

..

Release notes

Total x

-

-

Total 0,5

-

Write Release Notes

0,5

KFC

0,5

Awaiting end of release test

Available man-days for release test phase

Total x

-

-

Total 7,5

-

Assignment phase for next iteration (task ...)

Component bug/feature fix/management

QA

..

Define goals for Iteration 39 task list

CHH

..

Presentation of goals and tasks for Iteration 39. Achieve a common understanding of the purpose of the iteration and each task on status meeting

SVC

..

Assignment of tasks, bugs and feature request

QA

..

Update release test procedure

TLR

..

Available man-days for assigment phase

Total x

-

-

Total x

-

Timetable

Timetable iteration 38. Updated 29. September 2009

Start time

End time

Responsible

Baseline 3. August 2009. Start time

Baseline 3. August 2009. End time

1. Implementation of decided tasks

3. August 2009

30. September 2009

3. August 2009

21. September 2009'

2. Code freeze. Create the build for release test and notify when build is ready

1. Oktober 2009

KFC

21. September 2009'

3. Release test

1. Oktober 2009

5. Oktober 2009

TLR

21. September 2009 '

22. September 2009'

4. Code unfreeze

6. Oktober 2009

KFC

23. September 2009'

5. Assignments, bug components and bug fixes

1. Oktober 2009

5. Oktober 2009

23. September 2009 '

25. September 2009'

Iteration38TaskList (last edited 2010-08-16 10:25:16 by localhost)