Task list and timetable for iteration 40

Status

OK/Not Ok

1. Highlights approved

2. Assignment of tasks

3. Task list and time table approved

4. Implementation phase started

5. Release test phase started

6. Assignment phase for next iteration started

7. Iteration 40 completed

Highlights for Iteration

Development procedure

Table of tasks

Tasks for iteration 40. Updated 21. October 2009

Estimate md

Main responsible

Reviewer

Remaining md at 21. October 2009

Comments

Status

Implementation phase (task x-n)

Open Source release + bugs and feature request

Total 3

-

-

Total 3

-

Support of Open Source Release

1. [http://kb-prod-udv-001.kb.dk/twiki/bin/view/Netarkiv/SupportNetarchiveSuite Support] of released NetarchiveSuite

2

All (Google calender)

2

Ongoing

2. Implement translateprocess. Adjustment to Open Source partners.

1

-

Bugs and Features requests

Prioritized bugs according to [https://gforge.statsbiblioteket.dk/tracker/index.php?group_id=7&atid=105 list] of priority 4 and priority 3 tasks.

Total 5

-

-

SubTotal 5

..

-

Priority 4 bugs

3. Module Access: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1758 Bug 1758] UrlCanonicalizerFactory falls back to default value silently

0,5

CSV

SVC

0,5

4. Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1777 Bug 1777] Add event seeds only accepts a very short list of seeds

3

HBK

SVC

3

Priority 3 bugs

5. Module Access: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1796 Bug 1796] Configurations manual doesn't include anything about setting up wayback

6. Module Access: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=823 Bug 823] No index = Internal server error

7. Module Monitor: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1756 Bug 1756] JMX status page does not update when a new application is started on previously used JMX port

8. Module Monitor: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1782 Bug 1782] Same datetime repeated many times, while logging batch checksum of files

9. Module Monitor: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1779 Bug 1779] Improve documentation of the additional tools

10. Module Monitor: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1764 Bug 1764] Poor information on failed batch job

11. Module Monitor: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1732 Bug 1732] LocalArcRepositoryClient not documented

12. Module Monitor: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1727 Bug 1727] Poor error message in RunBatch

13. Module Monitor: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1708 Bug 1708] bitpreservation logic offers "add to archive" for file that is not in either location

14. Module Monitor: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1705 Bug 1705] Make jmxremote.access writable before overwriting it (install script)

15. Module Monitor: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1619 Bug 1619] Potential NullPointer exception in RemoveAndGetFileMessage.getData()

16. Module Monitor: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1260 Bug 1260] Too much and wrong feedback information on "Missing pages"

17. Module Monitor: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1205 Bug 1205] Security policy for unit tests contains hardcoded path to development environment

18. Module Monitor: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1193 Bug 1193] Exceptions from FileBatchJob stop batch job processing

19.

..

Prioritized Feature Requests according to [:TaskTableFromMay2009Workshop:list] of priority 4 and priority 3 tasks

Total 21

-

-

SubTotal 21

-

Priority 4 Feature request

20. Module harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1116 Feature request 1116] Global crawlertraps

2

CSR

SVC

2

-

21. Module harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1696 Feature request 1696] Ingest domain seed URLs

5

Nicolas

SVC

5

-

22. Module Monitor: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1688 Feature request 1688] Monitoring broad crawls.

5

Nicolas

SVC

5

.

-

23. Module archive: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1736 Feature request 1736] Monitoring batchjobs

2

JOLF

CSR

2

.

-

24. Module harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1757 Feature request 1757] Need a way to remove an application from lists of monitored applications

2

HBK

JOLF

2

.

-

Priority 3 Feature request

25.Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1774 Feature request 1774] Stop using the JMS queues for queuing snapshot harvests

26. Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1681 Feature request 1681] Add seed to DB via webservice (via Browser Extension/Rich Client)

?

Andreas

?

27. Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1682 Feature request 1682] Statistics (DB access, scripts, batch jobs ....)

?

Andreas

?

28. Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1683 Feature request 1683] Util for regenerate admin.data file

?

Andreas

?

29. Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1684 Feature request 1684] Activity when domain is to be crawled. One table for seed

?

Andreas

?

30. Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1743 Feature request 1743] When accessing Bitpreservation this takes really long time

?

Andreas

?

31. Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1120 Feature request 1120] Crawlertrap info should be shareable between institutions

?

Andreas

?

32. Module Harvester: [https://gforge.statsbiblioteket.dk/tracker/index.php?func=detail&aid=1066 Feature request 1066] Show whether seed URL existed

?

Andreas

?

33

..

34

..

Roadmap tasks

Total 52?

-

-

Total 8,5

-

Tasks from ...

35. Review of WARC Tools Phase III SRS

5

SVC

HBK,CSR,TLR

36. Assignment for enhanced QA tools

2

37. Finalize [:AssignmentHarvester2:Assigment] for Harvester for support of WARC format

?

38. Finalize assignment for [:AssignmentGroupB2:Assignment group B.2.2]

0,5

39. Implement [:AssignmentGroupB2:Assignment B.2.2a] - Generalise replica to include all checksum voters

8

JOLF

CSR

2

40. Implement [:AssignmentGroupB2:Assignment B.2.2b] - Store bit preservation information in a database

8

41. Implement [:AssignmentGroupB2:Assignment B.2.3] - Use segments in bitarchives

6

42. Implement [:AssignmentGroupB2:Assignment B.2.4] - Write BitPreservation scheduler

5

43. Implement [:AssignmentGroupB2:Assignment B.2.5] - Write BitPreservation webinterface

6

44. Finalize assignment for [:AssignmentGroupB4:Assignment group B.4.4] - Yet more better infrastructure

2

..

..

Wayback/Nutchwax tasks independent of NetarchiveSuite code-freeze.

Total x

-

-

Total x

-

Tasks from ...

-

36. Evaluation of NutchWax.

2?

HBK

CSR

Awaiting review

37. Technical decision on type of production HW for Wayback and Nutchwax.

2?

CSR

CLO

In progress

..

Converting old Web collections to Netarchive.dk. See [http://udvikling.kb.dk/cvsshadow/digiliv/ProjektDokumenter/omkostninger%20ved%20indsamling%20af%20gammelt%20materiale-3.doc proposal]. These task will be independent of NetarchiveSuite code-freeze.

Total x

-

-

Total x

-

Tasks from ...

38. Investigation in dataformat as well as methods

?

SVC

HBK

In progress

39. Generic converter prototype

?

HBK

SVC

.. Done Committed in CVS needs code review

40. Old KB Webarchive

?

SVC

HBK

In progress

41. Old Webarchive harvested with ARC-Httrack

?

HBK

SVC

.. Under dev.

42. Old Webarchive harvested with Wget

?

HBK

SVC

In progress

43. Old Webarchive harvested with NedLib

?

SVC

HBK

In progress

44. Old Webarchive from Niels Brugger in waf format

?

HBK

JOLF

.. Done Waiting for Code Review

45. Old Webarchive from Kurt Vest Nielsen (Ingeniøren from 1995)

?

JOLF

HBK

Postponed

46. Webarchive from the library of The Danish Parliament

?

SVC

HBK

Postponed

47. Old Webarchives from Net-papers

?

SVC

HBK

Postponed

48. Digital publications of The Danish Law Gazette from the missing period

?

SVC

HBK

Postponed

49. Old Webarchive from Niels Brugger collected by HTTrack

?

HBK

SVC

Postponed

..

..

Common tasks calculated as implementation tasks

Total x

-

-

Total x

-

Others

Total x

-

-

SubTotal 2

-

50. Setup of new KB test system

2

TLR

SVC

2

In progress

..

Prepare release test

Total x

-

-

SubTotal 12

-

51. Prepare [http://netarchive.dk/suite/Iteration40Releasetest release test]

6

1

In progress

Available man-days for implementation phase

Total x

-

-

Total x

-

Release test phase (task ...)

Release test

Total x

-

-

Total 12

-

52. Execute [http://netarchive.dk/suite/Iteration40Releasetest release test].

12

TLR

All

12

Awaiting code freeze

'

..

Release notes

Total x

-

-

Total 0,5

-

53. Write release note

0,5

KFC

Awaiting end of code freeze

Available man-days for release test phase

Total x

-

-

Total 10

-

Assignment phase for next iteration (task ...)

54. Component bug/feature fix/management

QA

..

55. Define goals for [http://netarchive.dk/suite/Iteration41TaskList Iteration 41 task list]

CHH

..

56. Presentation of goals and tasks for Iteration 40. Achieve a common understanding of the purpose of the iteration and each task on status meeting

SVC

..

57. Assignment of tasks, bugs and feature request

QA

..

58. Update release test procedure

TLR

..

Available man-days for assigment phase

Total x

-

-

Total 22

-

Timetable

Timetable iteration 40. Updated 13. November 2009

Start time

End time

Responsible

Baseline 13. November 2009. Start time

Baseline 13. November 2009. End time

1. Implementation of decided tasks

16. November 2009

11. December 2009

16. November 2009

11. December 2009

2. Code freeze. Create the build for release test and notify when build is ready

14. December 2009

KFC

14. December 2009

3. Release test

14. December 2009

16. December 2009

TLR

14. December 2009

16. December 2009

4. Code unfreeze

17. December 2009

KFC

17. December 2009

5. Assignments, bug components and bug fixes

16. December 2009

18. December 2009

16. December 2009

18. December 2009

eration39TaskList here.