Task list and timetable for iteration 43

Status

OK/Not OK

1. Highlights approved

OK

2. Assignment of tasks

OK

3. Task list and timetable approved

OK

4. Implementation phase started

OK

5. Release test phase started

OK

6. Assignment phase for next iteration started

OK

7. Iteration 43 completed

Highlights for Iteration

Development procedure

Table of tasks

Tasks for iteration 43. Updated 9. June 2010

Estimate md

Main responsible

Reviewer

Remaining md at 9. June 2010

Comments

Status

Implementation phase (task x-n)

Open Source release + bugs and feature requests

Total 3

-

-

Total 3

-

Support of Open Source Release

1. Support of released NetarchiveSuite

2

All (Google calendar)

Ongoing

2. Implement translation process. Adjustment to Open Source partners.

1

CSR

SVC

-

3. Maintain French Translation files.

1

Nicolas/Sara

SVC

See also Task 22

-

4. Maintain Italian and German Translation files.

1

Andreas/Eleonora

SVC

See also Task 22

-

Bugs and Feature requests

Prioritized bugs according to list of priority 4 and priority 3 tasks.

Total 5

-

-

SubTotal 0

..

-

Priority 5 bug

5. Module harvester: Bug 1856 Schedule problem after first start on NAS 3.10.0. No schedule started

1

CSR

SVC

Bug seemingly fixed with patch release 3.10.2

Fixed

6. Module monitor?: Bug 1895 Running checksum gives Garbage Collector OutOfMemoryError and schedule stops

1

SVC

CSR

Max heap for Bitarchive monitors raised to 1936 MB in production. Awaiting upgrade of Java in the production environment.

Priority 4 bugs

7..

.

Priority 3 bugs

8. Module harvester: Feature request 688 hosts-report should be IDNA decoded when writing harvestInfo to the DB

Postponed

9. Module Access: Bug 823 No index = Internal server error

Postponed

10. Module Monitor: Bug 1756 JMX status page does not update when a new application is started on previously used JMX port

Postponed

11. Module Archive: Bug 1782 Same datetime repeated many times, while logging batch checksum of files

Postponed

12. Module Documentation: Bug 1779 Improve documentation of the additional tools

Fixed

13. Module Archive: Bug 1708 bitpreservation logic offers "add to archive" for file that is not in either location

Postponed

14. Module Documentation: Bug 1732 LocalArcRepositoryClient not documented

Fixed in System Design 3.12 manual

Fixed

15. Module Archive: Bug 1260 Too much and wrong feedback information on "Missing pages"

Postponed

16. Module Monitor: Bug 1205 Security policy for unit tests contains hardcoded path to development environment

Postponed

17. Module Archive: Bug 1193 Exceptions from FileBatchJob stop batch job processing

Postponed

..

Prioritized Feature Requests according to list of priority 4 and priority 3 tasks

Total 21

-

-

SubTotal 21

-

Priority 4 Feature request

18. Module Harvester: Feature request 1696 Ingest domain seed URLs

?

Nicolas

SVC

Postponed

19. Module Harvester: Feature request 1688 Monitoring broad crawls.

?

Nicolas/Sara

SVC

http://kb-prod-udv-001.kb.dk:8060/cru/NS-152.

Review and followup done

20. Module Harvester: Feature request 1134 Filter job lists by category

?

Nicolas/Sara

SVC

http://kb-prod-udv-001.kb.dk:8060/cru/NS-151

Review and followup done

21. Module Harvester: Feature request 1668 Paginate and make sortable and searchable the list of jobs

?

Nicolas/Sara

SVC

http://kb-prod-udv-001.kb.dk:8060/cru/NS-151

Review and followup done

21a. Module Harvester: Feature request 1924 Allow to search a domain in active jobs (in case of webmaster complaint)

?

Nicolas/Sara

http://kb-prod-udv-001.kb.dk:8060/cru/NS-151

Review and followup done

21b. Common: Feature request 1925 PostgreSQL connectivity (using the PostgreSQL driver version 8.4 - JDBC 4)

?

Nicolas

http://kb-prod-udv-001.kb.dk:8060/cru/NS-151

Review and followup done

21c. Module Harvester: Feature request 1926 Ability to disable the inactivity check

?

Nicolas

http://kb-prod-udv-001.kb.dk:8060/cru/NS-152

Waiting for review

21d. Module Harvester: Feature request 1927 Delay job end to allow Heritrix report generation

?

Nicolas/Sara

http://kb-prod-udv-001.kb.dk:8060/cru/NS-152

Review and followup done

21e. Module Harvester: Feature request 1928 Ability to easily resubmit a selection of failed jobs

?

Nicolas/Sara

?

Postponed

21f. Module Harvester: Feature request 1929 15 second level TLD related to the .fr and .re domains

?

Nicolas/Sara

Fixed

21g. Module Harvester: Feature request 1930 Ability to implement a different crawl control loop via HeritrixLauncher / new Heritrix JMX controller

?

Nicolas

http://kb-prod-udv-001.kb.dk:8060/cru/NS-152

Review and followup done

22. Module Harvester: Feature request 1813 An extra resubmit button to make it visible which jobs have already been handled

?

SVC

CSR

Postponed

23. Module Harvester: Feature request 929 Documentation needed for how we split jobs (incl. maybe additional splitting modularity)

?

SVC

CSR

Postponed

24. Module Harvester: Feature request 1774 Stop using the JMS queues for queuing snapshot harvests

?

SVC

CSR

Postponed

25. Module Harvester: Feature request 1896 Crawl of password protected FTP-sites

2

SVC

CSR

High Priority

Fixed

25a. Module Harvester: Feature request 1951 Upgrade to Heritrix 1.14.4

2

SVC

CSR

High Priority

Fixed

Priority 3 Feature request

26. Module Harvester: Feature request 1774 Stop using the JMS queues for queuing snapshot harvests

Postponed

27. Module Harvester: Feature request 1681 Add seed to DB via webservice (via Browser Extension/Rich Client)

Andreas

Postponed

28. Module Harvester: Feature request 1682 Statistics (DB access, scripts, batch jobs ....)

Andreas

Postponed

29. Module Harvester: Feature request 1683 Util for regenerate admin.data file

Andreas

Postponed

30. Module Harvester: Feature request 1684 Activity when domain is to be crawled. One table for seed

Andreas

Postponed

31. Module Archive: Feature request 1743 When accessing Bitpreservation this takes really long time

Andreas

Postponed

32. Module Harvester: Feature request 1120 Crawlertrap info should be shareable between institutions

Andreas

SVC will add comments to this FR. An easy solution might be to share crawler traps by emailing files with crawler trap information.

Redundant (Copy of ??)

33. Module Harvester: Feature request 1066 Show whether seed URL existed

Andreas

Postponed

34. Module Archive: Feature request 1809 Write assignment for improving batchjob interface

JOLF

High priority

Done

Roadmap tasks

Total 52?

-

-

Total 8.5

-

Tasks from ...

35. QA: Assignment for enhanced QA tools

2

SVC

CSR

High priority

Postponed

36. WARC: Finalize Assignment for Harvester for support of WARC format

?

Postponed

37. Archive: Implement Assignment B.2.3 - Use segments in bitarchives

6

Postponed

38. Archive: Implement Assignment B.2.4 - Write BitPreservation scheduler

5

Postponed

39. Archive: Implement Assignment B.2.5 - Write BitPreservation webinterface

6

Postponed

-

..

..

Crawl-problems (Netarchive.dk)

Total x

-

-

Total x

-

Focus on following crawl-problems

40. dinby.dk 2009-02-17

1

CSR

SVC

1

High priority

..

41. Kino.dk 2009-03-25

1

CSR

SVC

1

High priority

Awaiting review

42. Webmuseum.re-cph.com 2009-08-04

1

CSR

SVC

1

High priority

Awaiting review

43. Epn.dk 2009-08-30

1

CSR

SVC

1

High priority

..

44. Statstidende.dk

0.5

CSR

SVC

High priority

Awaiting review

45. seoghoer.dk

0.5

CSR

SVC

High priority

Awaiting review

45.b http://netarkivet.dk/netarkivet/index.php?title=Berlingske.dk

2

CSR

High priority

Awaiting review

46. Feature Request 1881 Quality assurance through a batchjob interface

0.5

JOLF

CSR

High priority

In progress

Wayback/Nutchwax tasks independent of NetarchiveSuite code-freeze.

Total x

-

-

Total x

-

Tasks from ...

47. Wayback: Implement Indexer

11

CSR

SVC

High priority

Done

48. Wayback: Implement Index Aggregator

4

MKS

JOLF

High Priority

In progress

58. Wayback: Documentation of Indexer/Aggregator

2

CSR

JOLF

1

High priority

Postponed

59. Wayback deploy: If deployment is not possible with Jetty, test Wayback deployment using Tomcat

2

JOLF

CSR

2

High priority

In progress

..

Converting old Web collections to Netarchive.dk. See proposal. These tasks will be independent of NetarchiveSuite code-freeze.

Total x

-

-

Total x

-

Tasks from ...

60. Old Web collection: Old KB Webarchive

SVC

JOLF

High priority

In progress

61. Old Web collection: Old Webarchive from Niels Brugger collected by HTTrack

HBK

SVC

High priority

In progress

62. Old Web collection: Prepare ingest of extracted data from Internet Archive into Netarkivet.dk

SVC

HBK

Wait for IA correction

63. Old Web collection: Ingest received data from Internet Archive into Netarkivet.dk

CLO

SVC

-.

Common tasks calculated as implementation tasks

Total x

-

-

Total x

-

Others

Total x

-

-

SubTotal 2

-

64. Upgrade: New KB-PROD-UDV

5

SVC

TLR

Postponed

65. Batch: Create/execute a batch test script specified by 1 or 2 researchers

2

JOLF

TLR

Postponed

..

..

-

..

..

Prepare release test

Total x

-

-

SubTotal 12

-

66. Prepare release test

6

6

OK

Available man-days for implementation phase

Total x

-

-

Total x

-

Release test phase (task ...)

Release test

Total x

-

-

Total 12

-

67. Execute release test.

12

TLR

All

12

In progress


..

Release notes

Total x

-

-

Total 0.5

-

68. Write release note

0.5

SVC

Awaiting end of code freeze

Available man-days for release test phase

Total x

-

-

Total 10

-

Assignment phase for next iteration (task ...)

69. Component bug/feature fix/management

QA

..

70. Define goals for Iteration 44 task list

CHH

..

71. Presentation of goals and tasks for Iteration 43. Achieve a common understanding of the purpose of the iteration and of each task at the status meeting

SVC

..

72. Assignment of tasks, bugs and feature requests

QA

..

73. Update release test procedure

TLR

..

Available man-days for assignment phase

Total x

-

-

Total 22

-

Timetable

Timetable iteration 43. Updated 4. May 2010

Start time

End time

Responsible

Baseline 4. May 2010. Start time

Baseline 4. May 2010. End time

1. Implementation of decided tasks

4. May 2010

31. May 2010

4. May 2010

31. May 2010

2. Code freeze. Create the build for release test and notify when build is ready

9. June 2010

SVC

1. June 2010

3. Release test

9. June 2010

11. June 2010

TLR

1. June 2010

3. June 2010

4. Code unfreeze

14. June 2010

SVC

4. June 2010

5. Assignments, bug components and bug fixes

10. June 2010

11. June 2010

2. June 2010

3. June 2010

Iteration43TaskList (last edited 2010-08-16 10:24:49 by localhost)