Assignment group B.4 - Improve batch architecture

Printer friendly version

References

Reference documents

Dependencies

Terminology

Nothing yet.

Bugs

(maybe) addressed by these assignments (important ones in bold)

Feature requests

(maybe) addressed by these assignments (import ones in bold)

Assignment B.4.1 - use security manager for batch jobs

Read relevant parts of the Java security tutorial trail. Brief rundown of JavaSecurityCommands.

Set up our bitarchives to run with a security policy that grants AllPermission to code signed by the NetarchiveSuite, and only limited permissions to other code. The limited permissions should be just enough for third party class files to write results of batchjobs to the result files.

Update our build scripts to sign the jar files.

Update our deploy applications to start the bitarchives with a security policy.

Note: This may not be the correct or ideal solution, a better solution may present itself while reading the Java tutorial trail, which I have not done while writing this assignment.

Note: I am unsure if this will not still allow batch jobs to interfere with each other.

Estimated time

Estimator

3 md

KFC

Assignment B.4.2 - allow third-party batch jobs to be submitted

Basically, we need a batch job that takes a serialised class file, and loads it at the bitarchives and then runs it.

Such a class would look like this (but should of course be tested and documented properly, and have error handling):

public class ClassBatchJob extends FileBatchJob {
    private final byte[] fileBatchJobClass;
    private transient FileBatchJob job;
    public ClassBatchJob(byte [] fileBatchJobClass) {
        this.fileBatchJobClass = fileBatchJobClass;
    }
    public void initialize(OutputStream os) {
        Class c = new ClassLoader() {
                          Class initialize() {
                              return defineClass(null, fileBatchJobClass,
                                                 0, fileBatchJobClass.length);
            }
        }.initialize();
        try {
            job = (FileBatchJob) c.newInstance();
        } catch (InstantiationException e) {
            throw new IOFailure("Unable to initialise class", e);
        } catch (IllegalAccessException e) {
            throw new IOFailure("Illegal access for class", e);
        }
        job.initialize(os);
    }
    public boolean processFile(File file, OutputStream os) {
        return job.processFile(file, os);
    }
    public void finish(OutputStream os) {
        job.finish(os);
    }
}

Once that job has been written, it is merely a question of making a webpage where you can upload a class file, and submit the job and print the results.

The webpage should be asynchronous, so a web timeout does not prevent you from seeing the result of the batch job.

Don't forget to update the user manual, the developer manual, and the installation manual (about starting with security manager) afterwards.

Estimated time

Estimator

3 md

KFC

Assignment B.4.3 - better infrastructure

Update FileBatchJob with a merge method, which given two output files from this batch job, defines how they should be merged to one. The default implementation should simply concatenate them.

Update BitarchiveMonitor to call this method when merging.

Update developer documentation with information on the extended batch job definition.

Estimated time

Estimator

2 md

KFC

Assignment B.4.4 - Yet more better infrastructure

Collect exceptions during the execution of a FileBatchJob, and send these back as part of the BatchStatus (Bug 1193). At the same time, we should improve the information logged about the executed BatchJobs, as from now, we may not know the identity of these batchjobs: Bug #1279 Missing toString method on FileBatchJob classes.

Estimated time

Estimator

3.5 md

SVC

AssignmentGroupB4 (last edited 2010-08-16 10:24:55 by localhost)