GRAM Job Manager and Reporter Support for SGE (Sun Grid Engine)

  • Globus team has provided GRAM Job Manager and Reporter Scheduler Support for many popular schedulers (see download page). Unfortunately, the support for Sun Grid Engine is lacking.

  • Some time ago, I've made a perl module for GRD (now obsolete), which was used at London eScience Centre (LeSC).

  • Since then, the module has been improved, all known bugs fixed and packaged. Furthermore, a package for GRAM SGE Reporter is also available. You are welcome to download and use the packages.

  • I would like to thank Paulo Tiberio Bulhoes who was involved at one stage of the project for his suggestions.

  • The packages has been tested on all LeSC's Globus (v2.2.3 and v2.2.4) and SGE/SGEEE (v5.3) enabled resources. Of course, your feedback (positive or negative) is welcome (marko at imperial.ac.uk).

  • Please note that I take no responsibility for any loss and/or damage this might cause.

Download/Installation

  • The SGE job manager is available as: globus_gram_job_manager_setup_sge-0.9.tar.gz
  • The SGE reporter is available as: globus_gram_reporter_setup_sge-0.8.tar.gz
  • If you are happy with the default options (see below) for job manager, download it and do the same as with other Globus packages:

    $GPT_LOCATION/sbin/gpt-build globus_gram_job_manager_setup_sge-0.9.tar.gz
    $GPT_LOCATION/sbin/gpt-postinstall

  • If you would like to change some options, download the package for the job manager, untar it, and run configure (see configure --help for more info), make, make install. And finally, run $GPT_LOCATION/sbin/gpt-postinstall.

  • To install the reporter, run:

    $GPT_LOCATION/sbin/gpt-build globus_gram_reporter_setup_sge-0.8.tar.gz
    $GPT_LOCATION/sbin/gpt-postinstall

Features

  • The job manager is installed as jobmanager-sge by default. This option can be easily changed.
  • If available, Sun MPI is used by default. One can disable Sun MPI and use another implementation.
  • We have the following mapping:

    • If job type is single and count is one, run a single copy of a job
    • If job type is single and count is at least 2, submit a job array with corresponding count.
    • If job type is multiple, submit count copies of a job
    • If job is mpi, run mpirun

  • System admin can specify a default MPI Parallel Envirnonment (PE), which will be used for MPI/multiple jobs. Of course, only PEs integrated into SGE/SGEEE on your system are supported. By default, no PE is specified. Note that this value will be over-written by SGE_PE/GRD_PE value in a RSL file.
  • One can enable PE and queue validation. That is, available PEs and queues are found and added in a RVF validation file ($GLOBUS_LOCATION/share/globus_gram_job_manager/sge.rvf). PE and queue validation is disabled by default.
  • Monitoring your job by email is also supported, that is, one can specify email_address, emailonexecution, emailontermination, emailonabort and emailonsuspend as additional RSL attributes.
  • After installing the SGE job manager, your RVF ($GLOBUS_LOCATION/share/globus_gram_job_manager/sge.rvf) will look something like this.
  • A sample RSL file is also provided: sge.rsl.

The contents of this page are provided by Marko Krznaric, Department of Computing, Imperial College London, UK.

Changelog


Last update: 4th June 2003