Nextflow is a workflow tool, frequently used in bioinformatics, for running complex, multi-stage pipelines. For more details on how to use Nextflow, see the online documentation at https://nextflow.io.
Nextflow is installed as a module in /apps. To load the module:
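A minimal sketch of the load step, assuming the module is published under the default name nextflow. The exact versions installed are not stated here, so list them first and pick one explicitly if you need a particular release:

```shell
# List the Nextflow versions available on the system
module avail nextflow

# Load the default version (module name assumed to be "nextflow")
module load nextflow

# Confirm which version is now on the PATH
nextflow -version
```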
Running nextflow on gadi
In nextflow, pipelines are defined as a series of tasks, along with a set of inputs and outputs for each task. Typically, each task is submitted to the queue as a separate job. This requires a long-running nextflow process to manage these tasks. The best way to run this process is in its own separate batch queue job:
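The job script below is a minimal sketch of such a managing job, not a definitive template. The project code ab12, the queue, the resource requests, the storage string and the pipeline file main.nf are all placeholders chosen for illustration; substitute the values for your own project and workflow. The managing process mostly waits on other jobs, so a small CPU and memory request with a long walltime is usually a sensible starting point.

```shell
#!/bin/bash
# Placeholder project code: replace ab12 with your own project
#PBS -P ab12
# Queue and resources for the long-running Nextflow manager process (assumed values)
#PBS -q normal
#PBS -l ncpus=1
#PBS -l mem=4GB
#PBS -l walltime=24:00:00
# Filesystems the job needs to see (placeholder)
#PBS -l storage=scratch/ab12
# Start the job in the directory it was submitted from
#PBS -l wd

# Module name assumed to be "nextflow"
module load nextflow

# main.nf is a hypothetical pipeline script in the submission directory
nextflow run main.nf
```

Saved as, for example, nextflow_manager.pbs, the script would be submitted with qsub nextflow_manager.pbs. Nextflow then submits each task as its own job while this job stays alive to track them.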
The version of nextflow installed on gadi has been slightly modified to make it easier to specify resource options for jobs submitted to the queueing system. Within the nextflow.config file for your workflow:
- Use the
- Extra flags have been added to specify
gpus as an alternative to the
disk flag can be used to reserve space in /jobfs.
As an example, the process section of your config file might contain:
which is equivalent to: