PGP: parallel prokaryotic proteogenomics pipeline for MPI clusters, high-throughput batch clusters and multicore workstations

    loading  Checking for direct PDF access through Ovid



We present the first public release of our proteogenomic annotation pipeline. We have previously used our original unreleased implementation to improve the annotation of 46 diverse prokaryotic genomes by discovering novel genes, post-translational modifications and correcting the erroneous annotations by analyzing proteomic mass-spectrometry data.


This public version has been redesigned to run in a wide range of parallel Linux computing environments and provided with the automated configuration, build and testing facilities for easy deployment and portability.

Availability and implementation:

Source code is freely available from under GPL license. It is implemented in Python and C++. It bundles the Makeflow engine to execute the workflows.


Related Topics

    loading  Loading Related Articles