Pipeline Architecture: Based on our analysis of execution time for both single end read and paired end reads , we merged the steps to build each stage of the pipeline.
Cloud Set Up: PVT is able to overlap the execution of multiple data files and each pipeline stage works on a different execution step at the same time, it brings upon a huge improvement compared to TopHat. The PVT-Cloud set up can be implemented in a middleware based cloud architecture as shown below. Refer for more details: PVT-Cloud: pragmatic cloud architecture in Article |
Figure: Pipeline buffer management for two consecutive stages of PVT ( ⇢ denotes the buffer mounted in the even time slots, → denotes the buffer mounted in the odd time slots). |
Figure: Middleware based cloud architecture |
Note: If you find PVT (Pipelined Version of TopHat) useful, please cite us at: PVT: An Efficient Computational Procedure to Speed up Next-generation Sequence Analysis |