Hey everyone,
Our group plans to run a variant calling pipeline on Amazon's EC2 server and in order to utilize the spot-pricing we have to handle for job termination at various steps. Therefore, I plan to write a program that will decompose a galaxy workflow (.ga) and the associated tools (.xml) into discrete steps that can be represented as individual bash scripts or all piped through something like subprocess.
Before I start I figure I would ask the community if someone has already worked on something like this so I can use it as a reference or the tool itself.
Cheers,
John
Would this allow me to massively parallelize Galaxy workflows on EC2? How are you distributing the necessary tools?