Question: Exception( "Failed to extract job." )
1
northcoted • 30 wrote:
HI,
I get the following error when attempting to create a workflow:
*'Failed to find matching implicit job - job is %s, jobs are %s, assoc_name is %s.'*
Followed by:
Exception( "Failed to extract job." )
The workflow consists of 12 input datasets which were then created as a "dataset list" as input into the subsequent steps.
Cheers.
Hello, Where did this occur? http://usegalaxy.org (Main public Galaxy server) or your own local or cloud server?
Meanwhile, I am testing this out on main to see if I can independently reproduce there.
If your history is on Main (or you can recreate the problem there) and would like to share it, that will help, as the dataset collection list may not be the core issue. Rather, there could be a specific tool triggering the problem. Send a shared history link to galaxy-bugs@lists.galaxyproject.org along with a link to this post to help with that broader testing.
Thanks, Jen, Galaxy team
I seem to have found the following:
In extract.py:
The variables assoc_name and query_assoc_name
By changing the code:
template = "Failed to find matching implicit job - job is %s, jobs are %s, assoc_name is %s." message = template % ( job.id, jobs, assoc.name )
to the following:
template = "Failed to find matching implicit job - job is %s, jobs are %s, assoc_name is %s, query_assoc_name is %s message = template % ( job.id, jobs, assoc.name, query_assoc_name )
This returns assoc_name with the value "txt_file" whereas query_assoc_name holds the value "html_file" which breaks the comparison.
In wrapper file FastQC.xml
So it seems that the 2 output file types may be causing this failure in the if query_assoc_name == assoc_name: test.
Thank you for providing the details. We are still looking into this in more detail to determine if this is an unknown bug (there were cluster issues over the last several days hindering effective testing). I will write back with more details (and possibly an issue ticket) within the next few days. Thanks, Jen, Galaxy team
As an added note.
This same failure occurs when attempting to extract any workflow built under "analyze data" using "Dataset Collections" as input.
Thanks for posting this, I just ran into the exact same thing (also with FastQC) and it looks like the cause is the same.
Update: Our local user who ran into this was able to resolve it by noting that in the selection window some of the steps appeared multiple times (I'm guessing this is due to the multiple output entries in the history). He then ensured that only one such instance of these was selected and then the workflow creation worked.
Excellent. Many thanks. I'll give it a try.
Hi.
It occurred on a local Galaxy server which is unfortunately firewalled.
Basically I have 12 .bam datasets from which I have created a dataset list as input to fastqc
( FastQC Read Quality reports (Galaxy Version 0.63) ( devteam 2d094334f61e) on Galaxy Server v16.01-516-g45bd65f.
I can extract the workflow at the point after the dataset list has been created, however after completion of the fastqc is when the "export workflow" error occurs.
Also, in the "export Workflow" gui, I have the following message under the "Dataset Collection Creation" dialogue box.
"Dataset collection created in a way not compatible with workflows"
Unfortunately I am unable to share as it is patient data that I am using.
Regards
Derek