3.6 years ago by
QFAB, Brisbane
Hopefully someone else might come up with a more elegant solution, but a quick hack to do this might simply be to filter the VCF files for "0/1" at the beginning of column 10. In theory, you should be able to do this using the Filter tool with a filter condition of:
c10.split(":")[0]=="0/1"
This splits column 10 on the : character, and then only returns lines with 0/1 as the first string in that separation (first being index position zero, hence the [0])
But in testing, I've found some VCF files that this doesn't work for (possibly a bug in the filter or split function). If you get that, you could try the even cruder approach, using the Select tool to search for
\t0/1
This just looks for the string 0/1 immediately after a tab character (i.e. at the beginning of a column field), and seems to work reasonably well
Hello Mark,
Thankyou very much. I am trying this now.
Viva