Question: Filter Bowtie 2 alignments
2.4 years ago
Mic wrote:

Hi, I read this post, which describes how to filter Bowtie 2 alignments with the following command:

awk '$5 == 3 || $5 == 8 || $5 == 23 || $5 == 24 || $5 == 40 || $5 == 42' file.sam

The idea behind the command is to get "unireads" with decent alignment scores.

How can I run the above awk command in Galaxy?


Devon Ryan wrote:

If you convert your BAM file to SAM (the "BAM to SAM" tool), you can then use the "text reformatting (with awk)" tool and give it essentially that script (sans file.sam).

Having said that, I would strongly encourage you not to do that, since what you're trying to do is rarely if ever useful (you might notice my name in that blog post, so I know a bit about the issue). A generally more useful approach is to simply filter by MAPQ, which is also easier to do on the original BAM file.
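As a rough sketch of what filtering by MAPQ looks like in awk terms, assuming a SAM stream on standard input: the helper name filter_mapq, the threshold of 10, and the tiny SAM records are made up for illustration and are not from the thread. Unlike the flag-value command in the question, this version also keeps the header lines. On the original BAM file, samtools view -b -q 10 in.bam > out.bam should give the equivalent result directly (in.bam and out.bam are placeholder names).

```shell
# Hypothetical helper: keep SAM header lines (starting with "@") and
# alignments whose MAPQ (column 5) is at least the threshold given as $1.
filter_mapq() {
    awk -v min="$1" -F'\t' '/^@/ || $5 + 0 >= min + 0'
}

# Made-up input: one header line, one read with MAPQ 42, one with MAPQ 1.
printf '@HD\tVN:1.0\nr1\t0\tchr1\t100\t42\t4M\t*\t0\t0\tACGT\tIIII\nr2\t0\tchr1\t200\t1\t4M\t*\t0\t0\tACGT\tIIII\n' |
    filter_mapq 10
```

With a threshold of 10, the header and r1 pass through and r2 is dropped. In Galaxy, only the part between the single quotes would go into the awk tool, with the threshold written in place of min.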

2.4 years ago by Devon Ryan

Would you recommend using MAPQ = 1?

2.4 years ago by Mic

More along the lines of 5 or 10, depending on the application.
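One way to judge what a given cutoff would cost is to count how many alignments survive at each candidate value before committing to one. A minimal sketch, assuming SAM text on standard input; the helper name count_by_mapq and the sample records are hypothetical, and the thresholds 1, 5 and 10 are simply the values mentioned in this thread:

```shell
# Hypothetical helper: count alignments (non-header lines) whose MAPQ
# (column 5) is at least 1, 5 and 10.
count_by_mapq() {
    awk -F'\t' '
        /^@/ { next }    # skip SAM header lines
        { if ($5 >= 1) a++; if ($5 >= 5) b++; if ($5 >= 10) c++ }
        END { printf "MAPQ>=1\t%d\nMAPQ>=5\t%d\nMAPQ>=10\t%d\n", a, b, c }
    '
}

# Made-up input with MAPQ values 42, 6, 1 and 0.
printf '@HD\tVN:1.0\nr1\t0\tchr1\t100\t42\t4M\t*\t0\t0\tACGT\tIIII\nr2\t0\tchr1\t200\t6\t4M\t*\t0\t0\tACGT\tIIII\nr3\t0\tchr1\t300\t1\t4M\t*\t0\t0\tACGT\tIIII\nr4\t0\tchr1\t400\t0\t4M\t*\t0\t0\tACGT\tIIII\n' |
    count_by_mapq
```

On a real file the input could come from samtools view -h in.bam (in.bam being a placeholder name) piped into the helper.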

2.4 years ago by Devon Ryan

What would you use for SNP discovery and ChIP-seq? Or do you have a list of which MAPQ values are best for particular applications?

2.4 years ago by Mic

If your variant caller doesn't handle MAPQ scores already then try 5 (you're going to filter variants by quality anyway, so the random low MAPQ alignment here and there won't matter much). For ChIPseq, just mark duplicates and don't bother filtering.

2.4 years ago by Devon Ryan

Thank you. By "text reformatting (with awk)", did you mean this tool from the toolshed 'text_processing'?

2.4 years ago by Mic

Sounds right, yes.

2.4 years ago by Devon Ryan
