Question: Create and extract consensus sequence after initial genome mapping using a custom genome
gravatar for manjunath.keremane
8 months ago by
manjunath.keremane0 wrote:

I have aligned fastq sequences (from an insect vector) to a small bacterial genome of 1.2 mb (as text file) using Bowtie2 in Galaxy. The aligned sequences are in about 15 contigs with short gaps. How do I extract the consensus sequences as fasta contigs from a BAM or a SAM file?

ADD COMMENTlink modified 8 months ago by Jennifer Hillman Jackson25k • written 8 months ago by manjunath.keremane0
gravatar for Jennifer Hillman Jackson
8 months ago by
United States
Jennifer Hillman Jackson25k wrote:


The mapped BAM dataset will only contain the original reads, not assembled results based on overlap/other factors. I don't think this is what you are asking about, but those reads can be output again using the tool NGS: Picard > SamToFastq extract reads and qualities from SAM/BAM dataset and convert to fastq. Some mapping tools also include an option to output optional fastq datasets containing mapped vs unmapped reads.

To assemble those mapped reads into consensus sequences, please see the Galaxy tutorials here:

Prior Q&A might also help, review/search the posts in the right side bar >> or search all Galaxy resources here:

I am not sure if your data is RNA or DNA, but either way, one of the Galaxy tutorials probably fits what you want to do. If you cannot find a match, please explain more about your data content/analysis goals and we can help more.

Thanks, Jen, Galaxy team

ADD COMMENTlink written 8 months ago by Jennifer Hillman Jackson25k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 172 users visited in the last hour