Question: Using the Extract Genomic DNA tool in Galaxy
gravatar for samiksha.kaul23
2.1 years ago by
samiksha.kaul230 wrote:

Hi everyone!

So I am trying to use the the NGS: SAMtools function in galaxy to generate consensus sequences from BAM files that I have for genomic data. I used the generate pileup from BAM dataset tool to generate a pileup and then based on this post I was trying to generate a consensus sequence in FASTA format. I did the Pileup to Interval step but when I went to do the Extract Genomic DNA step the interval dataset would not show up as an option I could extract Genomic DNA from. It would be really wonderful if someone could let me know what I am doing wrong. I am quite new to this so am not very sure of how each tool works and how the issues can be fixed.

Also does anyone know of a simple way to obtain a consensus sequence from a BAM file in FASTA or FASTQ format? I know most people use SAMtools but I am not well versed with Unix or programming at all so if there is a simpler way that would be very helpful. Please let me know. Thank you!

fasta bed extract-dna galaxy bam • 930 views
ADD COMMENTlink modified 2.1 years ago by Jennifer Hillman Jackson25k • written 2.1 years ago by samiksha.kaul230
gravatar for Jennifer Hillman Jackson
2.1 years ago by
United States
Jennifer Hillman Jackson25k wrote:


For the Extract Genomic DNA issue:

The problem could be the metadata setting on the input interval file. The "datatype" should be bed or interval, but still be in BED formatting (column) orders. Add columns as needed then use Cut to rearrange. Also, the "database" will be assigned**. Note that this will not generate a consensus (assembled) sequence from your reads - the tool is retrieving the genomic sequence that overlaps with the coordinates in the input.

**If you are working with a custom reference genome, you'll need to promote it to a custom build then assign that as the database. See the third bullet under Best Practises here:

For the BAM/SAM to Fastq question:

See the tool SAM to Fastq. This will output all reads in the file, as is, no assembly.

To assemble consensus reads and output as fasta:

Other tools are needed. Trinity is one option. Others can be found in the Tool Shed

Hopefully that helps, Jen, Galaxy team

ADD COMMENTlink written 2.1 years ago by Jennifer Hillman Jackson25k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 175 users visited in the last hour