Galaxy-Fetch Sequences-How to extract genomic DNA from Fasta file?

Heads up! This is a static archive of our support site. Please go to help.galaxyproject.org if you want to reach the Galaxy community. If you want to search this archive visit the Galaxy Hub search

Latest

Open

RNA-Seq

ChIP-Seq

SNP

Assembly

Forum

Home

Welcome to Galaxy Biostar! User support for Galaxy! about • faq • rss

Log In

Sign Up

Question: Galaxy-Fetch Sequences-How to extract genomic DNA from Fasta file?

0

3.6 years ago by

mohamed_ismail • 10

United States

mohamed_ismail • 10 wrote:

I am trying to extract Virus genomic DNA sequence using Fetch sequences tools. The source of genomic data is from my history (Fasta file with the name: >DQ900900.1).

Unlike human genomic dna, virus genome cannot be labelled with chromosome no. Therefore, I labelled the first column in the interval file as >DQ900900.1. On analysis, I end up with warning message as shown below:

Unable to fetch the sequence from '35123' to '100' for chrom '>DQ900900.1'.

I assume something wrong with my labels in the first column of the interval file. Please advice.

Thanks

galaxy • 1.3k views

ADD COMMENT • link •

modified 3.6 years ago by Jennifer Hillman Jackson ♦ 25k • written 3.6 years ago by mohamed_ismail • 10

0

3.6 years ago by

Jennifer Hillman Jackson ♦ 25k

United States

Jennifer Hillman Jackson ♦ 25k wrote:

Hello,

Remove the ">" from the identifiers and this will likely solve part of the issue. Just make certain that the identifiers in the reference fasta dataset and the interval dataset are identical otherwise.

The other item to check is that the start coordinate is smaller than the end coordinate. And that the start is "0-based", the same as used in BED format. If the sequence to be extracted is on the complementary strand, designate that by including a strand field.

More about common bioinformatics file formats is in the Galaxy wiki (and also many other places across the internet):
http://wiki.galaxyproject.org/Learn/Datatypes

Best, Jen, Galaxy team

ADD COMMENT • link written 3.6 years ago by Jennifer Hillman Jackson ♦ 25k

Please log in to add an answer.

Similar posts • Search »

Extract Genomic Dna-Strand Information Is Not Recognized
Hello, I am trying to extract sequences from a FASTA file containing genomic information. The co...
Extract Genomic DNA error with custom genome
Hi, I'm new to Galaxy, and I and trying to use the "extract genomic dna" tool. I would like to f...
Galaxy interval format - what should be provided as CHROM#?
Dear colleagues, I have a .txt file with >100 lines of the following format (1st column - seq...
Extract Genomic DNA - no recognized datasets
Hi, I am new to galaxy. I wish to use the Extract Genomic DNA tool, but under "Fetch sequences f...
Confusion From "Extract Genomic Dna (Version 2.2.3) "
Hi Galaxy team, I recently met two problems when I used " Fetch sequences> Extract Genomic ...
Problem Report!!
Dear Galaxy member, I'm sending you this e-mail because of a problem I have in fetching sequence...
Extracting Sequences with coordinates directly from the human genome(hg19), using biopython or Galaxy
Hi! I am not able to enter the build number in 'Fetch Sequences for intervals in' section in the ...
Extract Genomic DNA fasta file names?
Hello, I ran the Extract Genomic DNA feature with my gtf file and reference genome fasta file. ...
Setting Dbkey In A Workflow
I'm working on making my first workflow in Galaxy, using a local server. A high level overview of...
Extract Genomic DNA using coordinates from a gff file
Hi, I have been extracting genomic sequences, using "Extract genomic DNA" from "Fetch Alignment...
Axt And Nib Files For Alignseq.Loc
I was wondering how to get a hold of the axt and/or nib files required by alignseq.loc. It's not ...
Fetch alignments using stitch Gene Blocks but all the gene-sequences are in '--------'
I was using the GALAXY project (https://usegalaxy.org/) online server, the section is Fetch align...
Extract Sequences From [Gtf File] + [Genome Fasta File]
Hi Galaxy people, I have transcripts predicted by Cufflinks that are in a gtf file. How can I ex...

Content

Help

About
FAQ

Access

RSS
Stats
API

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by Biostar version 16.09

Traffic: 175 users visited in the last hour