The tool geecee will take fasta sequence as input. I am not sure if
just have the bed coordinates of the regions of interest or already
the coordinates of genes contained within these regions yet.
If you need the genes, then one choice is to extract a track from the
UCSC table browser to obtain transcripts in bed12 format with the tool
"Get Data: UCSC Main". Tracks in the group "Genes and Gene
are most likely what you will want. You can read about the choices at
UCSC, but common selections include UCSC Genes, Refseq Genes, etc. You
can get them all, them use tools in the group "Operate on Genomic
Intervals" to limit the group to just those that fit within the
For a list of associated gene identifiers, related tables to most gene
tracks at UCSC contain that sort of information. Do a separate extract
operation to obtain a file that contains the gene and transcript
identifiers, then join the data together with the transcript you
after performing the above filtering, to link in the gene name.
Once you have the transcript coordinates, fasta sequence can be
in two ways. If you want to do the GC counts off of the mRNA, use the
transcript identifiers in the UCSC Table browser again, choose
output (not bed), and this time extract "mRNA" when prompted (not
genomic). If genomic sequence is fine, the tool "Fetch Sequences ->
Extract Genomic DNA" can be used.
Then use the fasta sequeces as input to the "geecee" tool - the
you were having were most likely with giving the tool the wrong type
This is a lot of steps, and how you decided to organzize the data
running geecee will affect how the summary stats are calculated.
any stretch of nucleotide fasta sequence can be used for input (I do
know of an upper length bound, but there probably is one, so just
for that - if an error comes up, work with smaller regions). You could
also just convert the fasta sequence to tabular, and add up the total
bases, count Gs, count Cs, etc. then perform a calculation on your
See also "Regional Variation -> Feature coverage", "Graph/Display
and "BEDTools*"*, each may be helpful, for different reasons.
There are several tutorials that do many of these same basic
as part of the analysis or tool demos. Reviewing them will help you to
know how to structure inputs, use particular tools, etc, if you would
like the guidance. Under "Shared Pages": pls see Galaxy 101 and Using
Galaxy 2012 for the introduction tutorials.
Galaxy Support and Training