Question: Input: gene name (UCSC-style) Desired output: promoter sequences in FASTA format
0
gravatar for virlana.shchuka
2.2 years ago by
virlana.shchuka0 wrote:

Hi everyone!

I have a list of 450 gene names for which I need to find the promoter sequences (I will later use a program to find the TF motifs in said promoter sequences). I have very little experience with bioinformatics and programming, and was wondering what is/are the most interface-friendly programs I can use to get my desired output?

Thanks!

fasta ucsc promoter extract galaxy • 739 views
ADD COMMENTlink modified 2.2 years ago by Jennifer Hillman Jackson25k • written 2.2 years ago by virlana.shchuka0
0
gravatar for Jennifer Hillman Jackson
2.2 years ago by
United States
Jennifer Hillman Jackson25k wrote:

Hello,

The UCSC's Genome Table Browser will extract this information in one step if the genome is included there (http://genome.ucsc.edu). The output can be sent to Galaxy (http://usegalaxy.org).

The tool "Get Data > UCSC Main" can be used. An example of an appropriate track would be RefSeq Genes, but you can explore others. Enter the gene names as a filter, select output as fasta, submit to filter for promoter regions. More help here or you can contact that team for more detailed usage assistance.

There are other methods, but these would involve knowing the genome coordinates of the genes and the reference genome already indexed Galaxy or loaded as a custom genome. Let us know if you need help with that by sharing more about the target genome and format of the gene names.

Thanks, Jen, Galaxy team

ADD COMMENTlink written 2.2 years ago by Jennifer Hillman Jackson25k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 175 users visited in the last hour