Question: Input: gene name (UCSC-style); desired output: promoter sequences in FASTA format
19 months ago by
virlana.shchuka0 wrote:

Hi everyone!

I have a list of 450 gene names for which I need to find the promoter sequences (I will later use a program to find the TF motifs in those promoter sequences). I have very little experience with bioinformatics and programming. Which programs with the most user-friendly interface can I use to get my desired output?


fasta ucsc promoter extract galaxy • 496 views
19 months ago by
United States
Jennifer Hillman Jackson24k wrote:


The UCSC Table Browser will extract this information in one step if the genome is included there. The output can be sent to Galaxy.

The tool "Get Data > UCSC Main" can be used. An example of an appropriate track would be RefSeq Genes, but you can explore others. Enter the gene names as a filter, select sequence (FASTA) as the output, and submit; the form that follows lets you restrict the output to promoter regions. More help is available here, or you can contact that team for more detailed usage assistance.

There are other methods, but these would involve knowing the genome coordinates of the genes and having the reference genome already indexed in Galaxy or loaded as a custom genome. Let us know if you need help with that by sharing more about the target genome and the format of the gene names.
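For the coordinate-based route, here is a minimal Python sketch of how promoter windows could be derived from gene coordinates. It is only an illustration under stated assumptions: the function names, the example genes, and the 1000 bp upstream window are all hypothetical, and coordinates are assumed to be 0-based, half-open (as in BED files). The window is clamped at position 0 but not at the chromosome end.

```python
def promoter_interval(tx_start, tx_end, strand, upstream=1000, downstream=0):
    """Return (start, end) of a promoter window, 0-based and half-open.

    Assumption: the gene occupies [tx_start, tx_end) on the reference.
    On "+" the transcription start site (TSS) is at tx_start; on "-" the
    gene is read right to left, so the TSS is at the tx_end side.
    """
    if strand == "+":
        start, end = tx_start - upstream, tx_start + downstream
    elif strand == "-":
        start, end = tx_end - downstream, tx_end + upstream
    else:
        raise ValueError(f"strand must be '+' or '-', got {strand!r}")
    # Clamp at the chromosome start; the chromosome end is not checked here.
    return max(start, 0), end


def promoter_bed_line(chrom, tx_start, tx_end, name, strand,
                      upstream=1000, downstream=0):
    """Format one promoter window as a 6-column, tab-separated BED line."""
    start, end = promoter_interval(tx_start, tx_end, strand, upstream, downstream)
    return f"{chrom}\t{start}\t{end}\t{name}_promoter\t0\t{strand}"


# Hypothetical example genes: (chrom, tx_start, tx_end, name, strand)
genes = [
    ("chr1", 5000, 9000, "GENE_A", "+"),
    ("chr2", 20000, 26000, "GENE_B", "-"),
]

for chrom, tx_start, tx_end, name, strand in genes:
    print(promoter_bed_line(chrom, tx_start, tx_end, name, strand))
```

The resulting BED lines could then be uploaded to Galaxy and passed to a tool that extracts genomic DNA for intervals. Keeping the strand column matters there, so that minus-strand promoters can be returned as the reverse complement.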

Thanks, Jen, Galaxy team

