UCSC gene names

Heads up! This is a static archive of our support site. Please go to help.galaxyproject.org if you want to reach the Galaxy community. If you want to search this archive visit the Galaxy Hub search

Latest

Open

RNA-Seq

ChIP-Seq

SNP

Assembly

Forum

Home

Welcome to Galaxy Biostar! User support for Galaxy! about • faq • rss

Log In

Sign Up

Question: UCSC gene names

0

15 months ago by

jrberminghamjr • 10

jrberminghamjr • 10 wrote:

I have uploaded a database of genes from the UCSC genome browser (Hg38) to Galaxy, to identify the genes with the largest number of polymorphisms in paired-end sequences from a trio (for a Coursera project). I have two questions: 1) the name column in the uploaded dataset does not have standard gene names; how do I convert to standard gene names? 2) I have set the tools in my workflow to use Hg38 consensus, but it isn't available in the UCSC browser. Are the coordinates for Hg38 the same as for Hg38 consensus, or should I change the settings in my tools to Hg38? Thank you.

coursera gene name ucsc genome browser • 511 views

ADD COMMENT • link •

modified 15 months ago by Jennifer Hillman Jackson ♦ 25k • written 15 months ago by jrberminghamjr • 10

0

15 months ago by

Jennifer Hillman Jackson ♦ 25k

United States

Jennifer Hillman Jackson ♦ 25k wrote:

Hello,

For 1) - Which track and format did you extract from UCSC? Often tables for a track will have gene names in related tables. Find where these are by reviewing table schemas and output using "selected fields from primary and related tables". Submit and a list of related tables will be then available to browse and select content from.

Alternatively, you might want to do the analysis first (using the original annotation data) then at the end link in associated gene names. Input a tabular file that contains the identifiers in your original annotation plus the new identifier you want to add in with and a tool like Text Manipulation > Join two files.

For 2) - The coordinates for the chromosomes in common are exactly the same. This means you can use hg38 canonical reference genome for analysis and view in the hg38 full UCSC browser. The output datasets from Galaxy analysis will have the primary base genome hg38 already assigned as the database metadata attribute. The assigned database for a dataset is the value key used to link into UCSC's content/genome browser.

Thanks, Jen, Galaxy team

ADD COMMENT • link written 15 months ago by Jennifer Hillman Jackson ♦ 25k

Please log in to add an answer.

Similar posts • Search »

Critical Feedback
This student was more adventurous. I think he actually could do more of what he tried with more e...
How Do I Import Protein Coding Genes Location From Ucsc?
I have to import into Galaxy annotations of coding genes from the UCSC genome browser and then us...
Replicate Common Gene Names
This is somewhat Galaxy related, but more of a general question. I did all of my mapping and ini...
How to add and use a custom reference genome
How to import my genome (maize B73v3) into galaxy? I have B73v3 in custom-run UCSC genome browse...
Ucsc Tools
Hi, is there a way to create a jobin my history from UCSC genome browser that contains ONLY genes...
gene Symbol from cuffdiff output
Hi. I received output from cuffdiff which included gene_id, gene and locus for each differential...
Cufflinks tool locally cached 'hg38' malformed; error reads: 'An invalid option was selected for index, u'hg38', please verify'
Greetings, I am trying to use the Cufflinks tool in the main/public instance of Galaxy to gen...
FeatureCount output does not match view in IGV browser
I have a BAM file that I view locally in IGV browser. Specifically, I am focused on hsa-mir-122. ...
Finding gene names from TopHat results.
Hello, I am trying a build a workflow for RNA-Seq analysis and I want to verify my methods via co...
Main galaxy instance not working
I am a user of the main Galaxy server, and, this morning, I cannot get any tool working, when I c...
How to upload Mouse reference genome mm10, in Fasta format to My Galaxy History
I tried to use an imported "tuxedo protocol" RNA-seq pipeline from public workflows. I found mous...
There is a link to view human hg19 but not hg38 bed files using UCSC. Why?
I am working my way through the Galaxy 101 tutorial. When I use hg19 data I have the option of vi...
Chip-Seq And Gene Ontology With Galaxy-Ucsc
Hi, I am currently doing some ChIP-Seq experiments, to analyze the data I get from the PeakFinde...
Galaxy 101 Tutorial UCSC Browser version
Hi there, Not sure which version of UCSC Genome Browser was used for the Galaxy 101 Tutorial. I ...

Content

Help

About
FAQ

Access

RSS
Stats
API

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by Biostar version 16.09

Traffic: 172 users visited in the last hour