Help To Identify Variants With Clinical/Phenotype Associations

Question: Help To Identify Variants With Clinical/Phenotype Associations

Luis Santomé • 20 wrote:

Hi all, I have a dataset of potentially pathogenic variants, and I'd like to combine it with a dataset of variants with known clinical associations to identify those responsible for the phenotype. I'd be grateful for any suggestions. -- *J. Luis Santomé Collazo*

modified 6.0 years ago by Jennifer Hillman Jackson ♦ 25k • written 6.0 years ago by Luis Santomé • 20

Jennifer Hillman Jackson ♦ 25k wrote:

Hi Luis,

There are a few options:

The tool "Phenotype Association -> SIFT" accepts an input file of variant locations/alleles and retrieves annotations, including OMIM Disease associations.

Alternatively, you could label your variants by rs identifiers (or perhaps you already have these), or just use genomic coordinates, to intersect with the GWAS Catalog dataset. The general path would be to obtain the most recent dbSNP and GWAS Catalog tracks from the UCSC Table Browser ("Get Data -> UCSC Main", set the genome to hg19, then under the group "Variation and Repeats", dbSNP 135 and GWAS are both listed as tables under the dbSNP track; get both, which will require two queries). You can join on common keys (such as rs numbers) or on overlapping genomic coordinates, depending on the starting data format and how you choose to do the intersect.

For tool help, the first protocol in the "Using Galaxy" paper & supplemental walks through how to extract data from the UCSC Table Browser and join data by various methods. The protocol's goal is different from yours, but the methods will be similar to what you will be doing. The second protocol has even more examples for importing and formatting datasets, if you want to manipulate datasets to customize or alter the datatype.

http://main.g2.bx.psu.edu/u/galaxyproject/p/using-galaxy-2012

I am assuming that you are using a human, hg19 dataset. If you are using another genome, SIFT will not be possible. Still, UCSC may have analogous tracks to select from, depending on the genome. Or you could try BioMart, or one of the other sources under "Get Data", if these have your data. You can also always directly import (upload or FTP) a reference dataset of known SNP phenotypes from any source, mapped to your target genome, and use Galaxy's tools to perform the intersection and file manipulations.

Hopefully this helps to get you started!

Best,
Jen
Galaxy team

-- Jennifer Jackson http://galaxyproject.org
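For anyone who wants to see the logic of the two join strategies mentioned above (common rs numbers versus overlapping coordinates) outside of Galaxy, here is a minimal Python sketch. It is only an illustration, not what Galaxy's join or intersect tools run internally. The field names (chrom, start, end, name, trait), the rs identifiers, and the trait labels are all made up for the example, and the intervals are assumed to be BED-style (0-based start, end-exclusive), so adjust them to match whatever columns your downloaded tables actually have.

```python
from collections import defaultdict


def join_by_rsid(variants, gwas):
    """Inner join on the rs identifier stored in the 'name' field."""
    traits = defaultdict(list)
    for row in gwas:
        traits[row["name"]].append(row["trait"])
    # One output row per (variant, trait) pair; unmatched variants are dropped.
    return [(v["name"], t) for v in variants for t in traits.get(v["name"], [])]


def join_by_overlap(variants, gwas):
    """Join on overlapping coordinates (assumed 0-based, end-exclusive)."""
    by_chrom = defaultdict(list)
    for row in gwas:
        by_chrom[row["chrom"]].append(row)
    hits = []
    for v in variants:
        for g in by_chrom.get(v["chrom"], []):
            # Two half-open intervals overlap when each starts before the other ends.
            if v["start"] < g["end"] and g["start"] < v["end"]:
                hits.append((v["chrom"], v["start"], v["end"], g["name"], g["trait"]))
    return hits


# Hypothetical rows: the third variant has no rs identifier, only coordinates.
variants = [
    {"chrom": "chr1", "start": 100, "end": 101, "name": "rs0000001"},
    {"chrom": "chr2", "start": 500, "end": 501, "name": "rs0000002"},
    {"chrom": "chr3", "start": 900, "end": 901, "name": "."},
]
gwas = [
    {"chrom": "chr1", "start": 100, "end": 101, "name": "rs0000001", "trait": "example trait A"},
    {"chrom": "chr3", "start": 900, "end": 901, "name": "rs0000003", "trait": "example trait B"},
]

rsid_hits = join_by_rsid(variants, gwas)
overlap_hits = join_by_overlap(variants, gwas)
```

In this toy data the coordinate join picks up the chr3 variant that the rs-number join misses because it was never labeled, which is the practical reason to choose the join key based on your starting data format. The nested loop is fine for small lists; for genome-scale tables a sorted or indexed interval structure would be the better choice.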

written 6.0 years ago by Jennifer Hillman Jackson ♦ 25k
