Question: Snp Finding
0
gravatar for Xiefan Fang
6.1 years ago by
Xiefan Fang30
United States
Xiefan Fang30 wrote:
Dear galaxy users, We have done deep sequencing on some known genomic loci using Hiseq2000. I have already mapped the reads to the reference sequences by using Galaxy. In the next step, I want to find SNPs and calculate the SNP percentage within the reads. There are 500,000 to 1,000,000 reads per biological sample. Can I do it with galaxy? If not, is there other programs available in windows? Considering that I am not very familiar with programming. Thanks, Xiefan University of Florida
galaxy • 1.2k views
ADD COMMENTlink modified 5.1 years ago by Jennifer Hillman Jackson25k • written 6.1 years ago by Xiefan Fang30
0
gravatar for Xiefan Fang
6.1 years ago by
Xiefan Fang30
United States
Xiefan Fang30 wrote:
Dear galaxy users, We have done deep sequencing on some known genomic loci using Hiseq2000. I have already mapped the reads to the reference sequences by using Galaxy. In the next step, I want to find SNPs and calculate the SNP percentage within the reads. There are 500,000 to 1,000,000 reads per biological sample. Can I do it with galaxy? If not, is there other programs available in windows? Considering that I am not very familiar with programming. Thanks, Xiefan University of Florida
ADD COMMENTlink written 6.1 years ago by Xiefan Fang30
0
gravatar for Jennifer Hillman Jackson
5.1 years ago by
United States
Jennifer Hillman Jackson25k wrote:
Hello Xiefan, This page has a simple breakdown of how to call variants within Galaxy. This is brand new, so full annotation/video is pending, but it should still be straightforward to see how the data is prepped and which tools are used. https://usegalaxy.org/u/galaxyproject/p/galaxy-101-ngs-variant Another set of tutorial videos cover the tools, which you can use on the public site or on a scaled up cloud server as needed: https://vimeo.com/channels/galaxytoolshed Hopefully this help you to get started. Jen Galaxy team -- Jennifer Hillman-Jackson http://galaxyproject.org
ADD COMMENTlink written 5.1 years ago by Jennifer Hillman Jackson25k
Hi all, First of all, thank you again for all your efforts to develop a project like Galaxy, that makes leading-edge bioinformatic tools available for non-bioinformaticians and wet-lab biologists like me ! I am using GATK Unified Geneotyper through the Galaxy main server to analyze variations from whole-genome re-sequencing data. I have read in the GATK documentation that there is a tool called "CallableLoci", that gives a .bed file of the genome indicating specifically which base where callable or not by Unified Genotyper (UG), using different criteria such as read depth, base quality, mapping quality. The log & metrics files generated by UG in Galaxy give the general statistics of callable loci, but there is no such a file giving a detailed information of the eligibility of each base. Right now I am using the tool "depth of coverage on BAM file" to get an idea of this information, but it's only partial since it doesn't take into account all the parameters used by UG to consider a locus callable (notably base quality and mapping quality). Do you think it would be possible to implement the "CallableLoci" tool in Galaxy? Would someone propose me an alternative to get this precious information on which areas of the genome are considered callable ? Thanks for your help/advice, Fabrice
ADD REPLYlink written 5.0 years ago by Fabrice BESNARD130
Hi all, Would someone know how to get the information on which areas of the genome are considered callable after a call with Unified genotyper from GATK (as a .bed or pileup file)? Thanks for your help/advice, Fabrice -- Fabrice Besnard Institute of Biology of the Ecole Normale Supérieure (IBENS) 46 rue d'Ulm, 75230 Paris cedex 05, France 8th floor. Office: Room 802. Lab: Room 817. mail: fbesnard@biologie.ens.fr Tel: +33-1-44-32-39-31
ADD REPLYlink written 5.0 years ago by Fabrice BESNARD130
Hi Fabrice, I know I already replied to your other post about this tool and the status/reasons why it is not wrapped in Galaxy right now, but I was going to suggest that you ask on the GATK forum for help about using the "CallableLoci" tool on the command-line since you asked again. Then I checked for prior Q/A and I saw that you were a step ahead and already found advice on 11/8: http://gatkforums.broadinstitute.org/discussion/3448/unified- genotyper-with-galaxy-callable-base-and-depth-of-coverage-after-bqsr This really is the best advice, including the coverage part of the question there. As long as you are an acedemic/non-profit user, there should be no licensing issues using the tool directly. I didn't see anything posted at seqanswers, and that can often be another source of information, but I would have to agree with Geraldine that going command-line with the "CallableLoci" tool is probably your best option since this is the exact data that you want, and I am not aware of any web-based implementations. But, this is still open for others to comment on! The web is a big place & all input is welcome. Good luck with your project! Jen Galaxy team -- Jennifer Hillman-Jackson http://galaxyproject.org
ADD REPLYlink written 5.0 years ago by Jennifer Hillman Jackson25k
Hi Jen, Thanks a lot for spending time again on my issue ! I think I will go for command-line solution ! Thanks, Fabrice Le 03/12/2013 00:43, Jennifer Jackson a écrit : -- Fabrice Besnard Institute of Biology of the Ecole Normale Supérieure (IBENS) 46 rue d'Ulm, 75230 Paris cedex 05, France 8th floor. Office: Room 802. Lab: Room 817. mail: fbesnard@biologie.ens.fr Tel: +33-1-44-32-39-31
ADD REPLYlink written 5.0 years ago by Fabrice BESNARD130
Hi Fabrice, This tool is not implemented in Galaxy as it was released after the current GATK (beta) tool group was created. There is some development on the last GATK-lite pipeline, but as you know this does not include the latest version of all tools. The public Main Galaxy server usegalaxy.org) only contains tools that are licensed for general use, no restrictions. One option is to use the tool command-line, then upload the results into Galaxy for use with our other tools to compliment/incorporate it into your larger analysis. There is much help offered on the GATK forum, and this is just one tool. So, even if you are experienced with bioinformatics on the command-line, this might be possible. You seem to be very knowledgeable about these tools and how they function! My guess is that you'll be able to pick this up! Sorry that we could not help more directly right now. But please follow this Trello ticket to watch upcoming GATK tool wrapper development by the Galaxy team. If this specific tool does turn out to be added in the lite implementation, this is a great way to know when to look for it: https://trello.com/c/IPkT2spd Best! Jen Galaxy team -- Jennifer Hillman-Jackson http://galaxyproject.org
ADD REPLYlink written 5.0 years ago by Jennifer Hillman Jackson25k
0
gravatar for Mark Lindsay
5.1 years ago by
Mark Lindsay70
United Kingdom
Mark Lindsay70 wrote:
Dear Galaxy Users I wondered if anybody is having the same problem. I am trying to run CuffDiff using the latest version of GenCode18 and keep getting the following error message. Is there a problem with the network or is it me? Best wishes Mark
ADD COMMENTlink written 5.1 years ago by Mark Lindsay70
Hi Mark, This error is not tool related. Could you click the “bug” icon to send us an error report? —nate
ADD REPLYlink written 5.1 years ago by Nate Coraor3.2k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 173 users visited in the last hour