Question: Normalization An Dplotting Of Rpkm/Fpkm After Cufflink
0
gravatar for vasu punj
7.6 years ago by
vasu punj360
vasu punj360 wrote:
Hi,   I want to include the following discussion in my message regarding use Bam files of Tophat to visualize reads either in IGV or Galaxy or other tools.  I want to find out if I can plot RPKM/FPKM normalized values after running differential analysis in Cufflinks. On the seqanswer (http://seqanswers.com/forums/showthread.php?t=9947 ) there is a preliminary discussion about this how we can plot RPKM values to show the  differential abundance in samples. Do we have any of such functionality in Galaxy? alternatively, I would like to have suggestions on the topic especially to normalize per million reads.   Thanks.   Subject: Re: [galaxy-user] get wig file after tophat To: "David Matthews" <d.a.matthews@bristol.ac.uk> Cc: "Baxter, Adam" <adam.baxter@uncc.edu>, galaxy-user@bx.psu.edu Date: Tuesday, February 22, 2011, 11:30 AM All, For visualization, Galaxy now provides a built-in browser called Trackster. Trackster can visualize BAM files--as well as the junction file produced by Tophat and the GFF files produced by Cufflinks--and also provides coverage/summary information for all datatypes (see attached image).  You can start using Trackster by going to the Visualization --> New Track Browser to set up a browser and add tracks. Alternatively, you can click on the Trackster icon in a dataset to visualize the dataset in a new or existing visualization (see attached image). Thanks, J.   HI, The option you need in IGV tools is "count". You set a window size and this gives you a tdf file from your sorted bam (or sam) file which is nice and quick to view on IGV. Best Wishes, David. __________________________________ Dr David A. Matthews Senior Lecturer in Virology Room E49 Department of Cellular and Molecular Medicine, School of Medical Sciences University Walk, University of Bristol Bristol. BS8 1TD U.K. Tel. +44 117 3312058 Fax. +44 117 3312091 D.A.Matthews@bristol.ac.uk Dear David: thank you very much for helping me! I have download the IGV and I do find the IGVtools, however, I am not sure which tool I should use for generate a tdf file, the tile function will generate a tdf file, but the input file format does not include bam or sam file, instead it need wig file. But I have no wig file to put in. So I am wondering whether you need to use other tool first. I really appreciate your help! Thank you very much! Best Ying Quoting David Matthews <d.a.matthews@bristol.ac.uk>: Hi, You can get an equivalent visualisation from the IGV viewer by the Broad Institute - its under IGV tools and generates a tdf file from bam or sam files. This also gives a quick and easy way of looking at depth at any particular site and is very accessible. Cheers David Hi all, Ann is correct - Tophat does not produce .wig files when run anymore. However, it's fairly easy to use Galaxy to make a wiggle-like coverage file from a BAM file: (a) run the pileup tool on your BAM to create a pileup file; (b) cut columns 1 and 4 to get your coverage file. A final note: it's often difficult to visualize coverage files because they're so large. You might be better off visualizing the BAM file and using the coverage file for statistics. Best, J. Hello, I think I know the answer (sort of) to this question. This may be because newer versions of tophat stopped running the "wiggles" program, which is still part of the tophat distribution and is the program that makes the "coverage.wig" file. A later version of tophat might bring this back, however - there's a note to this effect in the tophat python code. So if you can run wiggles, you can make the "coverage.wig" file on your own. A student here at UNC Charlotte (Adam Baxter) made a few changes to the "wiggles" source code that would allow you to use it with samtools to make a "coverage.wig" file from the "accepted_hits.bam" file that TopHat creates. If you (or anyone else) would like a copy, please email Adam, who is cc'ed on this email. We would be happy to help add it to Galaxy if this would be of interest to you or other Galaxy users. If there is any way we can be of assistance, please let us know! Very best wishes, Ann Loraine Hi: I am using tophat in galaxy to analyze my paired-end RNA-seq data and find out that after the tophat analysis, we can not get the wig file from it anymore which is used to be able to. Do you have any idea of how to still be able to get the wig file after tophat analysis? Thanks a lot! Best Ying Zhang, M.D., Ph.D. Postdoctoral Associate Department of Genetics, Yale University School of Medicine 300 Cedar Street,S320 New Haven, CT 06519 Tel: (203)737-2616 Fax: (203)737-2286 _______________________________________________ The Galaxy User list should be used for the discussion of Galaxy analysis and other features on the public server at usegalaxy.org. For discussion of local Galaxy instances and the Galaxy source code, please use the Galaxy Development list: http://lists.bx.psu.edu/listinfo/galaxy-dev To manage your subscriptions to this and other Galaxy lists, please use the interface at: http://lists.bx.psu.edu/ -- Ann Loraine Associate Professor Dept. of Bioinformatics and Genomics, UNCC North Carolina Research Campus 600 Laureate Way Kannapolis, NC 28081 704-250-5750 www.transvar.org _______________________________________________ The Galaxy User list should be used for the discussion of Galaxy analysis and other features on the public server at usegalaxy.org. For discussion of local Galaxy instances and the Galaxy source code, please use the Galaxy Development list: http://lists.bx.psu.edu/listinfo/galaxy-dev To manage your subscriptions to this and other Galaxy lists, please use the interface at: http://lists.bx.psu.edu/ _______________________________________________ The Galaxy User list should be used for the discussion of Galaxy analysis and other features on the public server at usegalaxy.org. For discussion of local Galaxy instances and the Galaxy source code, please use the Galaxy Development list: http://lists.bx.psu.edu/listinfo/galaxy-dev To manage your subscriptions to this and other Galaxy lists, please use the interface at: http://lists.bx.psu.edu/ _______________________________________________ The Galaxy User list should be used for the discussion of Galaxy analysis and other features on the public server at usegalaxy.org. For discussion of local Galaxy instances and the Galaxy source code, please use the Galaxy Development list: http://lists.bx.psu.edu/listinfo/galaxy-dev To manage your subscriptions to this and other Galaxy lists, please use the interface at: http://lists.bx.psu.edu/ Ying Zhang, M.D., Ph.D. Postdoctoral Associate Department of Genetics, Yale University School of Medicine 300 Cedar Street,S320 New Haven, CT 06519 Tel: (203)737-2616 Fax: (203)737-2286 _______________________________________________ /
gff rna-seq cufflinks samtools bam • 2.2k views
ADD COMMENTlink modified 7.6 years ago • written 7.6 years ago by vasu punj360
0
gravatar for Jim Robinson
7.6 years ago by
Jim Robinson150
Jim Robinson150 wrote:
Hi, This is not a Galaxy answer specifically, but since you mention IGV, you can load the "expr" and of course "gtf" files in IGV 2.0. Its not released but is available from the "early access" links on the downloads page (htttp://www.broadinstitute.org/igv/downloads) If you want to use IGV with Galaxy we have some XML that will enable that, email me at igv-help@broadinstitute.org to request it. Best, Jim
ADD COMMENTlink written 7.6 years ago by Jim Robinson150
0
gravatar for Daniel Blankenberg
7.6 years ago by
Daniel Blankenberg ♦♦ 1.7k
United States
Daniel Blankenberg ♦♦ 1.7k wrote:
Hello, We would like to mention that a community contributed XML configuration file that enables BAM display at IGV was added to Galaxy in 5252:a62cec23a157. In your local Galaxy instance, you can enable it by removing the comments from around: in your datatypes_conf.xml file for the BAM datatype. This will let users view their BAM files in a local IGV instance or using web start. We would be interested in including additional displays in Galaxy for other datatypes when they are supported in IGV. If configurations already exist for other datatypes, please feel free to contribute them. Thanks for using Galaxy, Dan
ADD COMMENTlink written 7.6 years ago by Daniel Blankenberg ♦♦ 1.7k
Jim,   Thanks. My idea is to explore any option for visualizing RPKM in IGV or Galaxy or any other common tool. Since Galaxy has created some additional visualization track with Genome browser so I want to have feedback from other users.   Thanks       Subject: Re: [galaxy-user] Normalization an dplotting of RPKM/FPKM after cufflink To: "Jim Robinson" <jrobinso@broadinstitute.org> Cc: "vasu punj" <punjv@yahoo.com>, galaxy-user@bx.psu.edu Date: Thursday, April 14, 2011, 7:49 AM Hello, We would like to mention that a community contributed XML configuration file that enables BAM display at IGV was added to Galaxy in 5252:a62cec23a157. In your local Galaxy instance, you can enable it by removing the comments from around: in your datatypes_conf.xml file for the BAM datatype. This will let users view their BAM files in a local IGV instance or using web start. We would be interested in including additional displays in Galaxy for other datatypes when they are supported in IGV. If configurations already exist for other datatypes, please feel free to contribute them. Thanks for using Galaxy, Dan
ADD REPLYlink written 7.6 years ago by vasu punj360
0
gravatar for Jeremy Goecks
7.6 years ago by
Jeremy Goecks2.2k
Jeremy Goecks2.2k wrote:
Vasu, It's not clear what you're trying to accomplish, but I'll try to provide some information that you might find useful: Galaxy has a number of tools for analyzing numerical data; look under the menu items Statistics and Graph/Display Data for useful tools. If you're looking to plot FPKM values in addition to mapped reads from Tophat and Cufflinks transcripts, the Galaxy Tracks Browser might prove useful as it has filtering functionality so that you can move a slider to show/hide data based on FPKM values; its often useful to use the sliders for FPKM measures to get a sense of your data. See the attached screenshot for an example.
ADD COMMENTlink written 7.6 years ago by Jeremy Goecks2.2k
0
gravatar for vasu punj
7.6 years ago by
vasu punj360
vasu punj360 wrote:
Thanks Jeremy,   This appear to be a useful function. Could you please enlist the steps in workflow to achieve  the above visualization or alternatively point me to the URL where it is summarized please. I believe it will take Tophat out put Bam file and fpkm tracking file. I tried  but I dont see   track browser unless i convert to GTF file format. Further if you can point me how to get the slider window function  as shown in snap shot that will be great. Good work Jeremy!  Thanks.   Vasu       Subject: Re: Normalization an dplotting of RPKM/FPKM after cufflink To: "vasu punj" <punjv@yahoo.com> Cc: galaxy-user@bx.psu.edu Date: Sunday, April 17, 2011, 3:45 PM Vasu, Galaxy has a number of tools for analyzing numerical data; look under the menu items Statistics and Graph/Display Data for useful tools. If you're looking to plot FPKM values in addition to mapped reads from Tophat and Cufflinks transcripts, the Galaxy Tracks Browser might prove useful as it has filtering functionality so that you can move a slider to show/hide data based on FPKM values; its often useful to use the sliders for FPKM measures to get a sense of your data. See the attached screenshot for an example. Best, J.
ADD COMMENTlink written 7.6 years ago by vasu punj360
Vasu, Here are the steps to create this visualization; this is relatively new functionality, and you'll want to use our test server ( http://test.g2.bx.psu.edu/ ) for now. (1) Create a new visualization from the main menu: Visualization --> New Track Browser and choose your genome build. (2) Add your Cufflinks GTF files to the visualization using the 'Add Tracks' button in the upper right of the visualization. (Adding the Tophat reads and/or annotation tracks might prove useful as well.) (3) Zoom in (use the button at the top, double click on a point, or drag to select an area on the genome coordinates at the top) until the track's menu has the option 'Show filters' and choose this option to show filters. (4) Once filters are visible, you should be able to drag the slider to dynamically filter transcripts. Here's an example visualization of some mapped Tophat reads and Cufflinks transcripts that you can try out: http://test.g2.bx.psu.edu/u/jeremy.goecks/v/assembly-of-h1-hesc-rna- seq-data We're continuing to refine and extend this functionality and the Galaxy Track Browser in general; questions/comments/suggestions are most welcome. Best, J.
ADD REPLYlink written 7.6 years ago by Jeremy Goecks2.2k
I have to say that's a very nice visualization. Well done. Jim
ADD REPLYlink written 7.6 years ago by Jim Robinson150
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 172 users visited in the last hour