Question: Genome Browser Histogram Visualization Of Accepted Hits
0
gravatar for Fowler, Trent
6.5 years ago by
Fowler, Trent20 wrote:
Hello, I am attempting to run accepted hit data from Tophat output into the UCSC Genome Web Browser for visualization of sequencing hits in specific genes. However, the BAM files yield tiles and are too large to present through the browser. Is there a better file format to convert to that would allow better visualization such as histograms? Any tips or solutions on how to obtain histograms from sequencing data would be very welcome. Thanks Trent Fowler
rna-seq tophat • 1.5k views
ADD COMMENTlink modified 6.5 years ago by Jennifer Hillman Jackson25k • written 6.5 years ago by Fowler, Trent20
0
gravatar for Hiram Clawson
6.5 years ago by
Hiram Clawson260
Hiram Clawson260 wrote:
Please note this advice: http://genomewiki.ucsc.edu/index.php/Selecting_a_graphing_track_da ta_format You definitely do *not* want to convert BAMs to BEDs. That would be a step backwards. BAM files should display in the genome browser without difficulty, unless they are attempting to display pile-ups of thousands of reads in the same location. In this case you want to use the samtools functions to construct bigWig pile-up density graphs from your BAM files. BAM files can display from a URL to your file, you do not need to upload them to the genome browser. They are very efficient when used in this manner. --Hiram To: galaxy-user@bx.psu.edu Subject: [galaxy-user] Genome Browser Histogram Visualization of Accepted Hits Hello, I am attempting to run accepted hit data from Tophat output into the UCSC Genome Web Browser for visualization of sequencing hits in specific genes. However, the BAM files yield tiles and are too large to present through the browser. Is there a better file format to convert to that would allow better visualization such as histograms? Any tips or solutions on how to obtain histograms from sequencing data would be very welcome. Thanks Trent Fowler
ADD COMMENTlink written 6.5 years ago by Hiram Clawson260
0
gravatar for Jennifer Hillman Jackson
6.5 years ago by
United States
Jennifer Hillman Jackson25k wrote:
Hi Trent, Hiram's advice is very good for UCSC display (thanks Hiram!) You most likely heard of converting BAM->SAM->Interval->BED->BigBED. But, Hiram is exactly correct - there will be a great loss of information (notably the full individual sequence data itself, as only the coordinates + variation will remain, plus the data will be in BED6, which means no splicing, just global coordinates - you won't want this for RNA-seq). Since you are examining RNA-seq data, I also wanted to remind you of the choice to use Trackster for visualization (top Galaxy menu -> "Visualization"). A local or custom reference genome can be used, histogram or full read display is possible, and deeply sequenced areas can be "unpacked" in stages by using the visualization controls (really nice). Other features such as drag and drop composite track creation, new datasets from selects within Trackster (to feed more analysis), strand coloring, etc. are fairly recent enhancements (see: http://wiki.g2.bx.psu.edu/DevNewsBriefs/2012_03_12#Galaxy_Track_Browse r_.28GTB.29 http://wiki.g2.bx.psu.edu/DevNewsBriefs/2012_05_11#Galaxy_Track_Browse r_.28GTB.29). More undocumented (so far!) features are added all the time, so good to experiment. A new paper is in the works that will cover the details. And there will be tutorials, screencasts, etc. later on, but for now during the active development cycle, just using it is best and is pretty intuitive. We're interested in what you like or wish was added (this is true to everyone reading this!). No promises, but is good to get feedback! If there is a particular UCSC track (or another source mapped to same reference genome) that you wish to compare your data against, it could be imported into Galaxy and added to your Trackster visualization session. There are no known limits on data size/depth - only the general Galaxy dataset file size limit of 50G (applies to all datasets) - although I suppose an extreme corner case could probably be devised/uncovered. The point is that a routine RNA-seq experiment shouldn't present a problem. Special tuning to speed up and handle depth has been a big area of recent development activity. If you do run into issues with Trackster, feel free to ask questions (I'll probably ask you to send a share link to the visualization on the pubic main Galaxy instance and bring Jeremy into the discussion if the solution is not clear). Please let us know if you need more help, Jen -- Jennifer Jackson http://galaxyproject.org
ADD COMMENTlink written 6.5 years ago by Jennifer Hillman Jackson25k
Thanks Jen, I have not had good luck with the Visualization feature. Using BAM files, it has worked once to produce a single histogram but mostly seems stuck preparing data for up to 3 hours with no production. Additionally, once a graph is made how can one save the graph as a picture file; pdf, etc? Best, Trent To: Fowler, Trent Cc: galaxy-user@bx.psu.edu Subject: Re: [galaxy-user] Genome Browser Histogram Visualization of Accepted Hits Hi Trent, Hiram's advice is very good for UCSC display (thanks Hiram!) You most likely heard of converting BAM->SAM->Interval->BED->BigBED. But, Hiram is exactly correct - there will be a great loss of information (notably the full individual sequence data itself, as only the coordinates + variation will remain, plus the data will be in BED6, which means no splicing, just global coordinates - you won't want this for RNA-seq). Since you are examining RNA-seq data, I also wanted to remind you of the choice to use Trackster for visualization (top Galaxy menu -> "Visualization"). A local or custom reference genome can be used, histogram or full read display is possible, and deeply sequenced areas can be "unpacked" in stages by using the visualization controls (really nice). Other features such as drag and drop composite track creation, new datasets from selects within Trackster (to feed more analysis), strand coloring, etc. are fairly recent enhancements (see: http://wiki.g2.bx.psu.edu/DevNewsBriefs/2012_03_12#Galaxy_Track_Browse r_.28GTB.29 http://wiki.g2.bx.psu.edu/DevNewsBriefs/2012_05_11#Galaxy_ Track_Browser_.28GTB.29). More undocumented (so far!) features are added all the time, so good to experiment. A new paper is in the works that will cover the details. And there will be tutorials, screencasts, etc. later on, but for now during the active development cycle, just using it is best and is pretty intuitive. We're interested in what you like or wish was added (this is true to everyone reading this!). No promises, but is good to get feedback! If there is a particular UCSC track (or another source mapped to same reference genome) that you wish to compare your data against, it could be imported into Galaxy and added to your Trackster visualization session. There are no known limits on data size/depth - only the general Galaxy dataset file size limit of 50G (applies to all datasets) - although I suppose an extreme corner case could probably be devised/uncovered. The point is that a routine RNA-seq experiment shouldn't present a problem. Special tuning to speed up and handle depth has been a big area of recent development activity. If you do run into issues with Trackster, feel free to ask questions (I'll probably ask you to send a share link to the visualization on the pubic main Galaxy instance and bring Jeremy into the discussion if the solution is not clear). Please let us know if you need more help, Jen Hello, I am attempting to run accepted hit data from Tophat output into the UCSC Genome Web Browser for visualization of sequencing hits in specific genes. However, the BAM files yield tiles and are too large to present through the browser. Is there a better file format to convert to that would allow better visualization such as histograms? Any tips or solutions on how to obtain histograms from sequencing data would be very welcome. Thanks Trent Fowler ___________________________________________________________ The Galaxy User list should be used for the discussion of Galaxy analysis and other features on the public server at usegalaxy.org. Please keep all replies on the list by using "reply all" in your mail client. For discussion of local Galaxy instances and the Galaxy source code, please use the Galaxy Development list: http://lists.bx.psu.edu/listinfo/galaxy-dev To manage your subscriptions to this and other Galaxy lists, please use the interface at: http://lists.bx.psu.edu/ -- Jennifer Jackson http://galaxyproject.org
ADD REPLYlink written 6.5 years ago by Fowler, Trent20
0
gravatar for Jennifer Hillman Jackson
6.5 years ago by
United States
Jennifer Hillman Jackson25k wrote:
Hi Trent, I realized that part of your question went unanswered: Any track loaded into any of your histories can be added to a visualization. Click on the "plus" icon to add tracks. See addtrack1.png. You will be able to navigate by default through your active history, but can also move to any of your histories or the Shared Data Libraries, to select datasets (one or more) and import. See addtrack2.png. The visualization tool recently went through an intensive development cycle. Documentation/tutorials to cover the full functionality is coming. Until then, please let us know if you need pointers when exploring the UI. Best, Jen Galaxy team -- Jennifer Jackson http://galaxyproject.org
ADD COMMENTlink written 6.5 years ago by Jennifer Hillman Jackson25k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 168 users visited in the last hour