Disk space needed for TopHat2

Question: Disk space needed for TopHat2

ChickenRNA • 50 wrote:

Hi, I am trying to run an RNA-Seq analysis on a local instance of Galaxy. I have almost 500 GB of decompressed raw data. Is there a way to find out how much hard disk space I would need to run TopHat and Cufflinks on my local instance? I don't want to start the analyses and have them stop due to lack of space. Thanks

rna-seq tophat galaxy local • 621 views

modified 2.2 years ago by y.hoogstrate • 460 • written 2.2 years ago by ChickenRNA • 50

Jennifer Hillman Jackson ♦ 25k (United States) wrote:

Hello,

It is difficult to extrapolate the output size from a given input size. There are many factors: parameters, target genome, duplication in the input FASTQ datasets, etc. The output will be about the same size as it would be if the tool were run on the command line.

That said, if you have multiple dataset pairs to run that you believe have about the same content, and each will use the same run-time settings during mapping, perhaps execute TopHat with just one pair first and then use that result to estimate how much disk space each pair will consume.
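The pilot-run idea above can be sketched in a few lines of Python. This is only a rough illustration, not part of Galaxy or TopHat: the function names, the directory layout, and the `safety_factor` margin are all assumptions, and the estimate is only as good as the assumption that the remaining pairs behave like the pilot pair.

```python
import math
import os


def dir_size_bytes(path):
    """Sum the sizes of all regular files under `path` (symlinks skipped)."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            full = os.path.join(root, name)
            if os.path.isfile(full) and not os.path.islink(full):
                total += os.path.getsize(full)
    return total


def estimate_total_output(pilot_input_bytes, pilot_output_bytes,
                          total_input_bytes, safety_factor=1.5):
    """Scale the pilot run's output/input ratio up to the full input size.

    `safety_factor` is a hypothetical margin for pairs that turn out
    larger than the pilot; it is not a value recommended by any tool.
    """
    if pilot_input_bytes <= 0:
        raise ValueError("pilot input size must be positive")
    scaled = total_input_bytes * pilot_output_bytes / pilot_input_bytes
    return math.ceil(scaled * safety_factor)
```

For example, one would measure the pilot pair's FASTQ files and its TopHat output directory with `dir_size_bytes`, then pass those two numbers together with the total input size to `estimate_total_output`. Temporary files written during the run are not covered by this estimate.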

You could also try asking (or searching prior Q&A) at the user group for TopHat, but I expect that you will get a similar reply. The contact info is in the right sidebar here: https://ccb.jhu.edu/software/tophat/faq.shtml

Thanks, Jen, Galaxy team

written 2.2 years ago by Jennifer Hillman Jackson ♦ 25k

y.hoogstrate • 460 (Netherlands) wrote:

In my experience, BAM files are usually smaller than the gzipped FASTQ files they were produced from. Example: 2.1 TB of (gzipped!) FASTQ produced 1.3 TB of BAM files using RNA-STAR. Of course, as Jennifer says, many factors play a role here, so this is not a hard rule. However, if you have 500 GB of raw FASTQ, I can't imagine you will need more than another 500 GB for the alignments. Compared to BAM files, the output files of Cufflinks are pretty small.
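To check whether the disk has room before starting, something like the following could be run on the Galaxy server. It is a minimal sketch: the 500 GB figure is just the rough upper bound mentioned above, not a guarantee, and the path `"."` is a placeholder for wherever the local instance actually stores its datasets.

```python
import shutil

GB = 10**9  # decimal gigabyte


def free_bytes(path="."):
    """Free space, in bytes, on the filesystem that contains `path`."""
    return shutil.disk_usage(path).free


def space_shortfall(free, required):
    """Bytes still missing to reach `required`; 0 if there is enough room."""
    return max(0, required - free)


if __name__ == "__main__":
    required = 500 * GB  # assumed upper bound for the alignments
    missing = space_shortfall(free_bytes("."), required)
    if missing:
        print(f"Short by about {missing / GB:.1f} GB")
    else:
        print("Enough free space for the assumed estimate")
```

Keeping the comparison in its own small function makes it easy to plug in a different estimate, for example one derived from a pilot run.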

written 2.2 years ago by y.hoogstrate • 460
