Question: DanRer10 Effective Genome Size?
gravatar for amaheras7091
23 months ago by
amaheras709110 wrote:


I'm trying to use bamCoverage to convert my TopHat bam files into bigwig files for visualization on UCSC genome browser. bamCoverage asks for the Effective Genome Size and if I'm using the zebra fish DanRer10 genome, does anyone know what the effective genome size would be for that?

Also, this is the information it gives about what the effective genome size is: "The effective genome size is the portion of the genome that is mappable. Large fractions of the genome are stretches of NNNN that should be discarded. Also, if repetitive regions were not included in the mapping of reads, the effective genome size needs to be adjusted accordingly. See Table 2 of or for several effective genome sizes."

I cam across the number 1371719383, but I think that is the total length rather than just the mappable length?

Thanks for any help!

rna-seq tophat galaxy • 796 views
ADD COMMENTlink modified 23 months ago by Jennifer Hillman Jackson25k • written 23 months ago by amaheras709110
gravatar for Jennifer Hillman Jackson
23 months ago by
United States
Jennifer Hillman Jackson25k wrote:


See the Assembly Statistics tab here:

Repetitive regions are not reported, as this can be a variable number depending on how repeats are categorized and masked. However, the danRer10 repeat tracks at UCSC ( could be reviewed, the appropriate one used, and the coverage subtracted.

Any that come up with a number (amaheras7091 or other readers), please share that back along with methodology/assumptions as a follow-up post. Other sources for this data (pre-calculated) are also welcome.

Best, Jen, Galaxy team

ADD COMMENTlink modified 23 months ago • written 23 months ago by Jennifer Hillman Jackson25k


On the UCSC Genome Browser (, there are several tracks that are under "Variation and Repeats" (Interrupted Rpts, Microsatellite, RepeatMasker, Simple Repeats, WM + SDust) and I looked at the summary statistics for each. In order to choose which one(s) to subtract from the total nucleotide length, I did the same process for the human genome which has a known effective genome size of 2,451,960,000. However, subtracting the "item bases" from any one (or a combination) of "Repeat" tracks from the total number of nucleotides 3,209,286,105 did not yield the known effective genome size. Any further assistance on how to calculate the effective genome size would be greatly appreciated.

ADD REPLYlink written 22 months ago by amaheras709110
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 111 users visited in the last hour