Question: Cufflinks assembles very small number of transcripts from RNA-seq data
gravatar for jaeil.han
3.0 years ago by
United States
jaeil.han10 wrote:

I mapped RNA-seq data to reference genome and tried to assemble transcripts using cufflinks.It is ribosomal RNA depleted samples from yeast RNA.

In the mapped BAM files, I can clearly see many reads in many noncoding RNA genes using genome browser, but they are all missing in the results of cufflinks with either bias correction or without it.

I cannot find what the reason is. Please give me any advice.



rna-seq cufflinks • 1.1k views
ADD COMMENTlink modified 3.0 years ago by Jennifer Hillman Jackson25k • written 3.0 years ago by jaeil.han10
gravatar for Jennifer Hillman Jackson
3.0 years ago by
United States
Jennifer Hillman Jackson25k wrote:


If the goal is to identify only non-coding transcripts, or even just some specific subset of transcripts, the option for including a Mask File containing transcripts that are not of interest or expected to be expressed in higher abundance (known, common) could be used. Find this option under Advanced Parameters.

Related gotcha: If coding transcript (only) reference annotation is used with Cufflinks "as truth" (instead of as a guide), the result could resemble what is described. Bring up the reference annotation along with the BAM hits in the visualization. If there is no overlap (expected), then use annotation instead "as a guide" or not at all.

Other factors could include how well paired-end reads mapped in "concordance" from Tophat, but those sorts of issues would impact all experiments, not just those seeking out less abundant or novel transcripts.

Best, Jen, Galaxy team


ADD COMMENTlink written 3.0 years ago by Jennifer Hillman Jackson25k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 172 users visited in the last hour