Cuffdiff-differential gene expression-NOTEST

Question: Cuffdiff-differential gene expression-NOTEST

3.8 years ago by

Germany

raj_262920 • 20 wrote:

I am just starting up and getting to know and analyze my data from the RNA-seq. I have 2 biological replicates for 2 different conditions, Pair-end sequenced. The workflow is as follows: I ran FastQgroomer on all my reads, then I used these reads to align to a reference genome using Tophat2. Tophat 2 analysis showed about 80 % alignment with the reference. Then I ran Cufflinks on the accepted hits from Tophat2 with the reference genes and genome. I verfied the cufflinks output files and I can see FPKM values for the two different conditions and replicates. After this i used cuffmerge to merge the transcripts from all the cufflinks outputs. so i merged the four assembled transcripts files from cufflinks( 2 conditions and 1 replicate for each conditions). Then i ran cuffdiff to find differenctial expresssions and so on, using the merged transcript file from cuffmerge and i also used the reference genome. In the cuffdiff output..differential gene/transcript expression tracking files..I see the FPKM is totally zero and i see 'NOTEST' for all the samples.

What does that mean ? I dont think that means that there are no significanlty expressed genes between the samples ! . When i compare the FPKM values from cufflinks, there is big differene in FPKM values for the 2 conditions !

What can be done to solve this ? Please suggest !

Thank you and Best Regards

rna-seq galaxy • 3.2k views

ADD COMMENT • link •

modified 3.8 years ago by Jennifer Hillman Jackson ♦ 25k • written 3.8 years ago by raj_262920 • 20

I am having the same problem and I am trying to use the .gtf annotation file with the merged.gtf file from Cuffmerge on the Cufflinks. Does someone knows how to use both together in a workflow???

ADD REPLY • link written 2.8 years ago by juafonso_bio • 0

3.8 years ago by

Jennifer Hillman Jackson ♦ 25k

United States

Jennifer Hillman Jackson ♦ 25k wrote:

Hello,

This indicates that there is not enough data to detect significant differential expression between the samples. Using a reference annotation dataset (GTF, GFF3) with Cuffmerge could help group the data together. Use the iGenomes version, if one is available for your genome. More here:
http://cole-trapnell-lab.github.io/cufflinks/cuffdiff/index.html#differential-expression-tests

Thanks, Jen, Galaxy team

ADD COMMENT • link written 3.8 years ago by Jennifer Hillman Jackson ♦ 25k

Hi Jennifer

Thanks ! So you meant that i should use in cuffmerge , the first transcript file (GTF file produced by cufflinks), could be the gtf annotation file from my reference ? and my other cufflinks generated transcripts as additional GTF files ?

If so, i should say 'no' to the option 'use reference annotations' ? or did you actually meant to use a reference annotation file here ? I actually used one before !

Thanks so much !

ADD REPLY • link modified 3.8 years ago • written 3.8 years ago by raj_262920 • 20

On the tool form, enter the Cufflinks output files for the option "GTF file produced by Cufflinks:", then also use the option "Use Reference Annotation:" as "yes" and include it there when the form resets. Hopefully that helps, Jen, Galaxy team

ADD REPLY • link written 3.8 years ago by Jennifer Hillman Jackson ♦ 25k

Oh, I didn't see that you had already used an annotation dataset. The data could just be too sparse, or that could potentially be contributing to the problem in Cuffdiff. Double check that the chromosome identifiers in the reference annotation file are an exact match for those in the reference genome used. If they do not match (are not based on the same exact reference genome), many issues can arise.
http://wiki.galaxyproject.org/Support#Reference_genomes

ADD REPLY • link modified 3.8 years ago • written 3.8 years ago by Jennifer Hillman Jackson ♦ 25k

Similar posts • Search »