Question: Any Thing Wrong With My Cufflink Process In Galaxy?
gravatar for yao chen
7.3 years ago by
yao chen50
yao chen50 wrote:
Dear all: Recently, I run cufflink in galaxy on the internet. I want to compare two samples, However, I found no transcript or gene passed the significant level, even many of them have large FPKM in one sample and 0 FPKM in another sample. Any thoughts? Below is my cufflink process: I have four samples belong to two group. the test have three samples, and the control has one sample. First, using accept_hit.bam from tophat, I run cufflink without annotation on each sample. Then, for the four "gtf" files from four samples, I run cuffcompare to combine these transcript and compare to the annotation genome. However, at this step, I found the transcript accuracy is very low. See one example: Missed exons: 10673/11776 ( 90.6%) Wrong exons: 1254/2007 ( 62.5%) Missed introns: 8529/8637 ( 98.7%) Wrong introns: 2/5 ( 40.0%) Missed loci: 0/504 ( 0.0%) Wrong loci: 1248/2002 ( 62.3%) at last, I run cufdiff between this two group sample. Thank you.
galaxy • 1.5k views
ADD COMMENTlink modified 7.3 years ago by Jeremy Goecks2.2k • written 7.3 years ago by yao chen50
gravatar for Jeremy Goecks
7.3 years ago by
Jeremy Goecks2.2k
Jeremy Goecks2.2k wrote:
Yao, It's difficult to tell what's wrong without seeing your analysis. However, you may want to use the reference annotation during the Cufflinks phase to either estimate isoform expression or guide assembly (this option will appear on our public server soon). Read the Cufflinks documentation to understand these options and what they do for your assembly and FPKM values: De novo assembly from mapped reads is often somewhat imprecise and incomplete, especially for low-coverage data. It's not surprising that a de novo assembly doesn't match especially well with the reference. If you're still not seeing any differential expression after using the reference GTF in Cufflinks, Cuffcompare, and Cuffdiff, you may want to email the Cufflinks/compare/diff authors and ask for some pointers: Good luck, J.
ADD COMMENTlink written 7.3 years ago by Jeremy Goecks2.2k
Hi Jeremy, I tried to run cufflinks to assemble transcripts after running Tophat against my own reference. This error was encountered. What was wrong? How to fix it? Error running cufflinks. [21:01:14] Inspecting reads and determining fragment length distribution. Processed 11130 loci. Warning: Using default Gaussian distribution due to insufficient paired-end reads in open ranges. It is recommended that correct paramaters (--frag-len-mean and --frag-len-std-dev) be provided. [21:01:18] Assembling transcripts and estimating abundances. Processed 11130 loci. [21:01:22] Loading reference annotation and sequence. No fasta index found for ref.fa. Rebuilding, please wait.. Error: sequence lines in a FASTA record must have the same length! Thanks Jiannong
ADD REPLYlink written 7.3 years ago by Jiannong Xu30
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 171 users visited in the last hour