Question: Problem With Cuffcompare
0
gravatar for Chandu Galaxy
7.2 years ago by
Chandu Galaxy30 wrote:
Hello, I'm using Galaxy tool from about two months. I'm very new to this community. I ran into some problems when using the tools for NGS: RNA Analysis. I would like to get from the community. Here is the description of the problem. I've four different datasets (fastqsanger format). I ran the NGS: RNA Analysis tools like "Tophat for Illumina", "Cufflinks", "Cuffcompare". The Tophat and cufflinks tools are working fine but the cuffcompare is giveing out an error message like the below. Dataset generation errors *Dataset 57: Cuffcompare on data 20, data 23, and others: combined transcripts* Tool execution generated the following error message: Error running cuffcompare. Warning: Your version of Cufflinks is not up-to-date. It is recommended that you upgrade to Cufflinks v1.1.0 to benefit from the most recent features and bug fixes (http://cufflinks.cbcb.umd.edu). No fasta index found for ./input1. Rebuilding, please wait.. Error: sequence lines in a FASTA record must have the same length! The tool produced the following additional output: cuffcompare v1.0.3 (2403) cuffcompare -o cc_output -s ./input1 ./input2 ./input3 ./input4 Can anyone help me to figure out what the problem is? Let me know if you need more information. Thank you. -- Chandrasekhar
rna-seq cufflinks • 1.6k views
ADD COMMENTlink modified 7.1 years ago • written 7.2 years ago by Chandu Galaxy30
0
gravatar for Jeremy Goecks
7.1 years ago by
Jeremy Goecks2.2k
Jeremy Goecks2.2k wrote:
Chandu, Cufflinks/compare/diff requires that your reference genome dataset have the following format: AGTTACCGGTAGTTACCGGTAGTTACCGGTAGTTACCGGTAGTTACCGGTAGTTACCGGT AGTTACCGGTAGTTACCGGTAGTTACCGGTAGTTACCGGTAGTTACCGGTAGTTACCGGT AGTTACCGGTAGTTACCGGTAGTTACCGGTAGTTACCGGTAGTTACCGGTAGTTACCGGT ... Note that all lines of sequence data have the same length. The problem you're seeing is because there are lines in your sequence data that are not the same length, e.g. AGTTACCGGTAGTTACCGGTAGTTACCGGTAGTTACCGGTAGTTACCGGTAGTTACCGGT AGTTACCGGTAGTTACCGGTAGTTACCGGTAGTTACCGGTAGTTACCGGTAGTTA AGTTACCGGTAGTTACCGGTAGTTACCGGTAGTTACCGGTAGTTACCGGTAGTTACCG ... The FASTA Width tool in Galaxy can help you format your dataset correctly. Good luck, J.
ADD COMMENTlink written 7.1 years ago by Jeremy Goecks2.2k
Thank you for the response. I can't check my reference genome dataset because I'm using reference provided by Galaxy (*Mosquito (Anopheles gambiae): AgamP3*). Is there any solution? Thank you. -- Chandu
ADD REPLYlink written 7.1 years ago by Chandu Galaxy30
Chandu, Are you running your analysis on our public server ( main.g2.bx.psu.edu )? If so, can you share your history me please (Options-->Share/Publish-->Share with a User-->my email address). Thanks, J.
ADD REPLYlink written 7.1 years ago by Jeremy Goecks2.2k
0
gravatar for Chandu Galaxy
7.1 years ago by
Chandu Galaxy30 wrote:
Thank you very much Jeremy. Can I have a look at the re-ran datasets?
ADD COMMENTlink written 7.1 years ago by Chandu Galaxy30
Chandu, I've deleted my copy of your history to save space as your history was quite large. Please rerun Cuffcompare with the modifications I suggested below. Thanks, J.
ADD REPLYlink written 7.1 years ago by Jeremy Goecks2.2k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 177 users visited in the last hour