Question: Cufflinks duplicate GFF ID error
gravatar for jingshuowang
3.4 years ago by
jingshuowang10 wrote:

I think this is an old but common problem that many people encountered when running the cufflinks with the ENsemble annotation references.

I have modified the annotation file download from Ensembl (GRCh37.V75). When I selected the option -G (use reference annotation), the cufflinks could successfully ran. However, when I selected the option -g (use reference annotation as guide), the cufflinks running was failed, showing the  duplicate GFF ID error. Does anybody have the same problems, or you may give me some suggestions.explanations on such problems.

I used the canonical genome reference GRCh37_Ensemble_75 (without unplaced contigs). 

rna-seq cufflinks • 1.2k views
ADD COMMENTlink modified 3.4 years ago by Jennifer Hillman Jackson25k • written 3.4 years ago by jingshuowang10
gravatar for Jennifer Hillman Jackson
3.4 years ago by
United States
Jennifer Hillman Jackson25k wrote:


Duplicated GFF IDs are not accepted as input with this tool. As an alternative, consider the iGenomes version:

Best, Jen, Galaxy team

ADD COMMENTlink written 3.4 years ago by Jennifer Hillman Jackson25k

Hi Jen,

Thanks for suggestion.

I definitely downloaded the reference annotation from iGenome (2014-05-23, the latest version), and made some modification to fit the genome reference (hg19 canonical):

1) change chromosome 1, 2, 3,.. to chr1, chr2, chr3,...

2) remove all of annotation information of unplaced contigs, just keep the annotation information of the "regular" chromosomes (chr1-22, chrX, chrY and chrM). And the order is chr1,chr10, chr11, ...chr19, chr2, chr20, chr21, chr22, chr3, ...chr9, chrM, chrX and chrY.

Is somthing wrong?

ADD REPLYlink written 3.4 years ago by sunnj850
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 66 users visited in the last hour