Hi! I am attempting to perform a htseq-count on a HISAT2 aligned read BAM file containing aligned GFF3 and FASTQ data. I am searching for miRNAs, and am aware that the 'feature type' is going to be 'nc_RNA', but I cannot determine the ID attribute to match this. Under the 'annotations' tab in the GFF3 file, three separate annotations are used, and are structured as follows: ID=Cluster_#;DicerCall=_;MIRNA=_. What should I type into the ID Annotation box when performing htseq-count so that it works? Any advice is appreciated!!
Heads up! This is a static archive of our support site. Please go to help.galaxyproject.org if you want to reach the Galaxy community. If you want to search this archive visit the Galaxy Hub search
Question: Non-coding RNA feature type/attribute
7 months ago by
exd5048 • 0
exd5048 • 0 wrote:
ADD COMMENT • link •modified 7 months ago by Jennifer Hillman Jackson ♦ 25k • written 7 months ago by exd5048 • 0
7 months ago by
Jennifer Hillman Jackson ♦ 25k
Jennifer Hillman Jackson ♦ 25k wrote:
The "ID Attribute" is used by the tool to group together annotation lines. My guess is that in your data the common attribute is "MIRNA".
Help from the tool form:
GFF attribute to be used as feature ID. Several GFF lines with the same feature ID will be considered as parts of the same feature. The feature ID is used to identity the counts in the output table. All features of the specified type MUST have a value for this attribute. The default, suitable for RNA-Seq and Ensembl GTF files, is gene_id. (--idattr)
Thanks, Jen, Galaxy team
ADD COMMENT • link written 7 months ago by Jennifer Hillman Jackson ♦ 25k
Please log in to add an answer.
Powered by Biostar version 16.09
Traffic: 176 users visited in the last hour