htseq read counts zero

Heads up! This is a static archive of our support site. Please go to help.galaxyproject.org if you want to reach the Galaxy community. If you want to search this archive visit the Galaxy Hub search

Latest

Open

RNA-Seq

ChIP-Seq

SNP

Assembly

Forum

Home

Welcome to Galaxy Biostar! User support for Galaxy! about • faq • rss

Log In

Sign Up

Question: htseq read counts zero

0

2.2 years ago by

fate.gh • 10

fate.gh • 10 wrote:

Hi,

I have imported some fastq files (homo sapiens) from http://www.ebi.ac.uk/ena/data/ to Galaxy. In order to obtain read counts, I aligned them to hg19 using HiSat (default parameters). Then since my reference genome was hg19, I used GTF file (Version 19 (July 2013 freeze, GRCh37) - Ensembl 74, 75) from Gencode to obtain read counts using htseq.

The total number of counts obtained for features is "10347508" which seems to be ok. While I have lost a number of counts about

__no_feature 2362227 __ambiguous 788874 __too_low_aQual 1001993 __not_aligned 2517255 __alignment_not_unique 3866370

Do you think the result is reasonable?

Something confusing is that from total 57820 genes, the counts for each gene up to gene 18356 are mostly non-zero, but counts for each gene from gene 18356 to gene 57820 are mostly zero (a few of them are non-zero).

Why is that?

Do you think I have to change my GTF file? Which version?

Or do you think I have to consider only the first 18356 genes for DE analysis ?

Thanks

rna-seq annotation read counts htseq • 846 views

ADD COMMENT • link •

modified 2.2 years ago by Jennifer Hillman Jackson ♦ 25k • written 2.2 years ago by fate.gh • 10

1

2.2 years ago by

Jennifer Hillman Jackson ♦ 25k

United States

Jennifer Hillman Jackson ♦ 25k wrote:

Hello,

Check for a mismatch between the chromosome names in the inputs. This prior Q&A explains: https://biostar.usegalaxy.org/p/18171/

A reference GTF file for hg19 with chromosome identifiers that match the natively indexed hg19 can be obtained from UCSC or iGenomes. https://galaxyproject.org/support/chrom-identifiers/

Best, Jen, Galaxy team

ADD COMMENT • link modified 18 months ago • written 2.2 years ago by Jennifer Hillman Jackson ♦ 25k

Please log in to add an answer.

Similar posts • Search »

DEXseq count reads tool obtaining only zero counts for all exons
I'd like to run DEXseq on my RNA-seq dataset to identify any potential differences in exon inclus...
htseq-count no read align
Hi I am trying to get raw count of mapped read, so I am running htseq-count on tophat accepted h...
RNA-seq analysis in Galaxy
I am doing RNA-seq analysis for several mouse samples and I encounter problems during differentia...
De novo transcriptome assembly and reference guided transcriptome assembly
Hi, I have four related questions about de novo RNAseq data analysis. I have 4 RNAseq data obtai...
htseq-count no hits
Hello, I'm having quite abit of trouble handling my RNAseq fastq files. These were originally ob...
htseq-count obtains zero counts
I am using the following command: htseq-count -s no -a 0 FourA.sam hg19.gtf > FourA.count an...
Primer Contamination, Miranalyzer
Hi Galaxy, Ive got 2 problems for you; 1) Ive got microRNA Illumina NGS data that I want to ana...
HELP! My HT seq table counts are in zero
Hello!! I am running HT seq using my sam files (obtained from bowtie) and the gtf file I obtaine...
de novo aseembly counting
Hi, I have assembled de novo reference gene and mapped it to tophat now i want to calculate t...
Reads are 0 in htseq-count
Hello, I have a trouble using htseq-count, I used TopHat to aling my data, but when I tried to c...
Galaxy HTSeq tool give zero counts
Hi, I tried galaxy HTseq tool on BAM file generated by HISAT2 to generated counts. I provided GT...
All reads are 0 in htseq-count
Hello, I have a trouble using htseq-count, I used TopHat to aling my data and it looks ok but wh...
htseq-count with gff error
Dear all. Please take a look at my errors: http://goo.gl/Oa2IxR In the link, you will see 2 ima...
Duplicate row names error with EdgeR on FeatureCounts files
Hi all, I created tabular count files using FeatureCounts with a GTF file (iGenomes, UCSC hg38) ...
Alignment rate differes using hg19 and hg38
I have some RNA-seq fastq files.I'm using HiSat to align the fastq files. To do so, I use the bui...
HTSeq-Count Error on Cloudman
Hi I'm working on Cloudman doing RNASeq transcriptome analysis. I've made BAM file using TopHat2...
Impossible to use Htseq-count on BAM files from Tophat2
Hello, I'm currently facing troubles using galaxy. I want to compare differentially expressed ...
Table With Gene Count Reads
Hi, I was wondering if there is any tool on Galaxy were I can obtain a table with how many rea...
Cufflinks: reference annotation file for Nile Tilapia (Oreochromis_niloticus)
I want to do differential gene expression analysis on some Nile Tilapia RNA-Seq data using the Cu...
RNA genes count using HTseq
Hello: I am new to Galaxy, but based on my understanding of how things work, I mapped some RNA-s...

Content

Help

About
FAQ

Access

RSS
Stats
API

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by Biostar version 16.09

Traffic: 175 users visited in the last hour