Hi All,
I have some paired end stranded RNAseq data prepared using an Illumina TruSeq Stranded mRNA kit. I have mapped the reads to the genome using HISAT.
Based on my understanding of the SAM flags, concordant reads should be assigned XS values as follows:
If gene is on the forward strand of the genome:
read1 maps to reverse strand of genome, SAM flag = 83, XS:A:+
read2 maps to forward strand of genome, SAM flag = 163, XS:A:+
If gene is on the reverse strand of the genome:
read1 maps to forward strand of genome, SAM flag = 99, XS:A:-
read2 maps to reverse strand of genome, SAM flag = 147, XS:A:-
For some reason HISAT is assigning reads with a SAM flag = 147 an XS value of '+'
We've mapped the same data to the same genome with TopHat and the XS values are correct.
Is this a bug in HISAT?
Thanks,
Matt Nicotra
University of Pittsburgh
Upon further investigation, it seems like HISAT2 is being run with the --rna-strandness parameter set to R and not RF, despite the fact that we selected paired end reads.
Here's the information from Galaxy for the HISAT run:
But here's the information from the Header of the SAM/BAM file:
Shouldn't --rna-strandness be RF and not R?