I have some RNA-seq fastq files.I'm using HiSat to align the fastq files. To do so, I use the built-in genome reference in Galaxy. When I align them to hg19, the alignment rate is much higher than when I align them to hg38. What causes such big difference?
Here this is an example:
* Alignment to hg19:
format bam
database hg19
37328991 reads; of these:
37328991 (100.00%) were unpaired; of these:
3767939 (10.09%) aligned 0 times
16947652 (45.40%) aligned exactly 1 time
16613400 (44.51%) aligned >1 times
89.91% overall alignment rate
[bam_sort_core] merging from 24
* Alignment to hg38:
format bam
database hg38
37328991 reads; of these:
37328991 (100.00%) were unpaired; of these:
11718927 (31.39%) aligned 0 times
15954103 (42.74%) aligned exactly 1 time
9655961 (25.87%) aligned >1 times
68.61% overall alignment rate
[bam_sort_core] merging from 24
What should I do?
Will it cause a problem if I want to obtain read counts for DE analysis using htseq?