variable, low alignment rates with HISAT2

Question: variable, low alignment rates with HISAT2

stephanie.major • 10 wrote:

Hello,

I have some single-end mouse RNA sequencing data that I would like to analyze with Galaxy. I originally used TopHat for alignment and was getting pretty low alignment rates (30-40%) for all of my files. After reading on this forum and elsewhere, I saw that HISAT2 should be used instead, and I thought it might improve my percentages since it is a more sensitive tool. However, after running my files through it, some of my results improved, some stayed about the same, and some got worse. My overall alignment percentages range from as low as 18% to a high of around 80%, which still seems fairly low compared to what I expected and what I have seen in others' projects. I am using the indexed reference genome (mm10). My files are FASTQ with Illumina 1.8+ quality scaling.

Some additional notes:

The alignment rates are not much different with trimming; some are even lower than they were without it.

I started with default parameters, then read the HISAT manual and changed a few parameters that I thought might improve the alignment percentage, but they did not.

I BLASTed some of the unaligned sequences from the samples with low alignment percentages. They either returned no significant results or were mouse rRNA sequences, which leads me to believe that the polyA enrichment did not work as well as it could or should have.

The fastq files are concatenated datasets, since each sample was run in 4 lanes. I also tried aligning one sample's four files separately with HISAT2 instead of as a concatenated set, but each of the four files had a very low alignment percentage (about 4%), rather than only one of them aligning poorly.
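When rates vary this much between samples, it can help to collect the overall rate from each HISAT2 alignment summary into one place for comparison. The following is a minimal sketch, assuming the summary ends with the usual "overall alignment rate" line; the function name and the example summary text are made up for illustration and are not taken from the poster's data.

```python
import re

# Matches the final line of an alignment summary,
# e.g. "18.00% overall alignment rate".
OVERALL_RE = re.compile(
    r"^\s*(\d+(?:\.\d+)?)% overall alignment rate\s*$", re.MULTILINE
)


def overall_alignment_rate(summary_text):
    """Return the overall alignment rate in percent, or None if not found."""
    match = OVERALL_RE.search(summary_text)
    return float(match.group(1)) if match else None


# Hypothetical summary text for a single-end sample (illustration only).
example_summary = """\
100000 reads; of these:
  100000 (100.00%) were unpaired; of these:
    82000 (82.00%) aligned 0 times
    15000 (15.00%) aligned exactly 1 time
    3000 (3.00%) aligned >1 times
18.00% overall alignment rate
"""

print(overall_alignment_rate(example_summary))
```

Running this over the summary of every sample, and of every lane file where those were aligned separately, gives a quick table showing whether the low rates track particular lanes or particular samples.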

I just wanted to ask if anyone had any suggestions on how to go about working with this - my sequencing core suggested demultiplexing the files and generating fastq manually with bcl2fastq (the ones I am currently using were generated on basespace), so I am working on that, but I wanted to get some other suggestions in case that is not successful.

Thank you in advance!!

rna-seq alignment hisat2 • 991 views

modified 11 months ago by ellascottgm • 0 • written 11 months ago by stephanie.major • 10

Jennifer Hillman Jackson ♦ 25k (United States) wrote:

Hello,

The very low mapping rates do sound as if they are related to the fastq content.

Have you run FastQC? This Galaxy tutorial explains how to run it and interpret the results: https://galaxyproject.org/learn/ > NGS logistics. Any problems could then be reported back to those doing the library prep/sequencing to troubleshoot or optimize methods.
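Alongside FastQC, a basic structural check of the fastq content can rule out problems introduced when lane files were generated or concatenated. This is only a sketch, assuming plain 4-line FASTQ records (no wrapped sequences); `open_fastq` and `check_fastq` are hypothetical helper names, not Galaxy or FastQC functions.

```python
import gzip
import io


def open_fastq(path):
    """Open a FASTQ file as text, handling gzip-compressed files by extension."""
    if path.endswith(".gz"):
        return gzip.open(path, "rt")
    return open(path, "r")


def check_fastq(handle):
    """Return (record_count, problem_count) for a 4-line-per-record FASTQ stream.

    A record counts as a problem if its header does not start with '@',
    its separator line does not start with '+', or its sequence and
    quality strings differ in length.
    """
    records = 0
    problems = 0
    while True:
        header = handle.readline()
        if not header:
            break  # end of file
        if not header.strip():
            continue  # skip stray blank lines
        seq = handle.readline().rstrip("\r\n")
        plus = handle.readline()
        qual = handle.readline().rstrip("\r\n")
        records += 1
        if (
            not header.startswith("@")
            or not plus.startswith("+")
            or len(seq) != len(qual)
        ):
            problems += 1
    return records, problems


# Made-up two-record example; the second quality string is one character short.
example = io.StringIO(
    "@read1\nACGT\n+\nIIII\n"
    "@read2\nACGTA\n+\nIIII\n"
)
print(check_fastq(example))
```

For concatenated datasets, the record count of the combined file should equal the sum of the counts from the four lane files, and the problem count should be zero in all of them.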

Thanks! Jen, Galaxy team

written 11 months ago by Jennifer Hillman Jackson ♦ 25k
