Paired-end insert size upper limit

Heads up! This is a static archive of our support site. Please go to help.galaxyproject.org if you want to reach the Galaxy community. If you want to search this archive visit the Galaxy Hub search

Latest

Open

RNA-Seq

ChIP-Seq

SNP

Assembly

Forum

Home

Welcome to Galaxy Biostar! User support for Galaxy! about • faq • rss

Log In

Sign Up

Question: Paired-end insert size upper limit

0

4.0 years ago by

lukacdm • 0

United States

lukacdm • 0 wrote:

I am mapping paired-end reads using Bowtie2 and setting "maximum insert size for valid paired-end alignments" to 500 bases. However, when I calculate the insert sizes of the resulting bam files using Picard Insert Size Metrics I frequently see one or two inserts per file that exceed 20000 bases. Is the Bowtie2 mapping incorrectly returning very large inserts, or is the Picard software mis-analyzing the bam file? If these large inserts are really coming from incorrect mapping, how can I remove them from the bam files so my downstream analyses are not affected?

picard insert size metrics bowtie2 • 1.9k views

ADD COMMENT • link •

modified 4.0 years ago • written 4.0 years ago by lukacdm • 0

0

4.0 years ago by

Jennifer Hillman Jackson ♦ 25k

United States

Jennifer Hillman Jackson ♦ 25k wrote:

Hello,

This is just how the data mapped first-pass. Downstream analysis/summary tools will not consider the data as valid - so you can just leave these in the input.

Best, Jen, Galaxy team

ADD COMMENT • link modified 4.0 years ago • written 4.0 years ago by Jennifer Hillman Jackson ♦ 25k

0

4.0 years ago by

lukacdm • 0

United States

lukacdm • 0 wrote:

Thanks, Jennifer. Will peak calling programs also ignore the long inserts?

ADD COMMENT • link written 4.0 years ago by lukacdm • 0

The two peak calling tools on the public Main Galaxy instance at http://usegalaxy.org only accept single-end input. So you will be running a tool on a local/cloud for this analysis.

There should be an option for this with most tools, but this is more educated guess than fact. If you are in doubt, there is usually a link to the 3rd party tool documentation on the execution/help form that will help you to determine the proper usage for the tool you are using.

Best, Jen, Galaxy team

ADD REPLY • link written 4.0 years ago by Jennifer Hillman Jackson ♦ 25k

0

4.0 years ago by

lukacdm • 0

United States

lukacdm • 0 wrote:

Thanks so much. I usually use other galaxy instances that have appropriate Peak Calling programs.

ADD COMMENT • link written 4.0 years ago by lukacdm • 0

Please log in to add an answer.

Similar posts • Search »

Insert sizes in Galaxy
I am new to RNASeq analysis. I used tophat to generate BAM files from FASTQ files but in the para...
Filter BAM app in Bam Tools
I'm trying to eliminate inserts larger than 500 bp in a Bam file generated by Bowtie2 using the F...
Error Running The Picard Tool?
Hi, I am trying to determine the mean inner distance between mate pairs, but encountered odd resu...
How to calculate the Average Insert Size after mapping the reads to the reference genome using BWA
Hi everyone, Having mapped the reads (paired-end) to a reference genome using BWA, I am trying t...
Picard tool collect insert size metrics not working
Hi, I am a newbie to RNA Seq analysis and Galaxy. I was trying to calculate the mean inner di...
Bowtie2/FreeBayes/mpileup variant detection on NGS of PCR amplicons around Cas9/CRISPR indels
Hey threre, I have an MiSeq experiment using 24 indices where in each index I was sequencing 3 P...
RNA-Seq Alignment too Big for Galaxy?
Hello, I have relatively large trimmed FASTQsanger files that I want to align via Star and event...
Aligning more than 2 sequences
I have a pair-end data fast1.fq and fast2.fq in (fastq format). Now, I need to align the pair-end...
RNAseq data to be processed in two ways: (i) mapping to de novo Trinity-based transcriptome and (ii) mapping a relatively new genome
Hello all, I am new to RNAseq data and learning this process step by step, so I have a few quest...
Bowtie2/Samtools files are not visualized with Trackster: Fatal error Exit code 255 ()
I am trying to map 8 paired-end reads on to the same reference custom-build genome. Each of these...
SAM-to-BAM error with custom reference
Dear Galaxy Biostar team, I'm seem to be having a problem running Samtools SAM-to-BAM conversion...
using SamToFastq 1.136.0 in Galaxy workflows
Hello, How can I connect in the workflow SamToFastq and Bowtie2 tools in Galaxy? I am trying to ...
How can I improve my mapping alignment rate with MiSeq 2x300 bp paired-end reads?
Edit 1: it appears I have misinterpreted the raw data, which in turn led to my poor results. The ...
MNase ChIP-seq paired-end data analysis ?
Hello everyone! Sorry for long explanation! I did paired-end ChIP-seq for my input and IP sample...

Content

Help

About
FAQ

Access

RSS
Stats
API

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by Biostar version 16.09

Traffic: 169 users visited in the last hour