Making paired-end reads the same length

Heads up! This is a static archive of our support site. Please go to help.galaxyproject.org if you want to reach the Galaxy community. If you want to search this archive visit the Galaxy Hub search

Latest

Open

RNA-Seq

ChIP-Seq

SNP

Assembly

Forum

Home

Welcome to Galaxy Biostar! User support for Galaxy! about • faq • rss

Log In

Sign Up

Question: Making paired-end reads the same length

0

3.0 years ago by

mtp1v12 • 0

United Kingdom

mtp1v12 • 0 wrote:

I have paired-end data (in two separate files). I have groomed my files, and would like to filter the reads for quality. However, the read lengths for some of the paired reads are not equal. If I filter the files independently, the mapping fails as the two files contain different numbers of reads. If I use FASTQ Joiner to merge the data, then filter, I cannot use FASTQ Splitter as some reads (~44%) can't be split due to unequal read lengths. Any help gratefully received. For example, is there a way to trim reads so that the paired reads are the same length? I should say that I am a Galaxy newcomer, so go easy...!

fastq bowtie galaxy filter • 2.2k views

ADD COMMENT • link •

modified 3.0 years ago • written 3.0 years ago by mtp1v12 • 0

0

3.0 years ago by

Jennifer Hillman Jackson ♦ 25k

United States

Jennifer Hillman Jackson ♦ 25k wrote:

What mapper are you using? Is the data Illumina? That has been previously manipulated to produce variable length reads?

Fastq files can be filtered so that both the forward and reverse inputs contain the same exact reads, but this is usually not necessary at the mapping step.

ADD COMMENT • link written 3.0 years ago by Jennifer Hillman Jackson ♦ 25k

Thank you for the reply Jennifer. I'm using Bowtie2 to map. The data is Illumina. I think I have the raw reads (sent from collaborator in Japan). I have used FASTQ groomer only, i'm not aware of any other manipulation. Thank you for your help.

ADD REPLY • link written 3.0 years ago by mtp1v12 • 0

Using Bowtie2, the content of the two fastq input files for paired-end mapping does not need to be identical.

Perform QA steps before the mapping run on the individual datasets.

Then filter the resulting BAM dataset after the run for properly paired mapped reads, etc.

If Bowtie2 is giving you an error - it is likely a format problem with fastq data itself. If you want help about that, let us know.

Thanks, Jen

ADD REPLY • link modified 3.0 years ago • written 3.0 years ago by Jennifer Hillman Jackson ♦ 25k

Please log in to add an answer.

Similar posts • Search »

Remove "Unpaired" Reads From Quality-Filtered Pared-End Fastq Files.
Hi there, I obtained two fastq files from GA paired end run. I filtered each file by quality usi...
Splitting Biased Paired End FASTQ data
The FASTQ splitter in Galaxy warns that the tool only works if Paired-end data has equal length r...
Trimming And Thinning Of Paired-End Reads
Hi galaxy users, I've been experiencing some problems trimming the adapters and filtering by qua...
Pre-Processing Of Illumina Rna-Seq Paired End Data
Hello, I have Illumina 76bp paired end data for a zebrafish RNA-seq experiment and am basically ...
Fastq Joiner Fails To Join Pe Data.
Hi, I have HiSeq2000 paired end sequence data in two separate FASTQ files. I need to filter t...
Splitting pair end file
This is the type of file for pair end DNA sequence I have: @MONK:275:C1Y6RACXX:1:1101:1169:1...
Combining The Paired Reads From Illumina Run
Hi, I have two fastq files with the forward(/1) and reverse(/2) paired reads. The reads are not ...
Fastq Joiner
I am trying to join two groomed fastq files from a paired-end Illumina read using the fastq joine...
Help with downloading files from EBI SRA
I am new to galaxy and have been trying to figure out how to align RNA-seq reads to a genome in o...
splitting paired end fastq reads on galaxy
Hi, I'm new to galaxy. While working on a paired end data set, I noticed that there is only one f...
Fastq Splitter Empty And Fastq Manipulator Doesn'T Work
I am having the same issue as this user: http://user.list.galaxyproject.org/FASTQ-splitter-produ...
Read length from HT-Seq
Hi, Sorry for asking this very basic question. I have paired-end RNA-Seq data. After initial ada...
Illumina HiSeq reads and Fastq joiner
Hello, I have paired-end reads (Illumina HiSeq) from a metagenome in two Fastq files. These non...
Fastq Joiner Problem
Hi, I am trying to join two groomed fastq files from a paired-end Illumina read using the fastq j...
Any tools for separate unpaired reads in paired-end sequencing fastq files?
Hi, I would like to know if there is any tool can do the following job? I have some data files ...
Barcode Splitter On Paired End Illumina Reads
Dear Galaxy team I am so sorry for repeatedly posting the same question, but I do need some inp...
Help with interpreting RNA-seq output
I am new to galaxy and have been trying to figure out how to align RNA-seq reads to a genome in o...
How can I improve my mapping alignment rate with MiSeq 2x300 bp paired-end reads?
Edit 1: it appears I have misinterpreted the raw data, which in turn led to my poor results. The ...
paired-end sequencing or ChIA-PET
I want to look at original ChIA-PET datasets (FASTQ). It's a kind of paired-end sequencing. The s...
What is the correct regular expression for replacing inconsistent sequence and quality identifiers?
Hello, I am trying to analyze RNA-Sequence data. The SRA accession number for the file that I am...
Fastq Quality Trimmer For Pe Illumina?
Hello, I was wondering, can Galaxy's FASTQ Quality Trimmer tool be used on Illumina paired-end d...

Content

Help

About
FAQ

Access

RSS
Stats
API

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by Biostar version 16.09

Traffic: 169 users visited in the last hour