How Much Can I Trimm My Reads

Heads up! This is a static archive of our support site. Please go to help.galaxyproject.org if you want to reach the Galaxy community. If you want to search this archive visit the Galaxy Hub search

Latest

Open

RNA-Seq

ChIP-Seq

SNP

Assembly

Forum

Home

Welcome to Galaxy Biostar! User support for Galaxy! about • faq • rss

Log In

Sign Up

Question: How Much Can I Trimm My Reads

0

6.3 years ago by

Du, Jianguang • 380

Du, Jianguang • 380 wrote:

Dear All, I am analysing RNA-seq datasets for the differential splicing events between cell types. My reads are 36bp long. In order to increase the quality of reads, I need to trim some nucleotides from ends. How many nucleotides can I trim? I am afraid that if I trim too much, the reliability of the alingment will be affected. Thanks in advance. Jianguang

• 771 views

ADD COMMENT • link •

modified 6.3 years ago by Jennifer Hillman Jackson ♦ 25k • written 6.3 years ago by Du, Jianguang • 380

0

6.3 years ago by

Jennifer Hillman Jackson ♦ 25k

United States

Jennifer Hillman Jackson ♦ 25k wrote:

Hello Jianguang, This general protocol is also in the RNA-seq tutorial: http://main.g2.bx.psu.edu/u/jeremy/p/galaxy-rna-seq-analysis-exercise --> Understanding and QCing the reads That said, I had a sample of your data from before and I ran FastQC on it and see what you mean, the quality drops off steadily after the first 10 bases or so, then below phred+20 around the middle of the sequence (for both ends). There are a few options - 1 - Do as Ann suggests and just leave these alone and test to see what happens in TopHat. If the mapping fails, then you will know that you need to do some quality cleanup. 2 - Use the FastQC results to decide on a lower quality score boundary and trim the very worst sequences. Because of the length, yes, take care not to remove too much. As I stated, from the sample I looked at, even phred+20 would probably clip too aggressively. In general it is best to do as little manipulation as possible with expression data. Some testing on your part will be needed to identify the correct processing, and the same process will not apply to all datasets. But the general path outlined in the tutorial is a good one for what you are trying to do and should be able to address your questions. Take care, Jen Galaxy team -- Jennifer Jackson http://galaxyproject.org

ADD COMMENT • link written 6.3 years ago by Jennifer Hillman Jackson ♦ 25k

Please log in to add an answer.

Similar posts • Search »

How can I do trimming off to all nucleotide sequences that end with a specific sequence in my illumine reads?!
Hello everyone, I am new to Galaxy and I was wondering if anybody knows how can I trim all seque...
How do I convert numerals to nucleotides for RNA-Seq mapping?
The FASTQ reads I imported from EBI SRA (SRR064843) appear to be numeric (0-3) rather than AGCT....
genome de novo assembly
Hello, I have a paired-end 150 bp genome sequencing data with a different end quality.According ...
Way to trim all NGS nucleotide sequences to a specific section
Is there a way to trim the nucleotide sequence to a specific region? I know you can by base numbe...
Question
Hi Is it possible to trim a variable number of a specific nucleotide from the 3' ends of fastq R...
how to convert FASTQ to 454 reads
I am totally new at **usegalaxy.org**. I didn't know the difference between FASTQ reads and 454 r...
Re: Quality Based Trimming
Thanks for your reply Daniel. That's right... I did not even think about using the boxplot tool t...
trimming my illumina sequences
Good morning, I am new in use of galaxy and I need help. I tried to trim my raw data from MiSeq u...
RNAseq data to be processed in two ways: (i) mapping to de novo Trinity-based transcriptome and (ii) mapping a relatively new genome
Hello all, I am new to RNAseq data and learning this process step by step, so I have a few quest...
How do i remove multiple adapter sequences from my RNAseq reads?
Hi there, I want to remove the universal adapters as well as the index adapters in each data fil...
Question About Using Bowtie
I am trying to use bowtie to assign reads to the s. Cerevisiae genome. I have data from paired en...
lastz not running
I want to align my reads to a short fasta file to determine how many reads I have that match my t...
Filtering BAM files from HISAT2
Hi. I am new to rna-seq and I have a couple of quick questions. My input was paired-end non-stran...
Help removing part of a sequence
Hi, I am trying to trim out part of a sequence so I can align it better to a reference sequence....
trimgalore on interleaved paired-end fastq files
Hi, I'm analyzing interleaved paired-end fastq files downloaded via fastq-dump. I am trying to r...
How to extract SNPs from galaxy workflow
I exactly followed the below galaxy workflow that contain targetted re-sequencing data for a fath...
Trimming And Thinning Of Paired-End Reads
Hi galaxy users, I've been experiencing some problems trimming the adapters and filtering by qua...
Trinity with multiple datasets
Hello, I am running trinity on 4 data sets (2 left reads, 2 right reads) and I want it to be com...
Help For Trim Sequences
Hi, I am a galazy user and I want to trim exact sequences (not the location) from 5' end. Is ther...
How can I improve my mapping alignment rate with MiSeq 2x300 bp paired-end reads?
Edit 1: it appears I have misinterpreted the raw data, which in turn led to my poor results. The ...
Megablast
I have a large set of short reads (from human) that I'm trying to analyze with galaxy. Specifical...
Are all the starting BED coordinates generated by BWA matching the first base of the reads ?
I am currently using "Map with BWA for illumina" (in galaxy main server) with default settings to...
Evaluating Tophat'S Results
Hi, I ran TopHat on Galaxy for my RNA-seq data. I want to analyze TopHat's output files, such as ...

Content

Help

About
FAQ

Access

RSS
Stats
API

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by Biostar version 16.09

Traffic: 179 users visited in the last hour