Question: remove overrepresented (contaminated) sequence
gravatar for jieun.e.park
2.7 years ago by
jieun.e.park0 wrote:


Based on FastQC result, I have two overrepresented sequence. One is my TruSeq Adapter sequence and another seems like a contaminated sequence (below) AAGCAGTGGTATCAACGCAGAGTACTTTTTTTTTTTTTTTTTTTTTTTTTT

I would like to remove/trim this contaminated sequence from my fastq file before mapping, but I don't know how to do it. I heard that this can be done with fastq-grep but not sure how to use grep tool to remove the contaminated sequence.

Thank you for your help in advance.

rna-seq • 2.6k views
ADD COMMENTlink modified 2.7 years ago by Jennifer Hillman Jackson25k • written 2.7 years ago by jieun.e.park0

If you could give me detailed explanation (I am new to usegalaxy!) I would appreciate it so much!

ADD REPLYlink written 2.7 years ago by jieun.e.park0
gravatar for Jennifer Hillman Jackson
2.7 years ago by
United States
Jennifer Hillman Jackson25k wrote:


If most of the sequence is contaminate, then it will fall out during mapping and trimming/filtering is not really needed as a precursor step. But if you still want to, see the trimming tools in the group NGS: QC and manipulation. Try Trim Galore! or Trimmomatic. Documentation is on each tool's form.

Thanks, Jen, Galaxy team

ADD COMMENTlink written 2.7 years ago by Jennifer Hillman Jackson25k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 173 users visited in the last hour