Question: clear overrepresented reads that arent adaptadors
gravatar for biopili9
6 months ago by
biopili910 wrote:

I am working in galaxy and I mapped the sequences with a genome of reference but the result show that only 27% of sequences were mapped. I did the trimmomatic but I cant clear the sequences overrepresent. wich tools I need to use for clear the specifics sequences


rna-seq • 145 views
ADD COMMENTlink modified 6 months ago by luca.marisaldi30 • written 6 months ago by biopili910
gravatar for luca.marisaldi
6 months ago by
luca.marisaldi30 wrote:


Overrepresented sequences, among other things, might be an indicator of:

  • Contamination
  • Depending on the pre-processing cleaning might be a something related to an incomplete full elimination of ribosomal RNA
  • Presence of adapters (in this case FastQC should detect that)

I suggest to run a quality control before/after the cleaning steps, checking not only on the option "overrepresented sequences" but also other parameters such as "GC content" and "Kmer repetitions" (both provided by FastQC output). This way should give you some details about these sequences. As an option you can also blast them to quickly check what they are.

Once you understand that you can, in my opinion, proceed further.

Have a good day,


ADD COMMENTlink written 6 months ago by luca.marisaldi30
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 84 users visited in the last hour