Question: Markduplicate single end reads miRNA seq data
0
gravatar for aangajala
9 months ago by
aangajala0
aangajala0 wrote:

I am trying to analyze a RNA seq, single end SRA dataset. For this do i have to perform Markduplicates as it was mentioned in the tutorial bellow. When i do so, I am seeing 70-80% duplicates.In the example provided in the tutorial, it was less than 10% for paired end data.I am wondering if I should perform markduplicate, considering it is a single end data.Please advice. Do i have to do anything different considering micro RNA sequencing data as compared to tutorial. https://galaxyproject.org/tutorials/ngs/

Thank you.

rna-seq • 265 views
ADD COMMENTlink modified 9 months ago by Jennifer Hillman Jackson25k • written 9 months ago by aangajala0
0
gravatar for Jennifer Hillman Jackson
9 months ago by
United States
Jennifer Hillman Jackson25k wrote:

Hello,

For very short reads, especially single-end reads, removing duplicates is not straightforward. I would suggest skipping this step in the analysis but you should also consider advice from sources with a specific focus on this type of analysis (publications, and possibly general bioinformatics forums like https://www.biostars.org).

Tutorial: https://galaxyproject.org/tutorials/ngs/#sampling-coincidence-duplicates

Thanks, Jen, Galaxy team

ADD COMMENTlink written 9 months ago by Jennifer Hillman Jackson25k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 167 users visited in the last hour