Question: How to remove duplicate sequences
16 months ago by
pattythepsuwan wrote:

Our Ribo-seq libraries included a PCR step (9 cycles of amplification) to obtain enough material to put on the sequencer. Because of this, we expect that many of the reads are exact duplicates that do not reflect real biological duplicates but arise as PCR artifacts. Is there a tool on Galaxy that I can use to remove these duplicates?

Sincerely, Patty Thepsuwan

rna-seq reads duplicate
16 months ago by
United States
Jennifer Hillman Jackson wrote:

Hello Patty,

Map the data, sort the resulting BAM file, then run the tool NGS: SAMtools > RmDup. The SAMtools tool group also has options to mark and/or remove duplicates. To find them all quickly, search for "duplicates" in the tool panel.
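
To illustrate what position-based duplicate removal does conceptually, here is a minimal Python sketch. It is not the SAMtools implementation: the `Read` record and the `remove_position_duplicates` function are hypothetical names made up for this example, and the rule used (keep the read with the highest mapping quality among reads sharing chromosome, start position, and strand) is a simplified assumption about how such tools treat single-end data.

```python
from collections import namedtuple

# Hypothetical minimal read record; real BAM records carry many more fields.
Read = namedtuple("Read", ["name", "chrom", "pos", "strand", "mapq"])


def remove_position_duplicates(reads):
    """Keep one read per (chrom, pos, strand), preferring the highest MAPQ."""
    best = {}
    for read in reads:
        key = (read.chrom, read.pos, read.strand)
        # First read seen at this position wins unless a later one has higher MAPQ.
        if key not in best or read.mapq > best[key].mapq:
            best[key] = read
    # Return survivors in coordinate order, similar to a sorted BAM file.
    return sorted(best.values(), key=lambda r: (r.chrom, r.pos, r.strand))


reads = [
    Read("r1", "chr1", 100, "+", 30),  # same position and strand as r2
    Read("r2", "chr1", 100, "+", 42),
    Read("r3", "chr1", 100, "-", 20),  # opposite strand, so not a duplicate
    Read("r4", "chr1", 250, "+", 42),
]
deduped = remove_position_duplicates(reads)
print([r.name for r in deduped])  # ['r2', 'r3', 'r4']
```

Because a rule like this looks only at mapped position, it cannot tell PCR copies apart from independent fragments that happen to map to the same place.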

Thanks, Jen, Galaxy team

