Question: Masking reads that map to two genomes in Galaxy
2.5 years ago
k.rattigan.1 wrote:

I have paired-end Illumina RNA-seq data that comes from two different species. I ran TopHat against the two genomes separately and got 42 million and 5 million concordantly aligned reads, respectively. Is there a way to mask the reads that map to both species?

Thanks for any help or advice you can give.

2.5 years ago
United States
Jennifer Hillman Jackson wrote:


You could compare the mapped sequence identifiers from the two runs and filter out the sequences that appear in both. The tool groups Text Manipulation, Filter and Sort, and Join, Subtract and Group contain many basic operations that can be chained together, much like command-line bioinformatics.
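
For anyone who wants to see the same identifier comparison outside Galaxy, here is a minimal Python sketch. It assumes the two alignments have been exported as SAM text, where the read name is the first tab-separated column and both mates of a pair normally share that name. The function names (`read_ids`, `drop_shared`) and the toy records are made up for illustration; they are not part of Galaxy or TopHat.

```python
def read_ids(sam_lines):
    """Collect read names (column 1) from SAM lines, skipping headers and blanks."""
    ids = set()
    for line in sam_lines:
        if line.startswith("@") or not line.strip():
            continue
        ids.add(line.split("\t", 1)[0])
    return ids


def drop_shared(sam_lines, shared):
    """Yield header lines plus alignments whose read name is not in `shared`."""
    for line in sam_lines:
        if not line.strip():
            continue
        if line.startswith("@") or line.split("\t", 1)[0] not in shared:
            yield line


# Toy SAM records (hypothetical data) standing in for the two mapping runs.
species_a = [
    "@HD\tVN:1.0",
    "read1\t99\tchrA\t100\t50\t4M\t=\t200\t104\tACGT\tIIII",
    "read2\t99\tchrA\t300\t50\t4M\t=\t400\t104\tACGT\tIIII",
]
species_b = [
    "@HD\tVN:1.0",
    "read2\t99\tchrB\t10\t50\t4M\t=\t110\t104\tACGT\tIIII",
    "read3\t99\tchrB\t500\t50\t4M\t=\t600\t104\tACGT\tIIII",
]

# Reads mapped in both runs are the ones to mask.
shared = read_ids(species_a) & read_ids(species_b)

# Keep only the alignments that are specific to each species.
unique_a = list(drop_shared(species_a, shared))
unique_b = list(drop_shared(species_b, shared))
```

For real files, the lists would be replaced by open file handles, read once to build the identifier sets and a second time to filter. This mirrors what Join, Subtract and Group does in Galaxy: intersect the identifier columns, then subtract the shared identifiers from each dataset.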

Best, Jen, Galaxy team

