4.5 years ago by
United States
Hello,
Keep in mind that when you collapse sequences, the original quality score is going to be lost. There isn't a way to merge the quality scores and they probably differ between the reads, even if the sequence is the same. Still - if you want to do this, convert to fasta, collapse, then convert back to fastq with default quality scores assigned, using the tool "NGS: QC and manipulation -> Combine FASTA and QUAL".
Your question does not seem to fit the tags assigned - you can change them or I can. Bowtie/Bowie2 would not be a good choice for mapping RNA-seq reads to yeast because of the splicing (Tophat/Tophat2 is better). Also, removing redundancy in reads will almost certainly bias the expression profile of the data, also not a good idea for RNA-seq. If really doing RNA-seq, I wonder what tutorial are you following .. here are some others to explore: https://wiki.galaxyproject.org/Support#Tools_on_the_Main_server:_RNA-seq
Best, Jen, Galaxy team
Thank you for responding to my question, much appreciated. I'm not actually doing RNA-seq but HITS-CLIP instead, but there was no tag for that so I thought RNA-seq might be sufficient. Do you still think that Tophat/Tophat2 would be a better fit for sequencing?
(since we must do PCR to generate our cDNA libraries for HITS-CLIP we collapse identical reads)
All Best,
Frank
Hello,
I am not specifically familiar with all of the nuances of this protocol, but since this is RNA data (any data from a cDNA library would fall into this category), then choosing a mapper that will accommodate spanning splice sites seems important. A post a bit over a year ago at the primary Biostar web site has Tophat/2 as well as BLAT mentioned as good potential choices, plus some others, in the top ranked answers. These both seem like good places to start.
https://www.biostars.org/p/60563/
Keep in mind that BLAT requires longer sequences, but will accept just fasta data and is an excellent tool. The wrapper is available in the Tool Shed for use in a local/cloud Galaxy. It will need proper licensing for the binary - but that is free for academic/research use. You may want to use the cloud/toolshed anyway, if other tools of interest are there. Be sure to check out Amazons AWS grants for educational/research work if choosing the cloud.
http://usegalaxy.org/cloud
http://usegalaxy.org/toolshed
Feel free to create new tags - this site is just getting started!
Jen