I have a fastq file that tophat2 can't process because one of the reads seems to have a sequence length far shorter than the corresponding quality scores (154 for sequence, 434 for quality scores). The entry with the mismatch doesn't seem to have a name so I can't find it and get rid of it or mask it from the analysis that way. Is there any way that I can instruct tophat2 to ignore any reads where this kind of mismatch occurs? Or some other solution?
Thanks