Question: fastq data types
1
gravatar for tomaz
3.4 years ago by
tomaz10
tomaz10 wrote:

Hi,

I've done quite some searching and have found numerous posts on the fastq format versions. I still cannot figure out what galaxy's format names designate:

- fastqcssanger

- fastqillumina

- fastqsanger

- fastqsolexa

Any help would be greatly appreciated.

 

ADD COMMENTlink modified 3.4 years ago by Jennifer Hillman Jackson23k • written 3.4 years ago by tomaz10
4
gravatar for Peter Cock
3.4 years ago by
Peter Cock1.4k
European Union
Peter Cock1.4k wrote:

See http://dx.doi.org/10.1093/nar/gkp1137

  • fastqcssanger = FASTQ color space using Sanger encoding (PHRED scores + 33)
  • fastqillumina = FASTQ using now obsolete Illumina encoding (PHRED scores + 64)
  • fastqsanger = FASTQ using Sanger encoding (PHRED scores + 33), probably you want this
  • fastqsolexa = FASTQ using long obsolete Solexa encoding (Solexa scores + 64)
ADD COMMENTlink modified 3.2 years ago • written 3.4 years ago by Peter Cock1.4k

Great answer Peter. I'm going to add in a link here that will help others (reading or finding post through a search) to detect original formatting and modify as needed for use with Galaxy's tools (or most of them!). The above is right on target; ".fastqsanger" is the standard quality score scaling expected by most tools.
http://wiki.galaxyproject.org/Support#Dataset_special_cases
Best, Jen, Galaxy team

ADD REPLYlink written 3.4 years ago by Jennifer Hillman Jackson23k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 47 users visited in the last hour