4 weeks ago by
The current version "Bowtie for Illumnia" in the Tool Shed (and installed at http://usegalaxy.org) accepts fastqsanger sequence as input (Fastq sanger with Phred +33 quality score scaling). The tool produces output appropriate for use with other tools at the public Main website only when fastqsanger data is used. Datasets must be assigned the datatype fastqsanger for most tools to even recognize them as proper input.
This is how to prep your files. It includes instructions about how to check the fastq type and make adjustments. If after running FastQC and the data is in fastqsanger format, or you know it is already, there is no need to run Fastq Groomer. Instead, just assign the datatype directly.
If running the prep steps, make this initial FastQC job run quicker by executing it against just a subset of the fastq dataset (Text Manipulation > Select first lines from a dataset with the line number to keep as a multiple of 4). The first few sequences (100 or so) is enough input to detect quality score scaling type. Running FastQC on all of the data can be done after to assess and act on the quality metrics reported (run on the full original dataset if not modified, or on the full Fastq Groomer output if it was used to make changes).
That said, the other tool option is Bowtie2. This is a better choice for most for many reasons. That tool also accepts/expects fastqsanger formatted input fastq data.
The Fastq Groomer can convert between fastqsanger and fastqillumina (or any of the others, either way), but converting from illumina > sanger is not useful for most.
If you still need help after doing the above, please let us know.
Hopefully this helps! Jen, Galaxy team