21 months ago by
United States
Hello,
The tool extracts the data in the format the data source servers, which in this case can be merged paired-end files.
For many use cases, this is the general prep workflow for fastq reads from NCBI:
- Get fastq data into Galaxy using NCBI SRA Tools > Extract reads
- Rename sequences - do this if the quality score labels on the "+" lines differ from the sequence names on the "@" lines
- FASTQ splitter
- FastQC
- Fastq Groomer or assign datatype to "fastqsanger" directly if quality scores already scaled for this format
- FastQC on the entire dataset again, if the Fastq Groomer was used
- Trim/filter as needed
- Downstream tools
Thanks!