Galaxy FastQC (v0.52) vs FastQC (v0.11.2) Overrep. sequences differences

Question: Galaxy FastQC (v0.52) vs FastQC (v0.11.2) Overrep. sequences differences

4.0 years ago by

Canada

Dear Biostars;

To start, I love your FastQC wrapper. In fact, I found performance differences between your version (v0.5.2) and the modern (v0.11.2) versions, where Galaxy's performs better. Specifically, I wonder why the Galaxy version picks up over-represented sequences that the current version does not. I have included two fastQC report outputs, run on the same dataset (one locally with v0.11.2 and one online with Galaxy). For space, I removed possible source column (all are Illumina seq primers).

v0.11.2 (modern): Overrepresented sequences

No overrepresented sequences

v0.52 (galaxy): Overrepresented sequences

Sequence	Count	Percentage
GATCGGAAGAGCGGTTCAGCAGGAATGCCGAGACCGGATCAGATCTCGTA	6632022	10.933354026186844
GATCGGAAGAGCGGTTCAGCAGGAATGCCGAGGATCGGAAGAGCGGTTCA	1131736	1.8657462765021882
TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT	196415	0.32380392149686615
TATCGGAAGAGCGGTTCAGCAGGAATGCCGAGACCGGATCAGATCTCGTA	165001	0.27201573632820514
AATCGGAAGAGCGGTTCAGCAGGAATGCCGAGACCGGATCAGATCTCGTA	150939	0.2488335417703102

To clarify, I do not want to know why/how to deal with sequence contaminations. I simply want to get my local (v0.11.2) to perform as well as Galaxy's. Thank you for your time!

fastqc • 936 views

ADD COMMENT • link •

written 4.0 years ago by jrichardalbert • 30

4.0 years ago by

Bjoern Gruening ♦ 5.1k

Germany

Bjoern Gruening ♦ 5.1k wrote:

Not sure this helps but the Galaxy Tool version v0.52 depends on version 0.10.1 of FastQC. So I guess newer versions of FastQC are "different"? Have you checked that both command line are identical?

ADD COMMENT • link written 4.0 years ago by Bjoern Gruening ♦ 5.1k

Thank you for the reply Bjoern. I will look further into the differences in v0.10.1, unfortunately I cannot answer your question as I do not run Galaxy locally.

ADD REPLY • link written 4.0 years ago by jrichardalbert • 30

Bjoern is correct, the wrappers were based on different versions of FastQC. Reporting versions - both the wrapper and any underlying program - for all tools in the Galaxy user interface is a current priority for our team and development community.

Apologies for the confusion this caused, Jen, Galaxy team

ADD REPLY • link written 4.0 years ago by Jennifer Hillman Jackson ♦ 25k

v0.11.2 (modern): Overrepresented sequences

v0.52 (galaxy): Overrepresented sequences

Similar posts • Search »