Question: Inquiry On Fastqc Report
0
Ng Kiaw Kiaw • 10 wrote:
Dear Galaxy Officer,
Good day.
I am a new user of Galaxy main server. The tools provided are very
user-friendly. Thanks for the establishment of these.
I just new to the RNA-seq analysis and now in the learning process of
Bioinformatics.
I would like to inquire on the FastaQC report generated on my data.
For your information:
Samples: Plant (dicotyledon)
Type of data: RNA-seq (Illumina HiSeq 2000 with CASAVA v 1.8.2)
Paired ends
Adapter sequence: RPI 15 ( 5
CAAGCAGAAGACGGCATACGAGATTGACATGTGACTGGAGTTCCTTGGCACCCGAGAATTCCA)
Main purpose of my analysis: Identification of novel transcript and
gene expression studies
I run FastQC on my raw RNA-seq data both forward and reverse. I attach
the FastQC report in this email.
My questions are:
1) The basic statistics shows that my data encoding is Sanger/illumina
1.9. When I grooming my data for downstream analysis in Galaxy, is
that correct I choose "Sanger" for the input FASTQ quality score type?
2) Based on the per base sequence quality, the quality scores are
above 20.0 for both forward and reverse data. Do I still need to trim
off my data?
3) The result for "Per base sequence content", "Per base GC content",
"sequence duplication level" are fail. What are these three results
indicate? What are the solution for these problems?
4) What the overrepresented sequence indicate? Do I need to trim off
the overrepresented sequence?
5) Based on the K-mer content, how could I analyse and justify whether
this is good data or not?
6) In the reverse data FastQC report, "per sequence GC content" seem
not good. What do this indicate?
7) How could I identify the adapter sequence in my RNA-seq data and
how could could I remove?
8) After grooming data, running FastQC on data, adapter removal, is
there any other pre-processing steps need to be done before running
bowtie and top hat?
Many Thanks in advance for your kind assistance and supports.
Best regards
Ng Kiaw Kiaw
PhD student
RIKEN Yokohama Campus
Japan.
ADD COMMENT
• link
•
modified 5.3 years ago
by
Jennifer Hillman Jackson ♦ 25k
•
written
5.3 years ago by
Ng Kiaw Kiaw • 10