Question: Fastqc And Blast?
0
gravatar for Jorge Braun
5.0 years ago by
Jorge Braun70
Jorge Braun70 wrote:
Hello mates, I have two doubts galaxy: a) I have rna-seq data from Illumina and do fastqc ... the results are good but when I fastgroomer to Sanger format and then fastqc ... the results are bad. Does anyone know the cause? I do not understand why. b) With the same sequences can know if they are different Rna and eliminate those that do not want to examine? merry christmas :)
galaxy • 1.3k views
ADD COMMENTlink modified 5.0 years ago by Jennifer Hillman Jackson25k • written 5.0 years ago by Jorge Braun70
0
gravatar for Jennifer Hillman Jackson
5.0 years ago by
United States
Jennifer Hillman Jackson25k wrote:
Hello, For the first question, make sure that you are running the groomer with the correction options. In almost all cases for Illumina data this will mean leaving all but one setting at default. The setting to change is "Input FASTQ quality scores type:". The results of FastQC will inform you about how to set this. An example is in this wiki section's screencast plus the first bullet point: https://wiki.galaxyproject.org/Support#Dataset_special_cases http://vimeo.com/galaxyproject/fastqprep For the second, I am not sure what you mean by 'different'. Do you mean the data may have contamination from another species? Or that the the data content may be different with respect to quality? In short, to filter based on quality as reported in the FastQC report, try tools in the same tool group such as "FASTQ Trimmer" or "FASTQ Quality Trimmer". The protocols included in our RNA-seq pipeline help start out with some quality steps: https://wiki.galaxyproject.org/Support#Tools_on_the_Main_server:_RNA- seq And many from our community have contributed RNA-seq tutorials: https://wiki.galaxyproject.org/Learn#Other_Tutorials Hopefully this helps! Jen Galaxy team -- Jennifer Hillman-Jackson http://galaxyproject.org
ADD COMMENTlink written 5.0 years ago by Jennifer Hillman Jackson25k
Hello, of course, Jennifer is right for the first question 😊 . For the second question about blast ... I wonder if running after blast in galaxy I can remove sequences that can contaminate the data. It's possible? Last question, trinity is 100% operational in galaxy? Because trinity ran but the result was empty, and I think the script failure .Py Thanks Jennifer and colleagues for your patience and solutions Date: Fri, 13 Dec 2013 08:53:31 -0800 To: braun_bio@hotmail.com; galaxy-user@lists.bx.psu.edu Subject: Re: [galaxy-user] fastqc and blast? Hello, For the first question, make sure that you are running the groomer with the correction options. In almost all cases for Illumina data this will mean leaving all but one setting at default. The setting to change is "Input FASTQ quality scores type:". The results of FastQC will inform you about how to set this. An example is in this wiki section's screencast plus the first bullet point: https://wiki.galaxyproject.org/Support#Dataset_special_cases http://vimeo.com/galaxyproject/fastqprep For the second, I am not sure what you mean by 'different'. Do you mean the data may have contamination from another species? Or that the the data content may be different with respect to quality? In short, to filter based on quality as reported in the FastQC report, try tools in the same tool group such as "FASTQ Trimmer" or "FASTQ Quality Trimmer". The protocols included in our RNA-seq pipeline help start out with some quality steps: https://wiki.galaxyproject.org/Support#Tools_on_the_Main_server:_RNA- seq And many from our community have contributed RNA-seq tutorials: https://wiki.galaxyproject.org/Learn#Other_Tutorials Hopefully this helps! Jen Galaxy team Hello mates, I have two doubts galaxy: a) I have rna-seq data from Illumina and do fastqc ... the results are good but when I fastgroomer to Sanger format and then fastqc ... the results are bad. Does anyone know the cause? I do not understand why. b) With the same sequences can know if they are different Rna and eliminate those that do not want to examine? merry christmas :) ___________________________________________________________ The Galaxy User list should be used for the discussion of Galaxy analysis and other features on the public server at usegalaxy.org. Please keep all replies on the list by using "reply all" in your mail client. For discussion of local Galaxy instances and the Galaxy source code, please use the Galaxy Development list: http://lists.bx.psu.edu/listinfo/galaxy-dev To manage your subscriptions to this and other Galaxy lists, please use the interface at: http://lists.bx.psu.edu/ To search Galaxy mailing lists use the unified search at: http://galaxyproject.org/search/mailinglists/ -- Jennifer Hillman-Jackson http://galaxyproject.org
ADD REPLYlink written 5.0 years ago by Jorge Braun70
The BLAST suite is not available on the public Galaxy server at http://usegalaxy.org but is available from the Galaxy Tool Shed if you have a local Galaxy instance: http://toolshed.g2.bx.psu.edu/view/devteam/ncbi_blast_plus/ One way to filter your FASTA file based on BLAST hits would be to use the tabular output from BLAST with this sequence filtering tool: http://toolshed.g2.bx.psu.edu/view/peterjc/seq_filter_by_id e.g. If you want to remove transcripts which seem to be mitochondria, you could BLAST against a mitochondrial database, and take only the sequence with no hits. Regards, Peter
ADD REPLYlink written 5.0 years ago by Peter Cock1.4k
Hi Jorge, I see that Peter helped with the Blast question (thanks Peter!), but you are having trouble with Trinity, too. Trinity should work without any issues to my knowledge. Make sure that you are running the latest distribution (or cloudman) and have all of the dependencies set up. Then, if you are still having problems, send details "to" the "galaxy-dev@bx.psu.edu" mailing list only (not to a team member, and please start a new thread - not a reply the galaxy- user list). This way our development team/community will see the thread and can help you troubleshoot the install. https://wiki.galaxyproject.org/MailingLists#The_lists Thanks! Jen Galaxy team -- Hello, of course, Jennifer is right for the first question Emoji. For the second question about blast... I wonder if running after blast in galaxy I can remove sequences that can contaminate the data. It's possible? Last question, trinity is 100% operational in galaxy? Because trinity ran but the result was empty, and I think the script failure .Py Thanks Jennifer and colleagues for your patience and solutions -- Jennifer Hillman-Jackson http://galaxyproject.org
ADD REPLYlink written 5.0 years ago by Jennifer Hillman Jackson25k
0
gravatar for Jennifer Hillman Jackson
5.0 years ago by
United States
Jennifer Hillman Jackson25k wrote:
Hello Mark, Correct - no needed. Any Illumina data that results from a CASAVA 1.8 or higher pipeline is already has quality scores scaled to Sanger Phred with an ASCII offset of 33. This translates to the ".fastqsanger" format in Galaxy, so no grooming is required, you can just assign the datatype. How to do that is in the first link below. And all the details, including how to read a FastQC report to determine this is in the screencast. Thanks! Jen Galaxy team -- Jennifer Hillman-Jackson http://galaxyproject.org
ADD COMMENTlink written 5.0 years ago by Jennifer Hillman Jackson25k
0
gravatar for Jennifer Hillman Jackson
5.0 years ago by
United States
Jennifer Hillman Jackson25k wrote:
Hello Mark, Correct - not needed. Any Illumina data that results from a CASAVA 1.8 or higher pipeline is already has quality scores scaled to Sanger Phred with an ASCII offset of 33. This translates to the ".fastqsanger" format in Galaxy, so no grooming is required, you can just assign the datatype. How to do that is in the first link below. And all the details, including how to read a FastQC report to determine this is in the screencast. Thanks! Jen Galaxy team -- Jennifer Hillman-Jackson http://galaxyproject.org
ADD COMMENTlink written 5.0 years ago by Jennifer Hillman Jackson25k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 175 users visited in the last hour