Question: Tophat To Cuffdiff Without Cufflinks
0
gravatar for Colaneri, Alejandro Cesar
5.5 years ago by
Hi I comparing gen expression data (RNA-seq) in Arabidopsis. Different genotypes, different conditions. Since Arabidopsis is very well annotated I decided to do cuffdiff directly after tophat. However when building my workflow I found that the cuffdiff said a gtf file is necessary. Please see the picture in this email, under INPUT FORMAT. My question is if I can still compare my libraries in the way I designed below. [cid:FEC3DE9D-5361-483A-BEDD-E7160270E291] Dr. Alejandro Colaneri Departments of Biology University of North Carolina at Chapel Hill 310 Coker Hall, CB# 3280 120 South Road Chapel Hill, NC 27599-3280 Tel: 919-962-2273 fax: 919- 962-1625
rna-seq cuffdiff • 2.6k views
ADD COMMENTlink modified 4.5 years ago by tuf076940 • written 5.5 years ago by Colaneri, Alejandro Cesar80
0
gravatar for Jennifer Hillman Jackson
5.5 years ago by
United States
Jennifer Hillman Jackson25k wrote:
Hello, You will need to provide a GTF or GFF3 file to Cuffdiff - this is what the tool uses as a reference base to build gene, transcript, and if provided in the annotation attributes, transcript start site and protein groupings to perform the differential analysis. More details can be found here: http://cufflinks.cbcb.umd.edu/manual.html Our tutorial and other wiki help is linked from here, see "Tools on the Main Server": http://wiki.galaxyproject.org/Support#Interpreting_scientific_results Hopefully this helps, Jen Galaxy team -- Jennifer Hillman-Jackson Galaxy Support and Training http://galaxyproject.org
ADD COMMENTlink written 5.5 years ago by Jennifer Hillman Jackson25k
0
gravatar for Colaneri, Alejandro Cesar
5.5 years ago by
Well in that case I will like to use the same gtf file from TAIR 10 that was used by the tophat. But the tophat have the option bult in. How can I point cuffdiff to the same file? Dr. Alejandro Colaneri Departments of Biology University of North Carolina at Chapel Hill 310 Coker Hall, CB# 3280 120 South Road Chapel Hill, NC 27599-3280 Tel: 919-962-2273 fax: 919- 962-1625 Date: Thursday, May 30, 2013 8:07 PM To: "Colaneri, Alejandro Cesar" <colaneri@email.unc.edu<mailto:colaneri@email.unc.edu>> Cc: "galaxy-user@bx.psu.edu<mailto:galaxy-user@bx.psu.edu>" <galaxy- user@bx.psu.edu<mailto:galaxy-user@bx.psu.edu="">> Subject: Re: [galaxy-user] tophat to cuffdiff without cufflinks Hello, You will need to provide a GTF or GFF3 file to Cuffdiff - this is what the tool uses as a reference base to build gene, transcript, and if provided in the annotation attributes, transcript start site and protein groupings to perform the differential analysis. More details can be found here: http://cufflinks.cbcb.umd.edu/manual.html Our tutorial and other wiki help is linked from here, see "Tools on the Main Server": http://wiki.galaxyproject.org/Support#Interpreting_scientific_results Hopefully this helps, Jen Galaxy team Hi I comparing gen expression data (RNA-seq) in Arabidopsis. Different genotypes, different conditions. Since Arabidopsis is very well annotated I decided to do cuffdiff directly after tophat. However when building my workflow I found that the cuffdiff said a gtf file is necessary. Please see the picture in this email, under INPUT FORMAT. My question is if I can still compare my libraries in the way I designed below. [cid:part1.04070709.03020802@bx.psu.edu] Dr. Alejandro Colaneri Departments of Biology University of North Carolina at Chapel Hill 310 Coker Hall, CB# 3280 120 South Road Chapel Hill, NC 27599-3280 Tel: 919-962-2273 fax: 919- 962-1625 ___________________________________________________________ The Galaxy User list should be used for the discussion of Galaxy analysis and other features on the public server at usegalaxy.org. Please keep all replies on the list by using "reply all" in your mail client. For discussion of local Galaxy instances and the Galaxy source code, please use the Galaxy Development list: http://lists.bx.psu.edu/listinfo/galaxy-dev To manage your subscriptions to this and other Galaxy lists, please use the interface at: http://lists.bx.psu.edu/ To search Galaxy mailing lists use the unified search at: http://galaxyproject.org/search/mailinglists/ -- Jennifer Hillman-Jackson Galaxy Support and Training http://galaxyproject.org
ADD COMMENTlink written 5.5 years ago by Colaneri, Alejandro Cesar80
Hello, Perhaps there is a mix-up about a reference annotation (GTF/GFF3) resource versus a reference genome database? By "built-in", do you mean the TAIR 10 reference genome that Tophat can use as a target genomic sequence database? If so, then that can also be specified when running Cuffdiff (this is probably a good idea to do), but a reference annotation resource (genes and transcripts, not genomic sequence) is still needed to perform the differential expression. I haven't seen your history, so if you did use a reference annotation GTF/GFF3 file with Tophat (this is possible, as an additional input, when using "Advanced parameters" - along with the reference genome's genomic database), then you can and should definitely use that same one again. But, it wouldn't be "built-in", instead would be a dataset uploaded by you, and already in your history - so I am making a guess that this is not what was used. But I may have misunderstood. When you do locate a reference annotation dataset (if you do not already have one), be sure to review the RNA-seq FAQ linked from the wiki I sent earlier, as you may need to adjust the chromosome identifiers in the file before using it - they have to be formatted exactly like the genomic chromosome identifiers to work correctly with the tool. You can run a tool like Picard's " BAM Index Statistics" on the Tophat output to get the chromosome formats from the built-in Arabidopsis reference genome, compare that to the reference annotation file you have, and make adjustments as needed following the general guidelines in the FAQ, using tools in the group "Text Manipulation". Since the genome is not mammalian (the chrom formats are slightly different), the processing will likely not be identical to that in the FAQ, but the overall workflow will be similar. This is easy to mix up, so hopefully some part of this helps to clarify the different inputs, Jen Galaxy team -- Jennifer Hillman-Jackson Galaxy Support and Training http://galaxyproject.org
ADD REPLYlink written 5.5 years ago by Jennifer Hillman Jackson25k
0
gravatar for tuf07694
4.5 years ago by
tuf076940
tuf076940 wrote:

If you can download a GTF file for your species, and get the data into history, you would be ok.

ADD COMMENTlink written 4.5 years ago by tuf076940
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 168 users visited in the last hour