Question: RNAseq: RNA STAR ---Mapping to a viral genome
0
gravatar for esundaramoorthy
8 weeks ago by
esundaramoorthy0 wrote:

Hi, I am analyzing an infection time course in a human cell line transduced with Vaccinia virus. The Vaccinia virus genome is not available in genomes section. I used the instructions here (https://galaxyproject.org/learn/custom-genomes/) to download the Vaccinia virus genome (https://www.ncbi.nlm.nih.gov/nuccore/AY243312.1?report=fasta) (approx 195Kb genome). Normalized using the toolkit NormalizeFasta trimmed the first line and used this as my custom genome. I made this as a custom build and performed alignment using STAR with no success.

1) The same setup works fine with BWA alignment. Could someone give any insights?

2) Viral genomes have no chr contig references. Is there anyone who has used a viral genome as a custom genome. Can anyone let me know how to proceed with the chromosome coordinate formatting for RNA STAR alignment?

3) How do you append Viral genome to the human genome and merge human and viral GTF/GFF files?

Thanks for your help/advice, anything that would involve galaxy tools for manipulation would be a plus. Cheers Nambi

ADD COMMENTlink modified 8 weeks ago by Jennifer Hillman Jackson25k • written 8 weeks ago by esundaramoorthy0
1
gravatar for Jennifer Hillman Jackson
8 weeks ago by
United States
Jennifer Hillman Jackson25k wrote:

Hello,

  1. RNA STAR expects the input fastq data to be spliced (RNA). BWA expects the input to be unspliced (DNA or unspliced RNA).

  2. The ">" line of the custom genome is the "chromosome". It is Ok if this is just "chr", or you can modify that yourself to be more specific. Just make sure that it is all "one_word" -- no spaces and is a match for the content of the related GTF dataset to avoid a mismatch problem.

  3. You could upload both reference genomes and combine them (Concatenate) into a single fasta target custom genome, but with such a large genome (human), tools may run out of memory during runtime when using a public server. You could set up your own Galaxy and allocate more memory and/or pre-index the custom genome into a built-in genome (to avoid re-indexing every time you map against it). The reference GTF could also be combined with Concatenate - just make sure that each "chromosome" identifier is unique within the file and is an exact match to the associated content in the custom genome.

FAQs: https://galaxyproject.org/support/

Tutorials: https://galaxyproject.org/learn/

Thanks! Jen, Galaxy team

ADD COMMENTlink modified 8 weeks ago • written 8 weeks ago by Jennifer Hillman Jackson25k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 104 users visited in the last hour