Question: usegalaxy build snpEff database via genbank
5 months ago by
atasaral10 wrote:

Hello, I built a SnpEff 4.3 database for Salmo salar chromosome 25 from GenBank and analyzed it with SnpEff eff: annotate variants (Galaxy Version 4.3+T.galaxy2). I obtained results for chromosome 25, and now I need to continue the analysis for the other chromosomes. I did exactly the same for chromosomes 8 and 9, but the SnpEff 4.3 databases were not built for them, although the build also worked for chromosome 28. I need databases for all chromosomes. Please help me solve this problem. I am using usegalaxy. The error message is as follows:

Fatal error: Exit code 255 (Error)
Picked up _JAVA_OPTIONS: -Xmx7g -Xms256m
00:00:00 SnpEff version SnpEff 4.3t (build 2017-11-24 10:18), by Pablo Cingolani
00:00:00 Command: 'build'
00:00:00 Building database for 'ssa08'
00:00:00 Reading configuration file 'snpEff.config'. Genome: 'ssa08'
00:00:00 Reading config file: /galaxy-repl/main/jobdir/019/880/19880994/working/snpEff.config
00:00:00 Reading config file: /cvmfs/
00:00:01 done
Chromosome: 'ssa08' length: 26434011
java.lang.RuntimeException: Transcript 'NM_001140227.1' is already in Gene 'GeneID:100195198'
    at org.snpeff.interval.IntervalAndSubIntervals.add(
    at org.snpeff.snpEffect.factory.SnpEffPredictorFactory.add(
    at org.snpeff.snpEffect.factory.SnpEffPredictorFactoryFeatures.addMrna(
    at org.snpeff.snpEffect.factory.SnpEffPredictorFactoryFeatures.addFeatures(
    at org.snpeff.snpEffect.factory.SnpEffPredictorFactoryFeatures.create(
    at
    at
    at org.snpeff.SnpEff.main(
java.lang.RuntimeException: Error reading file '/galaxy-repl/main/jobdir/019/880/19880994/dataset_25853021_files/ssa08/genes.gbk'
java.lang.RuntimeException: Transcript 'NM_001140227.1' is already in Gene 'GeneID:100195198'
    at org.snpeff.snpEffect.factory.SnpEffPredictorFactoryFeatures.create(
    at
    at
    at org.snpeff.SnpEff.main(
00:00:08 Logging
00:00:09 Checking for updates...
00:00:10 Done.

5 months ago by
Penn State
Anton Nekrutenko1.7k wrote:

Have you tried creating the DB from the entire salmon genome?


Yes, Anton, I tried it from this website for chromosome 25, and it worked. But when I tried other chromosomes, such as 8 and 9, it did not. I followed the steps under "Option 4: Building a database from GenBank files" at this link. When I added the entire genome files to usegalaxy, it did not recognize them either. For this reason I was trying to build the database chromosome by chromosome.

Reply written 5 months ago (modified 5 months ago) by atasaral10
5 months ago by
Penn State
Anton Nekrutenko1.7k wrote:

This seems like a SnpEff issue. If I understand correctly, the salmon genome assembly is highly fragmented, so the same gene can be split across several scaffolds. This is likely what is happening here. Let me take a closer look.
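
To see which records trigger the "Transcript ... is already in Gene ..." error before submitting a build, one option is to scan the GenBank file for transcript IDs that occur more than once under the same gene. The following is only a rough sketch, not part of SnpEff or Galaxy: the function names `iter_features` and `find_duplicate_transcripts` are made up for illustration, and it assumes the file follows the usual NCBI layout, with `mRNA` features carrying `/transcript_id` and `/db_xref="GeneID:..."` qualifiers.

```python
import re
import sys
from collections import Counter

# Qualifier patterns as they usually appear in an NCBI GenBank feature table.
TRANSCRIPT_RE = re.compile(r'/transcript_id="([^"]+)"')
GENE_ID_RE = re.compile(r'/db_xref="(GeneID:\d+)"')


def iter_features(lines):
    """Yield (feature_key, joined_text) for every feature in GenBank text."""
    in_features = False
    key, buf = None, []
    for line in lines:
        if line.startswith("FEATURES"):
            in_features = True
            continue
        if not in_features:
            continue
        # An unindented keyword (ORIGIN, CONTIG, //) ends the feature table.
        if line.strip() and not line.startswith(" "):
            if key is not None:
                yield key, " ".join(buf)
            key, buf = None, []
            in_features = False
            continue
        # Feature keys start in column 6; qualifier lines are indented further.
        if line.startswith("     ") and len(line) > 5 and line[5] != " ":
            if key is not None:
                yield key, " ".join(buf)
            parts = line.split(None, 1)
            key = parts[0]
            buf = [parts[1].strip()] if len(parts) > 1 else []
        elif key is not None:
            buf.append(line.strip())
    if key is not None:
        yield key, " ".join(buf)


def find_duplicate_transcripts(lines):
    """Return {(gene_id, transcript_id): count} for pairs seen more than once."""
    counts = Counter()
    for key, text in iter_features(lines):
        if key != "mRNA":
            continue
        transcript = TRANSCRIPT_RE.search(text)
        gene = GENE_ID_RE.search(text)
        if transcript and gene:
            counts[(gene.group(1), transcript.group(1))] += 1
    return {pair: n for pair, n in counts.items() if n > 1}


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage (hypothetical script name): python find_dups.py genes.gbk
    with open(sys.argv[1]) as handle:
        duplicates = find_duplicate_transcripts(handle)
    for (gene, transcript), n in sorted(duplicates.items()):
        print(f"{transcript} appears {n} times in {gene}")
```

Any pair it reports is a candidate for the error above. Whether removing or merging those features is appropriate depends on the annotation, so treat the output as a diagnostic, not a fix.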


I really appreciate it. Thank you very much. I look forward to your reply.

Reply written 5 months ago by atasaral10
5 months ago by
Penn State
Anton Nekrutenko1.7k wrote:

Yes, it is the result of the fragmented genome. We will try to modify the SnpEff build tool to accept GFF in the next few days.


Your support is very significant for my research, and I am really thankful for it. I look forward to hearing from you.

Reply written 5 months ago by atasaral10
4 months ago by
atasaral10 wrote:

Hello Anton, Thank you for modifying it. Sincerely, Sebnem
