Question: Salmon: Error while indexing Human transcriptome file
0
gravatar for syrez
13 months ago by
syrez0
Chicago
syrez0 wrote:

Hi there!

I downloaded what I hope to be a human transcriptome data file from biomart. I select for gene types in Gene, cDNA Sequences in Attributes and in header info, I pick Gene Stable ID and Transcript ID. The downloaded file is in txt format.

When I initiate salmon to index the file salmon index -t ~/path/mart_export.txt -i hg38_index

I get the following :

error Building suffix array . . . FAILURE: return code from libdivsufsort() was -1

Could anyone please help me with this?

Thanks!

salmon • 463 views
ADD COMMENTlink modified 13 months ago by Jennifer Hillman Jackson25k • written 13 months ago by syrez0
0
gravatar for Jennifer Hillman Jackson
13 months ago by
United States
Jennifer Hillman Jackson25k wrote:

Hello,

I would check two items when importing sequence data this way:

  1. The load from was complete. Transcriptomes can be quite large and it is usually better to download these in fasta format from the source, then upload to Galaxy with FTP.

  2. This tool accepts fasta format. The tool Tabular-to-FASTA converts a tabular file to FASTA format. Also be sure to follow the guidelines for Custom genome formatting here (the same format rules apply for your use case): https://galaxyproject.org/learn/custom-genomes/

Thanks! Jen, Galaxy team

ADD COMMENTlink written 13 months ago by Jennifer Hillman Jackson25k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 151 users visited in the last hour