Question: Rename sequence error
0
gravatar for kyle.hubbard
3.4 years ago by
United States
kyle.hubbard0 wrote:

Hello,

So I'm trying to create a metagenomic workflow, but I can't seem to get the formatting correct. I uploaded a FASTA file, then used the FASTA Manipulation/FASTA width formatter (set to 0) to get the nucleotide strings for each identifier/prefix on a single line.  I then attempted to use the NGS QC and Manipulation/Rename Sequences tool, but I get an error stating:

fastx_renamer: Invalid input: This looks like a multi-line FASTA file.
Line 7 contains a nucleotides string instead of a '>' prefix.
FASTX-Toolkit can't handle multi-line FASTA files.
Please use the FASTA-Formatter tool to convert this file into a single

If I used the FASTA width formatter and set to 0 so that the prefixes and strings are alternating every line, what's the issue?

Thanks!

galaxy • 1.0k views
ADD COMMENTlink modified 3.4 years ago by Jennifer Hillman Jackson25k • written 3.4 years ago by kyle.hubbard0
0
gravatar for Jennifer Hillman Jackson
3.4 years ago by
United States
Jennifer Hillman Jackson25k wrote:

Hello,

There may have been a format issue with the orignal upload file. Examine the line the error reports. Be aware that other lines could have a problem, as just the first occurrence of a problem is usually reported by tools.

Note: if you used certain text editors with this dataset, some can add in hidden characters. Avoid that type of manipulation or use a plain text editor instead. The dreaded "^M" (a type of "soft return") from Microsoft tools is a common issue encountered in input datasets for bioinformatics pipelines (more in this wiki: https://en.wikipedia.org/wiki/Newline#Common_problems)

Then, once you have fasta format, use unwrapped sequences with these manipulation tools as you were doing. Then wrap the final dataset before using with tools downstream to avoid other issues (some tools do not care about wrapping, in particular NGS mappers, but many others do).

More about format troubleshooting is here: CustomGenomes#Troubleshooting

Thanks, Jen, Galaxy team

ADD COMMENTlink modified 3.4 years ago • written 3.4 years ago by Jennifer Hillman Jackson25k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 178 users visited in the last hour