FASTA sequences not blasting.

Question: FASTA sequences not blasting.

gbernard • 0 wrote:

Hello Great Minds,

My FASTA sequences (in tabular format) are not blasting. I downloaded the UniProt fasta.gz file to use as a database. Please provide some direction.

Best regards. GCB

rna-seq • 213 views

written 3 months ago by gbernard • 0

gb • 60 wrote:

I'm not sure what you mean, but this is already wrong:

My FASTA sequences ( in tabular format) are not blasting

FASTA is a specific format: each record starts with a header line beginning with ">" followed by a description, and the next line(s) hold the sequence. See https://blast.ncbi.nlm.nih.gov/Blast.cgi?CMD=Web&PAGE_TYPE=BlastDocs&DOC_TYPE=BlastHelp

So your sequence file needs to be in FASTA format, not tabular. You would also need to run blastp (protein query against a protein database).
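If the sequences really are stored as a two-column table, a conversion along these lines would produce FASTA. This is only a minimal sketch: it assumes each row is "identifier, tab, sequence", and the function name `tabular_to_fasta` and the example identifiers are made up for illustration.

```python
def tabular_to_fasta(tabular_text, width=60):
    """Convert 'id<TAB>sequence' rows into FASTA text (assumed two-column layout)."""
    lines = []
    for row in tabular_text.splitlines():
        row = row.strip()
        if not row:
            continue  # skip blank rows
        seq_id, sequence = row.split("\t")[:2]
        lines.append(">" + seq_id)  # header line starts with ">"
        # Wrap the sequence at a fixed width, as is conventional for FASTA
        for start in range(0, len(sequence), width):
            lines.append(sequence[start:start + width])
    return "\n".join(lines) + "\n"


# Hypothetical example rows, not taken from the poster's data
example = "contig_1\tATGGCC\ncontig_2\tTTGACA\n"
fasta_text = tabular_to_fasta(example)
print(fasta_text)
```

In Galaxy itself the same conversion can probably be done with a tabular-to-FASTA tool rather than a script.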

written 3 months ago by gb • 60

Thank you so much! You are correct. The sequences are in FASTA format with a Trinity header from the assembly. I downloaded the UniProt protein database and unzipped the file to compare against my sequences. Both are in FASTA format according to Galaxy. The issue now is to figure out the input value for the subject/database sequences and protein database sections in NCBI BLASTx. The protein database section reads "no Blast dbp available". I tried BLASTing my assembled reads against the UniProt database, which is saved in my history, and got no results. Any suggestions? Is there a database I can import?

Best regards,

written 3 months ago by gbernard • 0

I am not familiar with the public servers, so I cannot help you much more, but you should mention which Galaxy server you are using. Also check the input: before you can BLAST against a reference, you need to index the reference FASTA file. So basically you convert the FASTA file to a BLAST database.
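Outside Galaxy, the indexing step is done with the BLAST+ program `makeblastdb`. The sketch below only assembles the command and runs it if the program and the input file are both present. The file name `uniprot_sprot.fasta` and the database name `uniprot_db` are placeholders, not the poster's actual files.

```python
import os
import shutil
import subprocess


def makeblastdb_command(fasta_path, db_name, dbtype="prot"):
    """Build the makeblastdb call that turns a FASTA file into a BLAST database."""
    # dbtype is "prot" for protein FASTA (such as UniProt) or "nucl" for nucleotide
    return ["makeblastdb", "-in", fasta_path, "-dbtype", dbtype, "-out", db_name]


# Placeholder names, assumed for illustration
cmd = makeblastdb_command("uniprot_sprot.fasta", "uniprot_db")
print(" ".join(cmd))

# Only run when BLAST+ is installed and the input file exists
if shutil.which("makeblastdb") and os.path.exists("uniprot_sprot.fasta"):
    subprocess.run(cmd, check=True)
```

On a Galaxy server the equivalent would be the "NCBI BLAST+ makeblastdb" tool, if the server has it installed.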

EDIT:

I just checked https://usegalaxy.org and went to the tool named "NCBI BLAST+ blastx". You need to change the setting "Subject database/sequences" to "fasta file from history". You also need to check the genetic code setting.
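For comparison, the command-line counterpart of "fasta file from history" is the `-subject` option, which lets blastx search a plain FASTA file without building a database first. A rough sketch follows; the file names, the E-value cutoff, and genetic code 1 (the standard code) are assumptions to adjust for your organism and data.

```python
def blastx_command(query_fasta, subject_fasta, out_path, genetic_code=1, evalue=1e-5):
    """Build a blastx call: translated nucleotide query against a protein FASTA."""
    return [
        "blastx",
        "-query", query_fasta,                  # nucleotide sequences, e.g. Trinity output
        "-subject", subject_fasta,              # protein FASTA used directly as the subject
        "-query_gencode", str(genetic_code),    # genetic code used to translate the query
        "-evalue", str(evalue),                 # assumed cutoff, not from the thread
        "-outfmt", "6",                         # tabular output
        "-out", out_path,
    ]


# Placeholder file names, assumed for illustration
blastx_cmd = blastx_command("Trinity.fasta", "uniprot_sprot.fasta", "blastx_hits.tsv")
print(" ".join(blastx_cmd))
```

The block only prints the command; it does not run BLAST.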

Another thing: I am not sure this is the best approach. UniProt is a large database and your Trinity output probably contains a lot of sequences, so I think it will take a very long time. Maybe you can reduce your input, for example by removing duplicates. You can also try DIAMOND.
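One simple way to reduce the input is to drop exact duplicate sequences before searching. The sketch below keeps the first record for each distinct sequence, ignoring case. The helper names and the toy records are invented for illustration, and it does nothing about near-duplicates or isoforms.

```python
def read_fasta(text):
    """Parse FASTA text into a list of (header, sequence) tuples."""
    records = []
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:], []
        else:
            chunks.append(line)  # sequences may span several lines
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


def drop_duplicate_sequences(records):
    """Keep only the first record for each distinct sequence (case-insensitive)."""
    seen = set()
    unique = []
    for header, sequence in records:
        key = sequence.upper()
        if key not in seen:
            seen.add(key)
            unique.append((header, sequence))
    return unique


# Toy input: t2 repeats the sequence of t1 in lower case
example = ">t1\nATGC\nGGTA\n>t2\natgcggta\n>t3\nTTTT\n"
unique = drop_duplicate_sequences(read_fasta(example))
print(unique)
```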

modified 3 months ago • written 3 months ago by gb • 60
