Dear Galaxy community,
I would like to integrate my tool Annocript into Galaxy. Annocript was wrote in PERL language and needs a fasta file and a text configuration file in input. It runs BLAST software, DNA2PEP and Portrait. It creates a MySQL database that is used to integrate the annotations with UniProt.
Do you think it is possible to integrate it in Galaxy? Can you redirect me to a tutorial where a similar software has been integrated?
Thanks in advance
Francesco
Dear Martin,
thanks for the quick reply.
The script takes in input a configuration text file where you have to write the paths of the programs it needs. You can both write a full path or a single program name. In the latter case of course it should be in the $PATH. So, to respond your first question: it isn't mandatory that DNA2PEP, BLAST and Portrait are in the $PATH. But they must be installed on the user machine.
About the 2nd question, I try to explain the functioning because it is not clear to me. When you run Annocript the first time it downloads UniProt databases, creates a MySQL database (command CREATE) and inserts proteins annotations into tables. This database is created only the first time. Further times you run my software it will fetch information from the database to construct the transcriptome annotation. So, I guess also using Galaxy the user will have to install MySQL (client and server) on her/his machine.
The database is both kind of data structure needed for Annocript to run and a final useful artifact for those user, with an SQL expertise, who want to query directly the annotation database.
Hope to have been clear.
In order to answer you I have two main questions:
Does your perl script invoke these as binaries (expecting them on the $PATH)?
Is there (going to be) a Galaxy Tool that takes this DB as an input (and in what format if so) or is this going to be the final artifact of the analysis in Galaxy? How can Galaxy user benefit from this?