I am running a local version of galaxy (17.09) trying to analyze RNA seq data with Bowtie2, DEseq2. I am the admin and have gotten this instance mostly configured. I have managed to get all the sequences loaded in to the Data libraries off my hard drive as I haven't had time to set up a FTP server. I have been trying like crazy to get the ref genomes and GFF/GTF files loaded to start my analyses. I have been all over the wiki and here on biostars looking at how to load a reference genome for hg38. I watched the video "managing galaxy's built in data and data managers." as well as looked at several very similar questions. I used the used the tool shed to obtain DMs (create DBkey and ref genome) and followed the directions in the Video, tried to build Bowtie2 indexes, and got error. I realized then I might need to do some additional indexes first as per several threads. So I got the tools for Samtools indexes, Picard indexes, and twoBit indexes. When I attempted to run the samtools index from the data manger. I still get the following error telling me it can't find a file?
Fatal error: Exit code 1 () Traceback (most recent call last): File "/Users/broxn8/shed_tools/toolshed.g2.bx.psu.edu/repos/devteam/data_manager_sam_fasta_index_builder/1865e693d8b2/data_manager_sam_fasta_index_builder/data_manager/data_manager_sam_fasta_index_builder.py", line 92, in main() File "/Users/broxn8/shed_tools/toolshed.g2.bx.psu.edu/repos/devteam/data_manager_sam_fasta_index_builder/1865e693d8b2/data_manager_sam_fasta_index_builder/data_manager/data_manager_sam_fasta_index_builder.py", line 85, in main build_sam_index( data_manager_dict, options.fasta_filename, target_directory, options.fasta_dbkey, sequence_id, sequence_name, data_table_name=options.data_table_name or DEFAULT_DATA_TABLE_NAME ) File "/Users/broxn8/shed_tools/toolshed.g2.bx.psu.edu/repos/devteam/data_manager_sam_fasta_index_builder/1865e693d8b2/data_manager_sam_fasta_index_builder/data_manager/data_manager_sam_fasta_index_builder.py", line 40, in build_sam_index proc = subprocess.Popen( args=args, shell=False, cwd=target_directory, stderr=tmp_stderr.fileno() ) File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/subprocess.py", line 710, in __init__ errread, errwrite) File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/subprocess.py", line 1335, in _execute_child raise child_exception OSError: [Errno 2] No such file or directory
So now I am totally confused how to resolved this and am not seeing a clear answer in the biostar responses or maybe I missed it?
I'm just learning command line codes and thought using galaxy to run my analyses would be easy especially since it's supposed to be for those without extensive computational experience. I have too much data to use the main instance which was fairly easy to use. It didn't seem like it should be that hard to setup a local instance, but I have to say, these ref genome instructions are seemingly not detailed enough for some of us. As this seems to be a re-occurring question all the time, perhaps it's time to write up a more detailed version of the instructions and by detailed I mean so that a child could do it. I'm sure it's something simple but I just can't figure it out! If anyone can help me figure this out, I'd be most appreciative!!! Best, Heather