Question: Galaxy doesn't recognise dbSNP file I uploaded
2.5 years ago by
European Union
henryrobins30 wrote:


I'm somewhat new to the game and struggling with the GATK Unified Genotyper.

I found a dbSNP file (in VCF format), but Galaxy isn't recognising it as a VCF file. I tried changing the datatype under the dataset's attributes, and that changed the file type to vcf, but Galaxy still isn't accepting that I have a VCF file.

I then noticed a message on the dataset itself saying that Galaxy had a problem uncompressing the zipped data, so I've unzipped the file and am uploading it again, but it is 27 GB and I'm running out of space! All of my fastq files were zipped and that wasn't a problem.

Surely there is an easier way? Any thoughts would be much appreciated.



gatk • 629 views
modified 2.5 years ago by Jennifer Hillman Jackson24k • written 2.5 years ago by henryrobins30
2.5 years ago by
United States
Jennifer Hillman Jackson24k wrote:


Galaxy uncompresses every dataset on upload anyway, so loading the file uncompressed only costs extra upload time; the size in your history will be the same either way. If the file uncompresses locally but not in Galaxy for some reason, then load it uncompressed. Use a client such as FileZilla to manage the FTP transfer (if needed) and resume it if it is interrupted, until the entire dataset is loaded. Do watch your disk space usage: if it goes over the 250 GB quota, no work can be done until usage is reduced.
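
To check whether the compressed file is intact locally before spending upload time on it, something like the following Python sketch may help. It assumes the download is gzip-compressed (a .vcf.gz file, not a .zip archive); the function name check_vcf_gz and the file name in the usage note are hypothetical, and this only tests the local copy, not how Galaxy itself handles the upload.

```python
import gzip
import zlib


def check_vcf_gz(path, chunk_size=1 << 20):
    """Return (ok, message) after reading the whole gzip stream at `path`.

    Hypothetical helper: assumes `path` is a gzip-compressed VCF.
    """
    try:
        with gzip.open(path, "rb") as handle:
            first_line = handle.readline().decode("utf-8", errors="replace")
            first_line = first_line.rstrip("\r\n")
            # A VCF file is expected to start with a "##fileformat=VCF..." line.
            if not first_line.startswith("##fileformat=VCF"):
                return False, "decompresses, but first line is not a VCF header: %r" % first_line
            # Read the rest in chunks so a very large file never sits in memory;
            # a truncated or corrupt stream raises an error along the way.
            while handle.read(chunk_size):
                pass
    except (OSError, EOFError, zlib.error) as exc:
        return False, "could not decompress: %s" % exc
    return True, first_line
```

Calling check_vcf_gz("dbsnp.vcf.gz") on the local copy returns a pair: whether the file read cleanly, and either the header line or the error. If it fails locally, the download itself is probably damaged and is worth fetching again before uploading.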

If more resources are needed for the work, then a Cloud Galaxy could be a solution. AWS offers educational/research grants to help with costs. Galaxy Choices:

Thanks, Jen, Galaxy team

