Question: Privacy of data
13 days ago by
I am Huiwen Xu, a PhD student at the University of Granada (Spain). We are studying the role of exercise and brown adipose tissue on gut microbiota in young healthy adults. We are very interested in using your Galaxy software; however, we have some points that we would like to discuss with you:

We know that other software (for instance U-search) has a high error rate when the files are integrated. Do you have any information on the error rate of the process that integrates the data into a single file? We were also wondering whether, if we use Galaxy and upload the data to the cloud, our data remain unavailable to other users, or whether they become accessible to everyone. Our ethics committee does not allow us to share any information from our project. Both points are very important to us; otherwise, we will not be able to use your software to prepare our manuscripts.

13 days ago by
Martin Čech ♦♦ 4.4k
United States
I am not sure I understand what you mean by 'error percentage', but I believe the basic rule for data privacy in Galaxy is that you should put the security at a different level than Galaxy, e.g. on a VM/Docker host, and run Galaxy only within the container with local data.

This is a complex problem though, so if you describe your target workflow in more detail we might be able to give you more advice.

By "error percentage" I meant that, for example, when we used U-search we could see that only 53.32% of the total had been merged. So my question is what percentage of merged data can be obtained with Galaxy.
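For anyone reading along, one way to interpret a "percentage merged" figure like the one above is the number of merged records divided by the total number of input records. The snippet below is only a minimal sketch of that arithmetic. The function name `merge_rate` and the counts are hypothetical, chosen to reproduce the 53.32% quoted above, and it assumes the figure refers to something like paired-end read merging, which the thread does not confirm.

```python
def merge_rate(merged: int, total: int) -> float:
    """Return the percentage of input records that were merged.

    Both counts are assumed to come from the merging tool's own report;
    this helper is illustrative and not part of Galaxy or any other tool.
    """
    if total <= 0:
        raise ValueError("total must be a positive count")
    if not 0 <= merged <= total:
        raise ValueError("merged must be between 0 and total")
    return round(100 * merged / total, 2)


# Hypothetical counts: 5332 merged out of 10000 input records.
print(merge_rate(5332, 10000))
```

The same calculation applies whichever tool does the merging, so the number mostly reflects the input data and the parameters chosen rather than the platform the tool runs on.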

Hello - How data will be processed through any workflow depends on the original data content, how it is prepared, and the tools/parameters used. Most Galaxy tools are wrappers around 3rd-party open-source tools. You can review/test out those we recommend for this type of analysis at a public Galaxy server, on your own Galaxy server, or on the command line.

All Galaxy tutorials:

Specifically, please see:

Thanks! Jen, Galaxy team

I am sorry but I still do not understand the question. What does it mean that 'data have been merged'?

10 days ago by
Bjoern Gruening 4.9k
I agree with Martin that if you need to deal with legal issues and patient data, for example, you are probably not allowed to upload your data to the cloud. But to answer your question a little more concretely: uploaded data cannot be seen by any user other than yourself. You are the admin of your data and you can fine-tune the access rights on your data. By default Galaxy is really strict and protects your data. So the question is really what your restrictions are: which level of privacy do you need to guarantee?



I would like to double down here and repeat that Galaxy should not be perceived as a privacy or security layer for your data. I recommend setting up your security at a different level.

