Question: Remove duplicate rows from table
gravatar for h.stotz
2.8 years ago by
European Union
h.stotz20 wrote:

I have a table in Galaxy with 19,132 rows.  I can remove duplicates using group from join, subtract and group and obtain 6,934 entries, but I loose the information from all of the other 23 columns.  How can I remove all of the duplicate rows while keeping all of the information of my 23 columns?

galaxy • 856 views
ADD COMMENTlink modified 2.8 years ago by Jennifer Hillman Jackson25k • written 2.8 years ago by h.stotz20
gravatar for Jennifer Hillman Jackson
2.8 years ago by
United States
Jennifer Hillman Jackson25k wrote:


There is no simple tool to perform a "sort unique" on a tabular dataset. Although this would be helpful. Let me ask around and open a ticket if there is interest (I'll post it back here).

Meanwhile, try the tool DataMash. It is similar to Group, but the columns to retain can be specified.

Thanks, Jen, Galaxy team 

ADD COMMENTlink written 2.8 years ago by Jennifer Hillman Jackson25k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 179 users visited in the last hour