Remove duplicate rows from table

Heads up! This is a static archive of our support site. Please go to help.galaxyproject.org if you want to reach the Galaxy community. If you want to search this archive visit the Galaxy Hub search

Latest

Open

RNA-Seq

ChIP-Seq

SNP

Assembly

Forum

Home

Welcome to Galaxy Biostar! User support for Galaxy! about • faq • rss

Log In

Sign Up

Question: Remove duplicate rows from table

0

2.8 years ago by

h.stotz • 20

European Union

h.stotz • 20 wrote:

I have a table in Galaxy with 19,132 rows. I can remove duplicates using group from join, subtract and group and obtain 6,934 entries, but I loose the information from all of the other 23 columns. How can I remove all of the duplicate rows while keeping all of the information of my 23 columns?

galaxy • 856 views

ADD COMMENT • link •

modified 2.8 years ago by Jennifer Hillman Jackson ♦ 25k • written 2.8 years ago by h.stotz • 20

0

2.8 years ago by

Jennifer Hillman Jackson ♦ 25k

United States

Jennifer Hillman Jackson ♦ 25k wrote:

Hello,

There is no simple tool to perform a "sort unique" on a tabular dataset. Although this would be helpful. Let me ask around and open a ticket if there is interest (I'll post it back here).

Meanwhile, try the tool DataMash. It is similar to Group, but the columns to retain can be specified.

Thanks, Jen, Galaxy team

ADD COMMENT • link written 2.8 years ago by Jennifer Hillman Jackson ♦ 25k

Please log in to add an answer.

Similar posts • Search »

Removing Duplicate Lines From A Table
Hi, I have a table and would like to remove duplicate lines based on values in the first column....
Duplicate Row Removal in Merged FeatureCounts
Hello, I am trying to transfer merged featurecount data into an R-studio package called RNASe...
Galaxy: Use header row of tabular data for column names?
Is there a way to use metadata in a header row of a tab-delimited format for column names? Altern...
Column Concatenation
Hi , I am trying to fetch multiple column from a table here is my code for row in result_set: ...
Critical Feedback
This student was more adventurous. I think he actually could do more of what he tried with more e...
Compute An Expression On Every Row Question
Hello, I am looking for the right way to do a computation using "text manipulation, compute an e...
Duplicate row names error with EdgeR on FeatureCounts files
Hi all, I created tabular count files using FeatureCounts with a GTF file (iGenomes, UCSC hg38) ...
Text Editing
Hello Luce, I can explain the use of the tools "Text Manipulation". For each file independently,...
Processing Gff3 File Of Variants
Hello all, I have a GFF3 file of variants from nextgen sequencing and want to find non-synonymou...
Lefse data adjusting to get a meaningful plot result
Hi all I have a little question that I will try to explain as far as possible. So basically I ha...
How to obtain up/down regulation column for DEG edgeR results
Hi All, I've gone through and done all my contrasts for a rather large DE experiment using edgeR...
filter tool data table (remove_value)
Hi, I use a tool data table to build a dropdown menu. I have my table defined in tool_data_table...
Blast2Go Local Instance Re: Table With Gene Count Reads
Howdy, Thanks Jen, I will try it tomorrow. I installed Blast2Go from the Toolshed in my lo...
June 23, 2011 Galaxy Development News Brief
June 23, 2011 Galaxy Development News Brief http://galaxyproject.org/wiki/Features/DevNewsBrief/...
Modifying Text in a Column
I have a data that is several columns. Below is data from the column I want to modify. **ENSGAL...
Remove sequences with duplicate chromosome start position
I used GALAXY to extract the 1000 bp upstream of all UCSC genes (i.e. promoters). I sorted the da...
MD Plot with Glimma starting from DESeq2 results
Hi, I'm trying to visualize results from the DESeq2 pepeline in interactive html, useing Glimma....

Content

Help

About
FAQ

Access

RSS
Stats
API

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by Biostar version 16.09

Traffic: 179 users visited in the last hour