I'm trying to use the compare datasets tool and can't get it to work.
My first file was uploaded in .fasta format, which I then converted to tabular and split the name column into multiple. All my fasta sequences are named ">hsa-circ-GENE_NAME-antisense.1", so I used the "convert" tool under text manipulation to convert "-" (dashes) to tab, which resulted in a 6-column tabular file with column 3 being the gene name.
The second file is a list of genes which I want to scan for the presence of in my first file. This is a .txt file, which I uploaded and then used the same convert tool (changing white-spaces to tabs) to change this to tabular form.
Then I went to the compare datasets tool and tried to compare column 3 of file 1 to column 1 of file 2, but it doesn't return genes from file 1 which I know are present in both datasets. For some reason it only returns genes which have very short names (eg: F2), and even then the list is very short.
I would very much appreciate some help with this!