Question: how to convert nucleotide strand from BioMart (1, -1) to strands compatible to Galaxy (+, -)
0
gravatar for mtjmar002
2.7 years ago by
mtjmar0020
France
mtjmar0020 wrote:

I cannot figure out how to convert strands coordinates downloaded from BioMart (-1, 1) to strands coordinates compatible with Galaxy (-, +) in the c4 column of a BED file. There is a paper saying to compute the expression:

c4==1 and "-" or (c4==1 and "+" or "")

I've tried this and other small variations of this expression but none seems to work.

Can anyone help me pease? It should be the same as in Python...

ADD COMMENTlink modified 2.7 years ago by Jennifer Hillman Jackson23k • written 2.7 years ago by mtjmar0020
1
gravatar for Jennifer Hillman Jackson
2.7 years ago by
United States
Jennifer Hillman Jackson23k wrote:

Hello,

UPDATED:

This can be done in two steps:

  1. Using the tool "Compute an expression on every row", use the term:
    str(int(c4)).replace("-1","-").replace("1","+")
  2. From here, in interval format (the dataset you describe is not really BED format. BED format has strand in the 6th column), one can reassign strand as the 5th column and use the dataset from there with many tools without issue.
  3. Or, using the tool "Cut", pull out just the columns for a 'clean' final interval dataset:
    c1,c2,c3,c5
  4. (Optional) History menu (gear icon): Extract workflow! Edit/label as desired. 

Improved solution via Dan Blankenburg, Galaxy team

Best, Jen, Galaxy team

ADD COMMENTlink modified 2.7 years ago • written 2.7 years ago by Jennifer Hillman Jackson23k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 55 users visited in the last hour