Question: Stuck
gravatar for Hsu, Amy (NIH/NIAID) [E]
8.2 years ago by
I'm sure this is a silly question, but I have been stuck since yesterday. I have some whole genome sequencing data - What was given to me is an enormous XL file with all the single base changes identified in 4 people. I don't want to track down all the changes, yet I've noticed some of them labeled as "novel" are in fact SNPs - particularly if I use build 130. I imported just chr, start, end using same base number for start/end from part of my file (chr 1) and then pulled down all the SNPs from UCSC for Chr. 1. What I would like to do is label the lines in my file that are snps. I have tried intersect, join, subtract, all to no avail. What am I doing wrong? Any help would be appreciated. thanks - Amy Hsu
snp • 658 views
ADD COMMENTlink modified 8.2 years ago by Jennifer Hillman Jackson25k • written 8.2 years ago by Hsu, Amy (NIH/NIAID) [E]20
gravatar for Jennifer Hillman Jackson
8.2 years ago by
United States
Jennifer Hillman Jackson25k wrote:
Hello Amy, For your own SNP file, create it so that the start is "0-based" to be consistent with the BED (aka interval) file format used by UCSC and Galaxy. This means that if a SNP is located at base "36" on chr1, then your file would be: chr1 35 36 Double check that the chromosome naming format is exactly the same (capitalization matters) and this should fix the joining problems. A "join" is probably what you want to do if you have the entire UCSC SNP file. The "profile annotation" function would pull out UCSC's SNP information directly plus other features and may also be interesting to test out. More help is at: Please let us know if this does not help, Jen Galaxy Team -- Jennifer Jackson
ADD COMMENTlink written 8.2 years ago by Jennifer Hillman Jackson25k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 172 users visited in the last hour