Does anyone knows how to generate dbSNP "Reference-Ordered Data" (ROD) file ?

Heads up! This is a static archive of our support site. Please go to help.galaxyproject.org if you want to reach the Galaxy community. If you want to search this archive visit the Galaxy Hub search

Latest

Open

RNA-Seq

ChIP-Seq

SNP

Assembly

Forum

Home

Welcome to Galaxy Biostar! User support for Galaxy! about • faq • rss

Log In

Sign Up

Question: Does anyone knows how to generate dbSNP "Reference-Ordered Data" (ROD) file ?

0

3.1 years ago by

bio_vitus • 0

United States

bio_vitus • 0 wrote:

I like to know how to generate dbSNP "Reference-Ordered Data" (ROD) file from dbSNP data and is it possible

to generate it for per human chromosomes.

Appreciate your help.

Sincerely,

bio_vitus

dbsnp • 1.2k views

ADD COMMENT • link •

modified 3.1 years ago by Jennifer Hillman Jackson ♦ 25k • written 3.1 years ago by bio_vitus • 0

1

3.1 years ago by

Jennifer Hillman Jackson ♦ 25k

United States

Jennifer Hillman Jackson ♦ 25k wrote:

Hello,

ROD data is reference annotation that is anchored by genomic position. This makes it easy to compare two datasets together using position.

If you use the built-in human hg_g1k_v37 genome (1000 genomes), then dbSNP is already indexed (sourced from the GATK resource bundle). If you want to use hg38/hg19 or any other build, then the idea is to locate a VCF dataset based on the same exact reference data (dbSNP or any other annotation that you want to link in). GATK itself, NCBI, UCSC, and others can be good data sources. A google should help narrow down the choices.

This prior Biostars question addresses essentially the same question with the bonus of covering the importance of confirming format (including the ordering of chromosomes in input data). Hopefully it will help: https://www.biostars.org/p/8212/

This might help as well, when deciding on which genome to use (hg_g1k_v37 or another as a Custom genome). It covers the details of that genome, plus format, and chromosome order when using a CG or linking in other annotaion data: Fasta Format, Custom Genomes, and GATK Chromosome ordering

If you just want to work on one chromosome, then you can use the full ROD dataset but only include the VCF containing the variant calls for the chromosome of interest. Or you can filter both down. Use the tool VCF Filter.

Thanks! Jen, Galaxy team

ADD COMMENT • link modified 3.1 years ago • written 3.1 years ago by Jennifer Hillman Jackson ♦ 25k

Please log in to add an answer.

Similar posts • Search »

DBSNP, Variant Caller
I have a VCF file from FreeBayes Variant caller. However I want VCF file to have dbSNP (rsID). T...
Mapping using multiple reference sequences
Hi I am using BWA to map my FASTQ files against two different reference sequences (V3-1 and V3-2...
Upload custom reference genome
We would like to know how to upload reference genome in my Galaxy account for the the species: L...
data orders in plotcorrelation
Hi, I am using galaxy-deeptools to analyze my data. But when I used plot correlation to visualize...
Generation of reference using CLC Genomic Workbench
Hello everyone, I have to analyse the transcriptome data in order to identify differentially e...
Using BWA with Illumina data and metagenomic "reference genome"
Hi, I'd like to try using BWA to align Illumina reads to some contigs a collaborator made from my...
Multi-join name of the dataset
Hi, I am analyzing RNA-seq data on multiple fastq files. I processed all of them to get the Feat...
Stitch maf Blocks Returns All Gaps
I am trying to use stitch gene blocks on a pairwise alignments I generated using progressivecactu...
Data Upload...
We have large files that cannot be uploaded using the "file upload" command and instead would nee...
Fasta Files from FTP sites
Hi, I have used FTP to download the mouse genome from NCBI, Ensembl, and UCSC. When I navigate t...
Line Estimation For Pileup Generation
Hello, I am curious if the line estimation shown in the history window for pileup generation is ...
Axt And Nib Files For Alignseq.Loc
I was wondering how to get a hold of the axt and/or nib files required by alignseq.loc. It's not ...
Help
Hello Pranathi, Sorry that you are having problems. Instead, use this Galaxy tool and the links ...
Bacterial genome annotation using annovar
Hello, am writing my own pipeline in python in order to annotate bacterial genome MTB, am new in ...
Cufflink Not Working
Dear Galxy admin and user I have generated BAM file from my RNa seq data by using Bowtie with cu...

Content

Help

About
FAQ

Access

RSS
Stats
API

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by Biostar version 16.09

Traffic: 172 users visited in the last hour