Realigner Target Creator will not run?

Heads up! This is a static archive of our support site. Please go to help.galaxyproject.org if you want to reach the Galaxy community. If you want to search this archive visit the Galaxy Hub search

Latest

Open

RNA-Seq

ChIP-Seq

SNP

Assembly

Forum

Home

Welcome to Galaxy Biostar! User support for Galaxy! about • faq • rss

Log In

Sign Up

Question: Realigner Target Creator will not run?

0

2.1 years ago by

frankie.north • 10

frankie.north • 10 wrote:

I have been running the pipeline below to try and call SNPs from RNA-Seq data but have encountered problems with Realigner Target Creator Tool in Galaxy. Can anyone see any obvious problems in the pipeline?

Import ucsc.hg19.fasta, ucsc.hg19.dict, ucsc.hg19.fasta.fai, ucsc hg19 snps, 1000G indels and RNA-Seq data.

Convert RNA-Seq data into BED

Convert RNA-Seq data into FASTQ

FastQC on RNA-Seq data

FASTQ Groomer on RNA-Seq data

FASTQ Splitter into forward and reverse reads (RNA-Seq data originally paired end)

Map with BWA for Illumina on forward and reverse reads

IdxStats on BWA output

Sort by chromosomal coordinate

RmDup on RNA-Seq data

Filter on RNA-Seq data for mapped reads and reads in proper pairs

ValidateSamFile to check for errors (no read groups assigned)

AddOrReplaceReadGroups on RNA-Seq data

ReorderSam to remove lexicographical sort

Filter for chromosome 1 to narrow down data size

ValidateSamFile to check for further errors (nucleotide difference in file does not match reality and mate not found for paired reads given)

I have tried to run Realigner target creator as a prerequisite for the Indel Realigner however it will not work, bringing up the error "Lexicographically sorted human genome sequence detected in reads". I would have thought this problem had already been solved by running the ReorderSam step? Could this be that my reference genome is the problem somewhere? When running the Realigner target creator I can only use the imported fasta hg19 file, as it does not bring up any locally cached references?

Thanks, Frankie.

snp gatk genome realignertargetcreator sorting • 717 views

ADD COMMENT • link •

modified 2.1 years ago by Jennifer Hillman Jackson ♦ 25k • written 2.1 years ago by frankie.north • 10

0

2.1 years ago by

Jennifer Hillman Jackson ♦ 25k

United States

Jennifer Hillman Jackson ♦ 25k wrote:

Hello,

The GATK tools at http://usegalaxy.org are indexed for human using hg_g1k_v37.

My guess is that your hg19 fasta file is not sorted in the GATK expected way (it is very specific). Help to do that is here and includes extra help for custom genomes: https://biostar.usegalaxy.org/p/14777/

Best, Jen, Galaxy team

ADD COMMENT • link written 2.1 years ago by Jennifer Hillman Jackson ♦ 25k

Please log in to add an answer.

Similar posts • Search »

Too few reads after VarScan on RNA-Seq data?
Hi all, I have been trying to initiate a protocol to call SNPs in RNA-Seq data, but have had a f...
How to Realign Indels and logic of workflow
I have paired end reads of a trio that I eventually need to call variants on (make a .vcf). So I'...
Where to get SNPs and Indels?
I downloaded some files from Broad Institute but Realigner Target Creator won't work with these f...
Need help with "Realigner Target Creator" tool
Hi every one, If somebody familiar with using the GATK realigner target creator on galaxy? My d...
Freebayes issue on merged data
Hi, I am completely new to this field so I'm hoping this is a simple mistake on my part. I'm tr...
GATK indel realigner using custom reference
I've been using some SAM (converted to BAM) files and some partial genome assemblies (custom refe...
Trimmomatic output after filtering paired end data - fastqsanger vs fastqsanger.gz
After running Trimmomatic on a paired end PolyA RNA-seq dataset where both forward and reverse re...
suite_samtools_1_2 EOF marker absent error
Hi all, so I'm experiencing a problem running SAM-to-BAM using samtools 1.2 from a toolshed insta...
This job was terminated because it used more memory than it was allocated
Hello, I have always encountered such problem "This job was terminated because it used more memo...
GWAS: Fastq to BAM to VCF
I'm analyzing NGS data on website:usegalaxy.org in the following steps: 1. Fastq file Raw reads Q...
BAM to VCF format
I'm analyzing NGS data on website:usegalaxy.org in the following steps: 1. Fastq file Raw reads Q...
issue with realigner target creator
I read about a few other people's posts concerning realigner target creator but felt that my issu...
Problem with RealignerTargetCreator in the CloudMap workflow
My lab has previously used the CloudMap pipeline to identify the location of mutations from in C....
Problems With Picard And Gatk Tools
Dear all, I have been trying to analyze some recently acquired WGS reads (re-sequencing with MiS...
Problem with indels not appearing in FreeBayes output
Hello, I have 6 datasets for 3 individuals (forward and reverse DNA seuqences) that I am trying ...
failure preparing job - workflow
Hi, I managed to setup my own server (running on Ubuntu) and managed to perform the necessary a...
Problem With Bam And/Or Bai Files
Hello Galaxy Team, I have been using Galaxy for SNP detection for with great success. Basically...
Re: Getting Reference Index Files In Local Galaxy Install
Hi, We have a local install of galaxy and I'm trying to add the reference index files for bwa usi...
False negative results in GATK unified genotyper output
Hi all, I'm having some problems with the output of the GATK unified genotyper. Essentially I ...
>90% aligned concordantly 0 times ChIP-seq Bowtie2
Hi, I know this question have raised many times here and in other forums but I've tried everythi...

Content

Help

About
FAQ

Access

RSS
Stats
API

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by Biostar version 16.09

Traffic: 168 users visited in the last hour