Question: Picard Bam Statistic
0
gravatar for i b
6.3 years ago by
i b330
i b330 wrote:
Hi, I have used for the first time this tool, picard bam statistic. I have aligned my reads to a custom genome (7904 bp long) and had the following output: length= 7904 Aligned= 44 Unaligned= 0 Why is unaligned =0. I had the unaligned =0 also when aligning agaist hg19... thanks, ib
• 1.4k views
ADD COMMENTlink modified 6.3 years ago by Jennifer Hillman Jackson25k • written 6.3 years ago by i b330
0
gravatar for Jennifer Hillman Jackson
6.3 years ago by
United States
Jennifer Hillman Jackson25k wrote:
Hello, This means that your BAM file contains only aligned data. Running another tool that counts up similar statistics, such as "NGS: SAM Tools -> flagstat" could be used to confirm the counts. Best, Jen Galaxy team -- Jennifer Jackson http://galaxyproject.org
ADD COMMENTlink written 6.3 years ago by Jennifer Hillman Jackson25k
Hi all, We have recieved some bam files that were aligned to hg18. What would be the easiest workflow to get a VCF file from GATK in build hg19 ? We are running a local galaxy with only hg19 for the moment. Lifting the bam file would be my first choice, providing support to hg18 by generating all indices would be my last :-) Best regards, Geert
ADD REPLYlink written 6.3 years ago by Geert Vandeweyer30
Hello Geert, For the best results, especially for SNPs, you will want to map directly to the target genome. The genome Galaxy is using is the same primary human genome the GATK team also uses - the 1000 genomes build 37 -> "hg_g1k_v37". Click on the GATK links from one of the tools to see the details. GATK provides liftOver files between the the genomes, and you could install and use these with the liftOver tool, but not for BAM datasets. Inputs are BED, Interval, GFF. (BAM -> SAM -> interval). GATK also provides indexes (lifted) for hg19, but Galaxy does not provide an hg19 genome that is sorted appropriately for GATK, or at least not yet. RNA-seq tools and most other tools up until now required sorting in one way, and now GATK requires sorting in another, but keeping the database dbkey the same is important for visualization and other functions. It can get complicated when moving between tools in a history. We will likely have some 'best practice' solutions soon, but for now, use the 1000 genomes build to keep it all simple: Human (Homo sapients) (b37): hg_g1k_v37 The good news is that installing this genome has been greatly simplified. The genome and indexes are now available on an rsync server. You can simply download and add the genome directory and all the contents. You will still need to create the .loc file entries but the rest is done. http://wiki.g2.bx.psu.edu/Admin/Data%20Integration The "dbkey" is "hg_g1k_v37" Hopefully one of the options works out for you! Jen Galaxy team ps: You post ended up threading behind another post. I am not sure if this was because you started with a reply, but changed the subject line? This is not enough to start a new thread. Instead, please create a brand new message in your email client, then copy over the mailing list email address, add a subject line, and this will start a new thread that will get tracked and not missed. Thanks! -- Jennifer Jackson http://galaxyproject.org
ADD REPLYlink written 6.3 years ago by Jennifer Hillman Jackson25k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 179 users visited in the last hour