Question: Problem merging five 13.9 GB .BAM files
rickwhite23 wrote (6 months ago):

I keep getting the error below when I try to merge five 13.9 GB .BAM files. Any ideas on how I can get it to work?

Fatal error: Matched on error: Picked up _JAVA_OPTIONS: -Djava.io.tmpdir=/galaxy-repl/main/jobdir/016/042/16042355/_galaxy_tmp -Xmx7680m -Xms256m [Wed Jun 07 11:31:14 CDT 2017] net.sf.picard.sam.MergeSamFiles INPUT=[/galaxy-repl/main/files/020/211/datase

Tags: samtools, bam
Jennifer Hillman Jackson (Galaxy team) wrote (6 months ago):

Hello,

The job might be exceeding the memory allocated to the tool, or there could be a problem with the inputs. Specifically, the BAMs may need to be re-sorted, or there may be a reference genome mismatch that should be corrected.
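For the re-sorting point, one quick check outside Galaxy is the `SO:` field of each BAM header, which is normally extracted with `samtools view -H input.bam`. A minimal sketch (the header text below is invented so the example is self-contained):

```shell
#!/bin/sh
# Sketch: inspect the sort order recorded in a BAM header.
# With a real BAM, the header would come from `samtools view -H input.bam`;
# a text copy is written here so the example runs anywhere.
printf '@HD\tVN:1.6\tSO:coordinate\n@SQ\tSN:chr1\tLN:1000\n' > header.txt

# The SO: field of the @HD line records the sort order; Picard's
# MergeSamFiles expects compatible sort orders across all inputs
# ("coordinate" in most workflows).
sed -n 's/.*SO:\([a-z]*\).*/\1/p' header.txt
```

If this prints anything other than `coordinate` for one of the five inputs, sorting that BAM first is a likely fix.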

Please see: https://galaxyproject.org/support/#troubleshooting

Thanks! Jen, Galaxy team

rickwhite23 wrote (6 months ago):

Thanks for your quick response. I have a couple of additional questions:

On memory - the files I have uploaded total only 69.5 GB; I deleted all other files. I assume the merged file should also be 69.5 GB, so combined the data would take up only 139 GB of the 250 GB allocated. Is this right?

As far as the mismatched reference genome - all files have the same reference.

With respect to the "re-sort" - I listed the files from 1-5; are you suggesting that I try 5-1?

Thanks again for your help

Rick


Jennifer Hillman Jackson replied (6 months ago):

Hi Rick,

Thanks for sending more feedback. The memory used to process jobs is distinct from the available disk space in your account. "Re-sorting" means sorting the BAM datasets themselves, not reordering the inputs (help in the link below). Also, just to double-check: did you examine the BAM headers to make certain that they are each identical? If so, based on this extra info, the solution might be one of these:

  • Try sorting the BAM datasets, then filtering out unmapped reads (to reduce size), then execute a merge.
  • Try a few reruns to see if a different cluster node is able to process all of the data in batch as one job.
  • Merge fewer BAMs in any one job, then try to merge those results to produce the final result.
  • Convert BAM-to-SAM format (without headers for all inputs, then with just the header for one of them, as all should be identical), use the Concatenate tool to merge the header with the SAM lines, convert SAM-to-BAM, then sort the final BAM (coordinate sort is best for most use cases).
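The last option above is essentially text manipulation once the data is in SAM format. A minimal command-line sketch of the same idea (the reads and reference names are invented; in Galaxy the BAM-to-SAM, Concatenate, and SAM-to-BAM tools perform the equivalent steps):

```shell
#!/bin/sh
# Sketch of the SAM-level merge: one header block plus the alignment
# lines of every input. All data here is made up for illustration.
printf '@HD\tVN:1.6\tSO:coordinate\n@SQ\tSN:chr1\tLN:1000\nr1\t0\tchr1\t10\t60\t4M\t*\t0\t0\tACGT\tIIII\n' > a.sam
printf '@HD\tVN:1.6\tSO:coordinate\n@SQ\tSN:chr1\tLN:1000\nr2\t0\tchr1\t20\t60\t4M\t*\t0\t0\tTTTT\tIIII\n' > b.sam

# Header lines start with '@': keep the header from one input only...
grep '^@' a.sam > merged.sam
# ...then append the alignment (non-header) lines from every input.
grep -hv '^@' a.sam b.sam >> merged.sam

# merged.sam now holds 4 lines: 2 header lines + 2 alignment lines.
wc -l < merged.sam
```

This only produces a valid result when the inputs really do share identical headers, which is why the header check matters.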

See the help sections for sorting inputs and checking chromosome identifiers for details: https://galaxyproject.org/support/#getting-inputs-right-
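Confirming that the inputs are identical at the header level can also be done with a plain diff of the extracted headers. With real BAMs, `samtools view -H file.bam > headerN.txt` would produce these files; the header text below is invented to keep the sketch self-contained:

```shell
#!/bin/sh
# Sketch: confirm two inputs share byte-identical headers before merging.
printf '@HD\tVN:1.6\tSO:coordinate\n@SQ\tSN:chr1\tLN:1000\n' > header1.txt
printf '@HD\tVN:1.6\tSO:coordinate\n@SQ\tSN:chr1\tLN:1000\n' > header2.txt

# diff exits 0 only when the files match exactly.
if diff -q header1.txt header2.txt >/dev/null; then
  echo "headers match"
else
  echo "headers differ - resolve before merging"
fi
```

Any difference in the `@SQ` lines (sequence names or lengths) points to a reference genome mismatch between the inputs.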

If the job still fails after this, and you would like a second opinion on the content/format before moving to a local/cloud Galaxy to process the data (the final option), a bug report from one of the failed jobs can be sent in. Please leave all of the original, intermediate, and error datasets undeleted so we can review them. Including a link to this Biostars post in the comments will help us associate the report with this question. You can also add details in the history comments section.

Thanks! Jen
