Hi Galaxy Community,
The bedTools tools were updated recently with some great additions. Some of these updated tools require a 'Genome file', which is a file containing the size of the chromosomes of your reference genome. This should be a two column tabular file with the chromosome name in the first column and the END coordinate of the chromosome in the second column, see an example below (for mm9):
chr1 197195432
chr2 181748087
chr3 159599783
chr4 155630120
chr5 152537259
chr6 149517037
chr7 152524553
chr8 131738871
chr9 124076172
chr10 129993255
chr11 121843856
chr12 121257530
chr13 120284312
chr14 125194864
chr15 103494974
chr16 98319150
chr17 95272651
chr18 90772031
chr19 61342430
chrX 166650296
chrY 15902555
These are the bedTools tools requiring a 'Genome file':
Convert from BEDPE to BAM
FisherBed
ComplementBed
Intersect multiple sorted BED files
Merge BedGraph files
SlopBed
Genome Coverage
ShuffleBed
Convert from BED to BAM
RandomBed
FlankBed
Since the reference genome is associated with files input to bedTools does it make sense to have the user input the 'Genome file' when the same information is likely available in the inner workings of Galaxy? Having the user create and recall the 'Genome file' seems like an opportunity to introduce unnecessary errors.
Cheers,
Mo