Hi! After pileup file generation and convertion of pileup to interval, potentially we could extract the consensus using the extract genomic DNA option. But it asks you to first tag the sequence using one of the existing Database/Build options which are quite limited. How can you add more? Is there another way to extract the consensus from a pileup file? Thank you! Angeliki
Hello,
The 10 column pileup format includes the consensus in the 9th field. You could use a tool like "Tabular-to-Fasta" to extract it directly.
If you use the positional coordinates against the custom reference genome used to initially map with, and pull out that sequence, it is possible, but be aware that the sequence won't include any of the variations from your read data. If that is your goal (maybe you want to have both), the Extract tool can be used with the option to get the reference genome fasta data from the history. No need to assign the database with that usage.
- Tool: Extract Genomic DNA
- Option: Choose the source for the reference genome
- Setting: From the history
- Option: Using reference genome
- Setting: Pick your custom genome fasta dataset
If you ever do need a custom genome's database assigned to a dataset (some tools do work better that way -- but not Extract), a custom genome can be promoted to a custom build. This creates a "database" specifically associated with your account that can be assigned the same as any other database (Pencil icon > Edit Attributes > first tab, genome/database selection > Save)
FAQ: https://galaxyproject.org/support/
- Preparing and using a Custom Reference Genome or Build https://galaxyproject.org/learn/custom-genomes/
Thanks! Jen, Galaxy team
Hi Jen, thank you for your answer! My column 9 contains various characters (like .,g, $) and if you try tabular to fasta it only creates different fasta files (not one consensus) with all these characters under each heading...Column 4 looks more like the sequence, but still it generates various fasta files with different headings (not just one consensus). Any ideas? Best regards, Angeliki