I am using use galaxy for genome wide SNP discovery. I have discovered about 770,000 SNPs. After performing operations in galaxy I have been able to get sequences for each of them.I obviously have a pool of several genotypes. At this point I have 2 questions 1) How should annotate my vcf file so that I know in the end from where does each SNP come from? 2) Are there any tools for dealing with rows in galaxy. The tools in text manipulations mostly involve columns, or I cant get hold of tools working with rows. I need to delete alternate rows which I do not require. I am certain somebody must have done it in the past. Hope to hear from you soon, I am struggling with this for few days.