I'm a complete beginner at using Galaxy, and I was hoping that someone could help me find the right tool(s) to determine how many gene locations are contained in a dataset I have.
(Some truth in advertising up front: I'm asking because I'm taking an online course on Galaxy, and this question is from a quiz I recently completed. I got the question wrong, and although I know the right answer now, I don't know how to get the right answer. I'd ask my fellow course participants/instructors, but there is no way to do so without the answer being visible to all students, many of whom have not yet taken the quiz.)
The dataset I have is a list of ~1700 transcripts from the human X chromosome. I'm trying to find out how many gene locations are contained within those transcripts. I downloaded the UCSC data for the hg19 X chromosome and have tried various combinations of the Group/Join/Intersect tools, but I don't come anywhere near the right number.
Can anyone help me identify the right set of tools to determine, when given a set of transcripts from a known organism/location, how many known genes are contained within that dataset?
Thanks in advance for any assistance.