Please see 'Tools on the Main server':
The RNA-seq tutorial (hosted at Galaxy) and the web sites/paper by the
tool authors should give you many good ideas for potential protocols.
Combining the files will not change the quality values. If this is a
Phred+33 scaled quality score, then yes, this is low. A double check
that the 'FASTQ Groomer' was run with the correct options would be the
first step. You also may want to run FastQC to generate broader
statistics. See the RNA-seq tutorial for details about running this
and then trimming sequences to improve overall quality.
A direct link is:
There are a likely many ways to do this, here are some:
1 - 'Get Data -> UCSC Main'
Track named "Human Proteins" with the primary table (blastHg18KG).
2 - 'Get Data -> BioMart'
Ensemble Genes 68, Danio rerio genes (Zv9). Filters -> Homologs ->
Help 'Using Galaxy'
Protocol 1 has examples of extracting data from the UCSC Table browser
and joining data - the methods can be applied to any similar data. If
you need to manipulate files, see Protocol 2, the last example is
multi-stepped and demonstrates that just about any file can be
to interval format and utilized.
3 - 'MAF predictions'
'Using Galaxy' (above) Protocol 5 has an alternate method for
"orthologs" (or maybe better described as 'syntenically conserved
homologs', since function is not evaluated) from conservation tracks.
Full details of MAF functions are in our 'Making whole genome
usable for biologists' paper:
The ZF Conservation track is not local to Galaxy, so you will need to
obtain the data from UCSC and FTP to Galaxy to work with it ('Get
is not an option). Review the track description in the UCSC browser
(track named "Conservation"), then find the data here:
Good luck for the choices you decide on!