How to count HiSeq sequences at Galaxy?

Heads up! This is a static archive of our support site. Please go to help.galaxyproject.org if you want to reach the Galaxy community. If you want to search this archive visit the Galaxy Hub search

Latest

Open

RNA-Seq

ChIP-Seq

SNP

Assembly

Forum

Home

Welcome to Galaxy Biostar! User support for Galaxy! about • faq • rss

Log In

Sign Up

Tool: How to count HiSeq sequences at Galaxy?

0

4.5 years ago by

kp5091 • 0

United States

kp5091 • 0 wrote:

Is there any way to count transcripts? I am now grooming and filtering my HiSeq data. I would like to know how many sequences were excluded by these steps.

rna-seq tool • 1.4k views

ADD COMMENT • link •

modified 4.5 years ago by Jennifer Hillman Jackson ♦ 25k • written 4.5 years ago by kp5091 • 0

0

4.5 years ago by

Jennifer Hillman Jackson ♦ 25k

United States

Jennifer Hillman Jackson ♦ 25k wrote:

Hello,

For sequence data (.fastq types), tools in the group "Text Manipulation", "Filter and Sort", and "Join, Subtract, Group" can be used in combination to do many counting and summary tasks. Convert any Fastq data to tabular first ("NGS: QC and manipulation -> Fastq to tabular) to make the data available to these tools, or use "Filter" to pull out lines that are just for sequence identifiers.

You mention "transcripts" - does this mean that you have proceeded with a pipeline such as the tuxedo analysis in "NGS: RNA-seq"? If so, and you want stats from that level as well - summary counts can be obtained from certain of these output files (specifically the tracking files, and more advanced counts by comparing to input GTF/GFF3 reference annotation). See the CuffDiff manual for how these datasets are formatted (the dataset name and original file name from the tool will be similar). Our hub for RNA-seq analysis with many link-outs to resources can be found here: https://wiki.galaxyproject.org/Support#Tools_on_the_Main_server:_RNA-seq

Hopefully this helps but if you need more details for a specific task not covered here please let us know!

Best, Jen, Galaxy team

ADD COMMENT • link written 4.5 years ago by Jennifer Hillman Jackson ♦ 25k

Please log in to add an answer.

Similar posts • Search »

Using a reference transcriptome as a custom genome with tools
Hi, I want to know if I can build a transcriptome using the sequences of other transcriptome t...
Metagenomics workflow / tutorial using paired-end sequencing (HiSeq)
I came across this galaxy workflow that was originally used with 454 sequences. https://usegala...
Clip Adapter Sequence
Good morning, I am very new in using Galaxy. I would like to use Clip to remove the adapter sequ...
Using HTseq and DEseq2 for RNAseq quantitation, format of input data?
Hi all, For clarities sake my experimental setup is: 2 conditions (treated + non treated), with ...
Question About Sorting Barcoded Sequences
Hi, I have an Illumina HiSeq lane of sequences, in which I input multiple samples with 5-prime 6 ...
How to use Mothur Unique.seqs
I see two window 1. Sequences to filter I use this to select sequence(s) of new assembling 2. ...
Data analysis workflow in Galaxy
Hello to everyone, I am quite new in bioinformatics and I need little help. I got task to implem...
galaxy can't read the fastq data
I have uploaded some of ChIP-seq compressed as gz to Galaxy. After uploading, those gz file becom...
Illumina HiSeq reads and Fastq joiner
Hello, I have paired-end reads (Illumina HiSeq) from a metagenome in two Fastq files. These non...
Trimmomatic and adapters
Hi all, As relatively new within the bioinformatics world, I am a bit confused when it comes to ...
Extracting the read counts from a collapsed fasta file?
I have collapsed my fastq file so I know have the output fasta file which contains all the unique...
Analyzing ChIP-seq data from HiSeq 3000
I have acquired ChIP-seq data from an Illumina HiSeq 3000. I have followed the analysis workflow ...
How to count unique short sequences in FASTQ
Hi, I wonder whether there is any tool that I could use to output the number of unique reads. I d...
Need help with "FASTQ Groomer" tool
Hi, I try to groom a sequence from NCBI sequenced from ilumina Hiseq 2000 by BGI, it's running...
Correct Input Into The Minimum Alignment Count
Dear all, I changed the minimum alignment count to: 100, 400, and 1000 minimum alignment ...
Getting RT stops
I'm looking to count RT stops for a sequencing job I performed. I have a reference sequence that ...
RNA seq beginner
I just got back RNA seq data produced on the HiSeq 4000. They prepared the libraries and performe...
Can I run the megablast against my own database?
Hi, I'm trying to fish out antibiotic resistant genes from a hiseq data generated from human sto...
RNA-Seq High Expression Genes Lost
Hi I'm noticing that after running my BAM files through CuffDiff and CuffNorm that some genes th...
How to make uploaded data available in Huttenhower galaxy
Hello, I successfully uploaded several HiSeq bacterial shot gun metagenomic datasets to the main ...
Trim Adapters And Other
Hi, all! I have a maybe naive question that might be not so related to Galaxy usage. So I got th...
Galaxy HTSeq tool give zero counts
Hi, I tried galaxy HTseq tool on BAM file generated by HISAT2 to generated counts. I provided GT...

Content

Help

About
FAQ

Access

RSS
Stats
API

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by Biostar version 16.09

Traffic: 169 users visited in the last hour