Normalization on Single End and Paired End reads

Question: Normalization on Single End and Paired End reads

Ismaelrymy • 0 wrote:

Hi there.

I want to assemble a transcriptome with Trinity from a mix of paired-end and single-end reads. Because of memory limits I have to normalize the reads first, but Trinity's normalization step cannot handle this mix of data. Is there a tool in Galaxy that I can use to normalize the reads before assembly? The data are in fastqsanger format.

Thank you very much for your time.

Error from the randomization step: "Error, pairs.K25.stats is empty. Be sure to check your fastq reads and ensure that the read names are identical except for the /1 or /2 designation. at /opt/packages/trinity/2.2.0/util/insilico_read_normalization.pl line 882."

modified 7 months ago by Jennifer Hillman Jackson ♦ 25k • written 7 months ago by Ismaelrymy • 0

Jennifer Hillman Jackson ♦ 25k wrote:

Hello,

This utility has not been wrapped for Galaxy in the ToolShed (https://usegalaxy.org). However, it might be available at the Trinity Galaxy server: http://ncgas.org/galaxy_trinity_faq.php. Because that is a domain-specific server, they may also be able to provide you with more resources than Galaxy Main (https://usegalaxy.org). Apply for an account to see if they can help.

That said, I wouldn't expect the tool to accept both paired-end and single-end sequences as input. Trinity as hosted at Galaxy Main does not, even though mixing the two is an option on the command line. Inputs are expected to all be the same length and to contain only matched paired-end datasets.
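The error in the question asks for read names that are identical except for the /1 or /2 designation, which is what "matched paired-end datasets" means in practice. Here is a minimal sketch of that check, assuming uncompressed four-line FASTQ records. The function names are illustrative only and are not part of Trinity or Galaxy:

```python
import re
from itertools import zip_longest


def read_names(fastq_lines):
    """Yield the read name from each 4-line FASTQ record."""
    for i, line in enumerate(fastq_lines):
        if i % 4 == 0:
            # Header line: drop the leading '@' and anything after whitespace.
            yield line.strip()[1:].split()[0]


def base_name(name):
    """Strip a trailing /1 or /2 mate designation, if present."""
    return re.sub(r"/[12]$", "", name)


def find_mismatched_pairs(forward_lines, reverse_lines):
    """Return (record_index, forward_name, reverse_name) for unmatched mates.

    A missing mate (one file is shorter than the other) is reported
    with None in place of the absent name.
    """
    mismatches = []
    pairs = zip_longest(read_names(forward_lines), read_names(reverse_lines))
    for index, (fwd, rev) in enumerate(pairs):
        if fwd is None or rev is None or base_name(fwd) != base_name(rev):
            mismatches.append((index, fwd, rev))
    return mismatches
```

An empty result means every forward read has a same-named reverse mate at the same position. Any entries point at the first places where the two files drift out of sync.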

If you just want a random sample, convert the forward (or reverse) reads with Fastq-to-Tabular, run Select random lines from a file on the result, then convert back with Tabular-to-Fastq. After that, sync up the other side of the pair with the tools FASTQ interlacer and FASTQ de-interlacer, so that only reads with a surviving mate remain paired.
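The same random-sample idea can be sketched outside Galaxy. The version below is an illustration under stated assumptions, not the Galaxy tools themselves: it assumes uncompressed four-line FASTQ records with both files already in the same order, and it makes one random decision per pair so that both mates are kept or dropped together, which avoids a separate sync step. Note that this is plain random subsampling, not Trinity's coverage-based normalization. The function names and the `fraction` and `seed` parameters are hypothetical:

```python
import random


def read_records(lines):
    """Group FASTQ lines into 4-line records (header, sequence, '+', quality)."""
    record = []
    for line in lines:
        record.append(line)
        if len(record) == 4:
            yield tuple(record)
            record = []


def subsample_pairs(forward_lines, reverse_lines, fraction, seed=0):
    """Keep roughly `fraction` of the read pairs, always keeping mates together.

    Assumes record N in the forward input is the mate of record N in the
    reverse input. Returns two lists of FASTQ lines.
    """
    rng = random.Random(seed)  # fixed seed makes the sample reproducible
    kept_forward, kept_reverse = [], []
    pairs = zip(read_records(forward_lines), read_records(reverse_lines))
    for fwd, rev in pairs:
        # One draw per pair, so a pair is never split.
        if rng.random() < fraction:
            kept_forward.extend(fwd)
            kept_reverse.extend(rev)
    return kept_forward, kept_reverse
```

With real data, the two inputs could be open file handles and the returned lines written out with `writelines`. Single-end reads would have to be sampled separately, since they have no mate to keep in sync.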

Your final option is to set up your own Galaxy server and allocate enough memory to process the entire dataset: https://galaxyproject.github.io/

Thanks! Jen, Galaxy team

written 7 months ago by Jennifer Hillman Jackson ♦ 25k
