problems with PE reads upload

Heads up! This is a static archive of our support site. Please go to help.galaxyproject.org if you want to reach the Galaxy community. If you want to search this archive visit the Galaxy Hub search

Latest

Open

RNA-Seq

ChIP-Seq

SNP

Assembly

Forum

Home

Welcome to Galaxy Biostar! User support for Galaxy! about • faq • rss

Log In

Sign Up

Question: problems with PE reads upload

0

2.5 years ago by

pablinperaza • 0

pablinperaza • 0 wrote:

Hi. I am new in galaxy and I am having problems when uploading samples. I am uploading PE reads (R1 and R2), using Pear for pairing sets, concatenate datasets (have files w/6Million reads each) to have a unique file (around 30 millon reads) and then I performed FASTQC to that file. The FastQC report says that I have much more reads, and when I performed FASTQC just to one R1 sample it reports just 10% of the reads. Why it happens? Do I need to set any parameters to import? Thank you if you can help me. Kind regards, Pablo

rna-seq • 594 views

ADD COMMENT • link •

modified 2.5 years ago • written 2.5 years ago by pablinperaza • 0

0

2.5 years ago by

Jennifer Hillman Jackson ♦ 25k

United States

Jennifer Hillman Jackson ♦ 25k wrote:

Hello,

It is okay to upload paired datasets from this tool. However, do not concatenate multiple samples. Leave each distinct when loaded. Use FTP to make this simpler to organize when uploading or if any data is over 2 GB in size. https://wiki.galaxyproject.org/Support#Loading_data

Once the data are loaded, the issues with the tools should go away. Consider using a Dataset Collection and a Workflow (once the analysis path is known).

Thanks, Jen, Galaxy team

ADD COMMENT • link modified 2.5 years ago • written 2.5 years ago by Jennifer Hillman Jackson ♦ 25k

0

2.5 years ago by

pablinperaza • 0

pablinperaza • 0 wrote:

Hi Jennifer, thank you for your reply and the tutorials. They were very helpful.

Now my question is, when I get this results from tha paired reads, hoy can I merge them. Some facilities give me reads files in no more than 6millon reads, and they give me always more than 6 files per sample.

XX_L001_R1_001 XX_L001_R1_002 XX_L001_R1_003 XX_L001_R2_001 XX_L001_R2_002 XX_L001_R2_003

At the End I will have

XX_001 XX_002 XX_003

But, I would like to have XX (all paired reads in one file). Is it possible? In order to handle less files. Thank you. Pablo

ADD COMMENT • link written 2.5 years ago by pablinperaza • 0

Please log in to add an answer.

Similar posts • Search »

RNAseq data to be processed in two ways: (i) mapping to de novo Trinity-based transcriptome and (ii) mapping a relatively new genome
Hello all, I am new to RNAseq data and learning this process step by step, so I have a few quest...
Do I trim paired end sets first or merge and then trim?
Hi there, I have a quick question - I have raw 10Gb -sized datasets for my samples. For each one...
illumina PE quality control - NGS: QC & Manipulation
Hi, I have a PE illumina miseq data set (separate forward-R1 & reverse-R2) of a WGS of a par...
Paired-end imports produce 8 files. What to do?
Hi all, I've been scouring the forum for the "first step" in importing fastaq.gz (which unzips au...
Differential Expression in bacteria RNA-seq
Hello, I got a problem in Cufflinks: I need to analyze fastq file from a RNA-seq (I got bacteria...
TOPHAT align half of the reads
Since I am using a laptop, I'd like to exploit usegalaxy to perform an alignment using tophat. I ...
Cannot join combined R1 and R2 files
Hi Everyone, From reading around, I'm not sure whether a solution has been found to the probl...
Trimmomatic and adapters
Hi all, As relatively new within the bioinformatics world, I am a bit confused when it comes to ...
Css behavior with fastqc html report
Hi all, I am currently working on a our galaxy instance, when I use fastqc tool shed, the html ...
mapping RNA-seq reads with "N" in the middle of each read
Hi all, I am performing differential gene expression analysis using the Tophat-Cuffdiff protocol...
Problems With Picard And Gatk Tools
Dear all, I have been trying to analyze some recently acquired WGS reads (re-sequencing with MiS...
pipeline for DNA-seq analysis
Thanks for all your help. Finally I got the data uploaded on the Galaxy. As suggested there was a...
Permission Denied Error When Running Fastqc
Hello All, One more problem when running analysis on local galaxy install. I am trying to run fa...
Can you see what i got problem??
I did uproading file through EBI homepage about NCBI Geo dataset SRR 5682257 &5682258. [EBI S...
Where can I find the Trinity statistics?
Hi, This is maybe a stupid question, but I am new to RNA sequencing and the Galaxy environment -...
Data From History Now Showing Up In Fastq Drop Down
Hi All, We have a galaxy local install. Thanks to Carlos's suggestion, I was able to get the ref...
Galaxy problems plus tech info
Dear Office, my user is fradiancona@yahoo.it I uploaded library files in Galaxy (fastq.gz files)...
Using Segments Of Sequences As A Reference Genome - Bowtie For Illumina
Dear all, My problem seems like something that should have a very simple solution from my end and...
Mapping of small dataset takes too long
Dear Galaxy Team, Do you know if there is a problem with NGS mapping of fastqsanger files in Gal...
Local Instance Data Upload
Hi, I am trying to set up a ftp server to upload data in my local instance of galaxy but I am...

Content

Help

About
FAQ

Access

RSS
Stats
API

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by Biostar version 16.09

Traffic: 180 users visited in the last hour