How to interpret and control for effects of primer contamination?

Heads up! This is a static archive of our support site. Please go to help.galaxyproject.org if you want to reach the Galaxy community. If you want to search this archive visit the Galaxy Hub search

Latest

Open

RNA-Seq

ChIP-Seq

SNP

Assembly

Forum

Home

Welcome to Galaxy Biostar! User support for Galaxy! about • faq • rss

Log In

Sign Up

Question: How to interpret and control for effects of primer contamination?

0

16 months ago by

neptune24 • 10

United Kingdom

neptune24 • 10 wrote:

Hi all, I'm currently analysing some RNAseq samples which were amplified prior to library prep and adapter ligation. Having filtered out the adapter sequences, I'm left with some very specific kmer contamination: CTTCAG starting at position 15 (with all bases on either side equally represented). I had a look at the sequences containing this kmer in that location and there are no individual sequences that are excessively abundant and the top few all map to common genes.

It seems to be coming from one of the amplification primers, despite the primer removal reaction: 3’-GACTTCNNNNNNNNNNNNNN (http://www.sigmaaldrich.com/technical-documents/protocols/biology/seqr.html)

Could anybody help me to understand how this is affecting my data and how I can control for it? Has this primer attached itself to the end of my sequences or has it caused an amplification bias towards genes containing that motif? Is there anything I can do to remove this contamination without losing a substantial portion of my data, or can I discount it because it seems to be consistent across samples?

Any insight would be much appreciated as I don't have much background in molecular biology.

amplification bias primer contamination • 366 views

ADD COMMENT • link •

written 16 months ago by neptune24 • 10

Please log in to add an answer.

Similar posts • Search »

Issue With Saving 'Manipulate Fastq' In Workflow; And Request For Advice Dealing With Barcoded 454 Data
Hi, I'm a new user, learning how to use Galaxy while I wait for my 454 results. So I'm not actua...
remove overrepresented (contaminated) sequence
Hello, Based on FastQC result, I have two overrepresented sequence. One is my TruSeq Adapter s...
Primer Contamination, Miranalyzer
Hi Galaxy, Ive got 2 problems for you; 1) Ive got microRNA Illumina NGS data that I want to ana...
RNAseq data to be processed in two ways: (i) mapping to de novo Trinity-based transcriptome and (ii) mapping a relatively new genome
Hello all, I am new to RNAseq data and learning this process step by step, so I have a few quest...
Galaxy Tuxedo Protocol Questions
Hi everybody, I'm totally new to RNA-seq analyses and it's my first time doing this and using Ga...
3' Adapter Trimming Using Fastx-Toolkit Clipper
Hi all, I am analyzing miRNA sequencing now. My data is 51bp, single -ended and ~5 M reads. I wan...
How do i remove multiple adapter sequences from my RNAseq reads?
Hi there, I want to remove the universal adapters as well as the index adapters in each data fil...
Galaxy fastqc contaminant list
Hello...I created a tab separated text file containing a primer name and associated primer sequen...
Getting species/taxa from GIs
Hi there, I'm using megablast to find out whether my RNAseq data might contain contamination from...
False negative results in GATK unified genotyper output
Hi all, I'm having some problems with the output of the GATK unified genotyper. Essentially I ...
Can contamination cause alignment errors and how can I trim contaminants away?
Hello Galaxy Team, Sorry if this question is all over the place. I have been trying to align ...
Removal of primers using Trimmomatic
Hello, Trimmomatic has an illuminaClip feature which appears to be able to cut custom sequences ...
FastQC Kmer in centre of sequence, quality trim?
I have run a FastQC on some fastqc sequences and the report tells me I have contaminating Kmers. ...
Batch conversion of ID to gene symbol
I'm an old school molecular biologist who studies gene expression but is quite new to bio-computi...
FASTQC Overrepresented Sequences
Hey all, after running Trimmomatic and clipping Illumina adapters, I always run a FASTQC to have...
Removing a single base from 3' end of the reads using "Clip" option
Hello, I need to remove a single base i.e. C from the 3' end of all of my reads. I tried doing i...
How To Extrapolate Differentially Expressed Genes After Running Cuffdiff?
Hi guys, have a question about the cuffdiff output "differential expression testing". For most o...
Running MACS2 without a control sample
While going through the [Analysis of Chip-Seq][1] data tutorial, I realized that one of my input ...
Chip Seq analysis with multiple biological replicates for differential expression
Hello, I am very new to sequence data analysis and had some structural questions. I am trying to ...
Degenerate primer removal tool on Galaxy?
Hi there, I am working on a next generation sequencing dataset and can't seem to find a tool ...
CutAdapt Batch Mode Input for 3' and 5' Adapters
Hi, I'm currently using the CutAdapt tool on Galaxy to remove adapter sequences from fastq-forma...
How To Analyze Genes Differential Expression Using Galaxy? Thanks.
Hi All, Now I have two Paired-End samples data ( they are Next Generation Sequence Data and fa...

Content

Help

About
FAQ

Access

RSS
Stats
API

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by Biostar version 16.09

Traffic: 171 users visited in the last hour