Deconvoluting NGS Samples With Multiple Barcodes



Question: Deconvoluting NGS Samples With Multiple Barcodes


8.4 years ago by

Pip Griffin • 60

Pip Griffin • 60 wrote:

Hi,

I have a sequence file containing 454 reads for 64 barcoded samples. I also have a second 'query': a file listing the names of all 64 samples together with the corresponding 'sample identifier sequence' (19 bp) in the following format:

(AGGTTGATTGAATGGCTTA)|(GATGAAGAACGCAGAACCT)

(I need to search for either the forward or the reverse identifier.)

I want to 'join' the two queries by searching each read in the first query for a match to a 'sample identifier sequence' in the second query, so that I end up with a copy of the first query with a new column containing the sample names. But the 'join' command only returns perfect matches between columns. How can I join two queries on a partial match? (Obviously only 19 bp of each read will overlap with the identifier sequence.)

I could use the 'manipulate fastq' command, but as far as I can tell I would have to do 64 separate steps.

I would really appreciate any help with what is probably quite a simple problem!

Thanks very much,
Pip Griffin
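For illustration, the partial-match 'join' described above can be sketched outside Galaxy in a few lines of Python. This is only a minimal sketch under stated assumptions, not a Galaxy tool or the poster's method: it assumes the identifier file is tab-separated with the sample name in the first column and the `(FORWARD)|(REVERSE)` pattern in the second, that reads are available as (read ID, sequence) pairs, and that identifiers must match exactly. The function names, sample names, read sequences, and the second identifier pair are hypothetical; only the first identifier pair comes from the post.

```python
import re


def parse_identifier_table(lines):
    """Parse 'sample_name<TAB>(FWD)|(REV)' lines into (name, compiled regex) pairs.

    The tab-separated layout is an assumption about the identifier file.
    """
    table = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        name, pattern = line.split("\t")
        # The '(FWD)|(REV)' format is already a valid regular expression
        # that matches either the forward or the reverse identifier.
        table.append((name, re.compile(pattern)))
    return table


def assign_sample(sequence, table):
    """Return the first sample whose identifier occurs anywhere in the read, else None."""
    for name, regex in table:
        if regex.search(sequence):
            return name
    return None


def annotate_reads(reads, table):
    """Yield (read_id, sequence, sample_name), i.e. the reads plus a new sample column."""
    for read_id, sequence in reads:
        yield read_id, sequence, assign_sample(sequence, table) or "unassigned"


# Hypothetical demo data: sample names, reads, and the second identifier pair are made up.
identifier_lines = [
    "sample_01\t(AGGTTGATTGAATGGCTTA)|(GATGAAGAACGCAGAACCT)",
    "sample_02\t(ACGTACGTACGTACGTACG)|(TTGACCATGGCAATCGGTA)",
]
demo_reads = [
    ("read1", "CCCCAGGTTGATTGAATGGCTTAGGGGTTTT"),
    ("read2", "AAAATTGACCATGGCAATCGGTACC"),
    ("read3", "ACACACACAC"),
]

demo_table = parse_identifier_table(identifier_lines)
for row in annotate_reads(demo_reads, demo_table):
    print("\t".join(row))
```

Because every read is checked against all identifiers in a single pass, this avoids running 64 separate filtering steps. Note that exact matching leaves a read unassigned if its identifier contains a sequencing error, and that the reverse identifier is searched exactly as written in the file, without reverse-complementing.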

