Question: Using filtered data for TopHat
1
gravatar for kuk2
2.0 years ago by
kuk220
kuk220 wrote:

Hi,

When I extract DRR data using "Extract reads in FASTQ/A format from NCBI SRA," for TopHat, I was able to get both a bam file and bam-index file.

However, when I filtered the extracted reads in FASTAQ format using "Filter FASTQ reads by quality score and length" and used the filtered data for TopHat, I was not be able to get the bam-index file and got the following error message, "An error occurred setting the metadata for this dataset Set it manually or retry auto-detection."

I repeated this twice, and I got the same message. Please let me know if you have any suggestion to fix this error.

Thanks,

kazz

ADD COMMENTlink modified 2.0 years ago by Jennifer Hillman Jackson25k • written 2.0 years ago by kuk220

Is your FASTQ file after the filter step still valid? For example if your file would be empty you can get such an error, because no index could be created.

ADD REPLYlink written 2.0 years ago by Bjoern Gruening5.1k

Is your FASTQ file after the filter step still valid? For example if your file would be empty you can get such an error, because no index could be created.

ADD REPLYlink written 2.0 years ago by Bjoern Gruening5.1k
0
gravatar for kuk2
2.0 years ago by
kuk220
kuk220 wrote:

Hi,

The file is not empty. When I tried to make the bai file from the link "Set manually and auto-detection" and then "Convert Format," I was able to make a bai file, and I was able to see the bam file using igv.

However, when I was analyzing a different data set (this time I used the original DRR data, not filtered) again, I got the same error (no bai file made) again (I tried twice, and I got the same error message; this situation is exactly the same as my first question). So I tried to make the bai file using the same command (Convert Format). Although this new data set was added to queue four days ago, it has never been processed, and I only get the following message, "This is a new dataset and not all of its data are available yet."

This new data set does not appear to be empty, either, since I can see some sequences and scores in the file.

I have no idea why this data set does not begin to be processed for such a long time. Please give me your suggestion to fix this problem. Thanks,

kazz

ADD COMMENTlink written 2.0 years ago by kuk220
0
gravatar for Jennifer Hillman Jackson
2.0 years ago by
United States
Jennifer Hillman Jackson25k wrote:

Hello,

There are some issues with Galaxy Main at http://usegalaxy.org. This may have impacted jobs as of late Weds and now is manifesting as temporary downtime (two distinct issues). Please follow here for details and updates: https://biostar.usegalaxy.org/p/20698/

Our apologies for the inconvenience, Jen, Galaxy team

ADD COMMENTlink written 2.0 years ago by Jennifer Hillman Jackson25k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 175 users visited in the last hour