Skip to main content
eScholarship
Open Access Publications from the University of California

Assessment of 454 Sequencing Errors in Microbial Genomes

Abstract

The introduction of pyrosequencing-based sequencing platforms such as 454-sequencing along with the Sanger technology for whole-genome shotgun sequencing has provided an alternative way to produce cost-effective sequence data. For microbial genome finishing, the 454/Sanger hybrid approach has made a considerable impact in reducing the time required to close a genome. However, errors within the 454 sequencing data remain to be addressed to make sure that final consensus represents high quality sequence. JGI is working on further cost reduction of sequencing and finishing processes and is planning significant cut of Sanger sequencing. This will increase the portion of the genome covered with pyrosequence only. Thus the analysis of errors produced by this new technology becomes very important. We conducted a study to assess the quality of the 454 sequence data in order to determine its impact on error rate, and to devise a strategy to correct or reduce the errors. The study analyzed 29 microbial genomes containing both Sanger and 454 only assemblies to look for mismatches in the 454 sequence data. We examined both homopolymer tracts and non-homopolymer regions with respect to read depth and quality assignment. An overview of our findings and strategy for reducing the 454 errors will be presented.

Main Content
For improved accessibility of PDF content, download the file to your device.
Current View