Skip to main content
eScholarship
Open Access Publications from the University of California

Target Selection and Deselection at the Berkeley Structural Genomics Center

Abstract

At the Berkeley Structural Genomics Center (BSGC), our goal is to obtain a near-complete structural complement of proteins in the minimal organisms Mycoplasma genitalium and M. pneumoniae, two closely related pathogens. Current targets for structure determination have been selected in six major stages, starting with those predicted to be most tractable to high throughput study and likely to yield new structural information. We report on the process used to select these proteins, as well as our target deselection procedure. Target deselection reduces experimental effort by eliminating targets similar to those recently solved by the structural biology community or other centers. We measure the impact of the 69 structures solved at the BSGC as of July 2004 on structure prediction coverage of the M. pneumoniae and M. genitalium proteomes. The number of Mycoplasma proteins for which thefold could first be reliably assigned based on structures solved at the BSGC (24 M. pneumoniae and 21 M. genitalium) is approximately 25 percent of the total resulting from work at all structural genomics centers and the worldwide structural biology community (94 M. pneumoniae and 86M. genitalium) during the same period. As the number of structures contributed by the BSGC during that period is less than 1 percent of the total worldwide output, the benefits of a focused target selection strategy are apparent. If the structures of all current targets were solved, the percentage of M. pneumoniae proteins for which folds could be reliably assigned would increase from approximately 57 percent (391 of 687) at present to around 80 percent (550 of 687), and the percentage of the proteome that could be accurately modeled would increase from around 37 percent (254 of 687) to about 64 percent (438 of 687). In M. genitalium, the percentage of the proteome that could be structurally annotated based on structures of our remaining targets would rise from 72 percent (348 of 486) to around 76 percent (371 of 486), with the percentage of accurately modeled proteins would rise from 50 percent (243 of 486) to 58 percent (283 of 486). Sequences and data on experimental progress on our targets are available in the public databases Target DB and PEPCdb.

Main Content
For improved accessibility of PDF content, download the file to your device.
Current View