Contrasting disease and nondisease protein aggregation by molecular simulation.
Published Web Locationhttps://doi.org/10.1021/ar800062k
[Figurre: see text]. Protein aggregation can be defined as the sacrifice of stabilizing intrachain contacts of the functional state that are replaced with interchain contacts to form non-functional states. The resulting aggregate morphologies range from amorphous structures without long-range order typical of nondisease proteins involved in inclusion bodies to highly structured fibril assemblies typical of amyloid disease proteins. In this Account, we describe the development and application of computational models for the investigation of nondisease and disease protein aggregation as illustrated for the proteins L and G and the Alzheimer's Abeta systems. In each case, we validate the models against relevant experimental observables and then expand on the experimental window to better elucidate the link between molecular properties and aggregation outcomes. Our studies show that each class of protein exhibits distinct aggregation mechanisms that are dependent on protein sequence, protein concentration, and solution conditions. Nondisease proteins can have native structural elements in the denatured state ensemble or rapidly form early folding intermediates, which offers avenues of protection against aggregation even at relatively high concentrations. The possibility that early folding intermediates may be evolutionarily selected for their protective role against unwanted aggregation could be a useful strategy for reengineering sequences to slow aggregation and increase folding yield in industrial protein production. The observed oligomeric aggregates that we see for nondisease proteins L and G may represent the nuclei for larger aggregates, not just for large amorphous inclusion bodies, but potentially as the seeds of ordered fibrillar aggregates, since most nondisease proteins can form amyloid fibrils under conditions that destabilize the native state. By contrast, amyloidogenic protein sequences such as Abeta 1-40,42 and the familial Alzheimer's disease (FAD) mutants favor aggregation into ordered fibrils once the free-energy barrier for forming a critical nucleus is crossed. However, the structural characteristics and oligomer size of the soluble nucleation species have yet to be determined experimentally for any disease peptide sequence, and the molecular mechanism of polymerization that eventually delineates a mature fibril is unknown. This is in part due to the limited experimental access to very low peptide concentrations that are required to characterize these early aggregation events, providing an opportunity for theoretical studies to bridge the gap between the monomer and fibril end points and to develop testable hypotheses. Our model shows that Abeta 1-40 requires as few as 6-10 monomer chains (depending on sequence) to begin manifesting the cross-beta order that is a signature of formation of amyloid filaments or fibrils assessed in dye-binding kinetic assays. The richness of the oligomeric structures and viable filament and fibril polymorphs that we observe may offer structural clues to disease virulence variations that are seen for the WT and hereditary mutants.