- Tallorin, Lorillee;
- Wang, JiaLei;
- Kim, Woojoo E;
- Sahu, Swagat;
- Kosa, Nicolas M;
- Yang, Pu;
- Thompson, Matthew;
- Gilson, Michael K;
- Frazier, Peter I;
- Burkart, Michael D;
- Gianneschi, Nathan C
The discovery of peptide substrates for enzymes with exclusive, selective activities is a central goal in chemical biology. In this paper, we develop a hybrid computational and biochemical method to rapidly optimize peptides for specific, orthogonal biochemical functions. The method is an iterative machine learning process by which experimental data is deposited into a mathematical algorithm that selects potential peptide substrates to be tested experimentally. Once tested, the algorithm uses the experimental data to refine future selections. This process is repeated until a suitable set of de novo peptide substrates are discovered. We employed this technology to discover orthogonal peptide substrates for 4'-phosphopantetheinyl transferase, an enzyme class that covalently modifies proteins. In this manner, we have demonstrated that machine learning can be leveraged to guide peptide optimization for specific biochemical functions not immediately accessible by biological screening techniques, such as phage display and random mutagenesis.