Much of modern pharmaceutical development has occurred within or nearby the chemical space delineated by Lipinski’s rules of five (< 500 molecular weight, < 5 hydrogen bond donors, < 5 octanol/water partition coefficient, < 10 hydrogen bond acceptors) due to the practical advantages in pharmacokinetic properties evinced by such small molecules, especially cell permeation. This leaves intracellular interactions which occur over a larger surface area (i.e. not involving native small molecules) “undruggable”. Cyclic peptides of up to 1200 molecular weight have demonstrated the ability to inhibit a wide variety of such undruggable intracellular interactions. However, engineering cell permeability into cyclic peptides remains a major barrier to their therapeutic utility. Here I present my work using tandem mass spectrometry and the PAMPA artificial membrane permeability assay to Much of modern pharmaceutical development has occurred within or nearby the chemical space delineated by Lipinski’s rules of five (< 500 molecular weight, < 5 hydrogen bond donors, < 5 octanol/water partition coefficient, < 10 hydrogen bond acceptors) due to the practical advantages in pharmacokinetic properties evinced by such small molecules, especially cell permeation. Their small size has made them ideal for occupying small binding pockets instead of a protein’s intended substrate and achieving sufficient oral bioavailability for oral delivery is generally facile. More recently, therapeutic enzymes and antibodies have joined Insulin in a class of injectable macromolecule therapeutics to great success – multiple new therapeutic antibodies are approved each year. Their large size gives them superior selectivity, specificity, and potency compared to small molecules, but also precludes them from entering cells to modify intracellular interactions (and from oral delivery). These two therapeutic modalities leave a great many intracellular interactions which occur over a larger surface area (i.e., not involving native small molecules) “undruggable”, and the continual discovery of actionable disease-relevant interactions in this category has prompted a search for new therapeutic modalities.
Cyclic peptides of up to 1200 molecular weight have demonstrated the ability to inhibit a wide variety of such “undruggable” intracellular interactions. Their ease of synthesis combined with advances in DNA/mRNA encoded library construction and screening technologies have ensured that obtaining potent cyclic peptide leads against virtually any protein target is possible. However, these powerful encoding technologies cannot screen for permeability and engineering cell permeability into cyclic peptides remains a major barrier to their therapeutic utility. The conformational nature of cyclic peptide passive cell permeability has thus far defied computational prediction over a broad set of compounds and empirical evaluations of cyclic peptide passive permeability have been of limited size (tens to low hundreds). The overall goals of this dissertation are to empirically evaluate the prevalence of passively cell permeable backbone geometries across thousands of geometrically diverse cyclic hexa- and heptapeptides, to derive general insights into the design of passively cell permeable cyclic peptides, and to publish a database of known permeable backbone geometries with which screening libraries can be biased towards passive cell permeability.
Chapter one covers the details of CycLS, a non-de novo cyclic peptide sequencing program using a database-matching approach to allow sequencing of entire libraries of cyclic peptides in a timely manner. I validated CycLS against a unique-mass library of 400 cyclic hexapeptomers, achieving 95% sequencing accuracy despite a ratio greater than 500:1 of decoy sequences to true sequences, and against a mass-redundant 1800-member library of cyclic hexapeptides and hexapeptomers (also found in chapter two) by resynthesis of twenty-two individual compounds over a broad range of sequencing scores. I then devised a normalized sequencing confidence metric that was able to divide the seventeen successfully sequenced resynthesized compounds from the five unsuccessfully sequenced resynthesized compounds. Direct sequencing of cyclic peptides rather than by linearization or encoding is critical for passive cell permeability assays, which are sensitive to any encoding tag or the structural elements pre-requisite to many linearization strategies. CycLS is freely available online and improves on previous work in this area by inclusion of extensive spectral pre-processing to remove noise and boost signal, which is critical to ensure sequencing quality.
CycLS is the first step of a computational workflow to associate individual compound identities with corresponding mass spectrometry quantified assay results by matching individual peak retention times given an identical LCMS method. The remainder of that workflow can be found in appendix A and is composed of CycLS and two additional steps. Processing of the assay data is done via AutoPAMPA, which performs automated peak-finding, integration, and calculation of permeation rates for the PAMPA artificial membrane permeability assay. AutoPAMPA removes data analysis as the limiting factor to throughput for mass spectrometry quantified assays by batch-processing of hundreds to thousands of peaks per job. Finally, RTMerge performs peak alignment and matching by retention time between CycLS and AutoPAMPA outputs and generates statistics describing the physicochemical properties and structure of each compound. In addition to the work included here, this computational workflow has already been successfully used to explore the permeabilities of hundreds of lariat peptides and enables further such projects in the future.
Chapter two investigates the passive permeability of 1800 cyclic hexamers and 3600 structurally related cyclic heptamers with highly variant backbone geometries by PAMPA. Of especial interest were the effects of N-methyl residues, peptoid residues, and beta residues on passive permeability. I identified 823 hexamers and 1330 heptamers with permeation rates greater than 1 * 10-6 cm/s, a threshold at which compounds are considered passively cell permeable. I confirmed the utility of these library-derived permeabilities by correlation with the pure permeabilities of 9 resynthesized hexamers and 10 resynthesized heptamers. A matched-pair analysis revealed that peptoid and beta residues have a negative structural contribution to passive permeability that I hypothesize originates from their increased flexibility. As expected, a matched-pair analysis of stereochemistry showed little effect averaged over such diverse backbone geometries.
Library generation technologies with complete synthetic control stand to benefit greatly from the permeable hexamer and heptamer backbone geometries discovered in chapter two by selecting only the most permeable of them, but combinatorically generated libraries doing the same would be limited to small library sizes or few backbone geometries. To enable combinatorically generated libraries to better utilize known permeable backbone geometries I defined and investigated passive permeability “motifs” of length three, holding three residues of the library design static while allowing the rest to vary over a number of known permeable backbone geometries. The best motifs had median permeabilities four-fold greater than the median permeability of all other compounds with the same number of hydrogen bond donors. Bundled into motifs, these sets of permeable backbone geometries allow combinatorically generated libraries increased size and some degree of geometric diversity.
This dissertation has thoroughly explored the impact of stereochemistry, N-methylation, and peptoid residues on the passive permeability of cyclic hexa- and heptamers and gained some insights into the effect of beta residues. In addition to these insights into the average effects on permeability, this dissertation emphasizes that backbone geometries with high intrinsic permeability may be designed into DNA/mRNA encoded screening libraries to improve the likelihood of hits with favorable pharmacokinetic properties. Lastly, the computational workflow necessary to gain these insights can be used to obtain a similar register of “privileged” backbone geometries for larger ring sizes.