Solution Conformations Shed Light on PROTAC Cell Permeability

Proteolysis targeting chimeras (PROTACs) induce intracellular degradation of target proteins. Their bifunctional structure puts degraders in a chemical space where ADME properties often complicate drug discovery. Herein we provide the first structural insight into PROTAC cell permeability obtained by NMR studies of a VHL-based PROTAC (1), which is cell permeable despite having a high molecular weight and polarity and a large number of rotatable bonds. We found that 1 populates elongated and polar conformations in solutions that mimic extra- and intracellular compartments. Conformations were folded and had a smaller polar surface area in chloroform, mimicking a cell membrane interior. Formation of intramolecular and nonclassical hydrogen bonds, π–π interactions, and shielding of amide groups from solvent all facilitate cell permeability by minimization of size and polarity. We conclude that molecular chameleonicity appears to be of major importance for 1 to enter into target cells.

T argeted protein degradation is a new concept in drug discovery which is of particular interest for intracellular targets that are difficult to modulate with small molecule drugs. 1,2 This concept is based on proteolysis targeting chimeras (PROTACs), a chemical modality consisting of a ligand that binds to the target protein, a linker, and an E3 ubiquitin ligase ligand. PROTAC induced formation of a ternary complex with the target protein and the E3 ubiquitin leads to ubiquitination and subsequent degradation of the target protein. As a result of this mode of action PROTACs are catalytic. 3 Structural information on the ternary complexes between target proteins, PROTACs, and E3 ubiquitin ligases is now beginning to allow rational, structure-based design of PROTACs. 4 Design of PROTACs also requires knowledge about how their structures influence their ADME properties, with cell permeability being required for their mode of action. A few recent studies have begun to provide some, albeit sometimes conflicting, insight into how the ADME properties of PROTACs could be optimized. For instance, a study of a small set of seven PROTACs found that their permeabilities across Caco-2 cell monolayers correlated to chromato-graphically determined descriptors of lipophilicity and polarity. 5 Another study of 11 PROTACs highlighted how the combined use of the parallel artificial membrane permeability assay (PAMPA) and lipophilic permeability efficiency (LPE) can be used to provide insight into PROTAC cell permeability. 6 This study also suggested that AlogP should be kept below 5 to increase the likelihood for PROTACs to be cell permeable. However, other studies found no correlations between cell permeability and chromatographic lipophilicity or PAMPA permeabilities, 7,8 pointing to the need for increased understanding of how in vitro assays may be used for optimization of the ADME properties of PROTACs. 8 PROTACs reside in the chemical space close to, or beyond, 9,10 the outer limits for oral absorption deduced from analysis of drugs, clinical candidates, and optimized leads in the beyond rule of 5 chemical space. 11,12 For instance, the molecular weights of PROTACs typically range from 900 to 1100 Da, while their number of rotatable bonds falls between 20 and 25. 9,10 This puts PROTACs at pharmacokinetic risk, where low oral absorption and/or cell permeability may prevent them from reaching their intracellular targets. Currently, efforts to improve these potential shortcomings are hampered by the lack of understanding of PROTAC structure−property relationships, for instance regarding how they cross cell membranes.
For the first time, we provide structural and property based insight into how PROTACs may permeate cell membranes by behaving as molecular chameleons. We used NMR spectroscopy to determine the conformational ensembles of PROTAC 1 ( Figure 1A) in solutions having different polarity and hydrogen bonding properties. Chloroform was chosen to mimic the interior of a cell membrane as it has a dielectric constant (ε = 4.8) close to that of a lipid bilayer (ε = 3.0). 13 As PROTAC 1 has low aqueous solubility (Table 1), DMSO alone and in a 10:1 mixture with water was used to resemble the aqueous extra-and intracellular environments.
PROTAC 1 targets the extracellular signal-regulated kinase 5 (ERK5), 14 a potential cancer target, by recruitment of the Von Hippel−Lindau (VHL) tumor suppressor E3 ligase. Thus, the E3 ligase ligand of 1 belongs to one of the two ligand classes that are used in the majority of PROTACs, with the other class being based on cereblon (CRBN) E3 ligase ligands. 9,10 Maple et al. recently reported a set of 217 VHL PROTACs, 10 the majority of which have flexible linkers that can be classified as either aliphatic (alkyl, n = 63) or based on ethylene glycol (PEG, n = 135). A third, smaller subset has rigid linkers (n = 19) based on alkynes combined with piperidine, pyridine, and/ or piperazine moieties. Comparison of the descriptors of Lipinski's 15 and Veber's 16 rules calculated for 1 to those of the subset of PEG-based PROTACs revealed 1 to be a good representative of this subset ( Figure 1B). All six descriptors for 1 fall within the 25th to 75th percentile of the corresponding descriptor for the subset. The subsets that have alkyl or rigid linkers differ from the PEG subset by having a higher calculated lipophilicity (cLogP), a lower topological polar surface area (TPSA) and fewer hydrogen bond acceptors (HBA) ( Figure S3). As expected, the rigid PROTAC subset has a lower number of rotatable bonds (NRotB) than the two other subsets. It is notable that two or more of the hydrogen bond donors (HBDs) in most VHL PROTACs originate from amide bonds; a feature found to be particularly detrimental for the Caco-2 cell permeability of a collection of macrocycles residing in beyond rule of 5 chemical space. 17 PROTAC 1 was synthesized from phenol 2 14 by alkylation with ethyl bromoacetate followed by saponification of the ethyl ester to give carboxylic acid 3 (Scheme 1). Acid 3 was then coupled with amine 4 18 using HATU as the promoter to give 1 after purification by reversed phase HPLC. Similar to many PROTACs, the aqueous solubility of 1 is low, 8 whereas its lipophilicity is in the drug-like range and the permeability in the PAMPA assay is medium to high (Table 1). PROTAC 1 binds to its target protein, ERK5, 14 and to the VHL E3 ligase in biochemical assays with IC 50 values just above and below 1 μM, respectively ( Table 1). The VHL potency drops off 14fold in a cell-based assay, revealing 1 to have a medium permeability into target cells. Despite residing in chemical space far beyond the rule of 5 15 and Veber's rule, 16 where cell permeability is expected to be low, PROTAC 1 surprisingly displays medium to high permeability. We therefore considered 1 to be of great interest for investigating how structural and conformational properties, such as molecular chameleonicity, may enable PROTACs to be cell permeable.
The solution conformational ensembles of 1 were determined by deconvolution of time-averaged NMR data into individual conformations using the NMR analysis of molecular flexibility in solution (NAMFIS) algorithm. 19 NAMFIS has previously been used to determine the solution conformations of both flexible, linear compounds and of macrocycles having different flexibility. 20−25 Proton−proton distances obtained from nuclear Overhauser effect (NOE) build-up measurements and dihedral angles calculated from   Figure 2). Theoretical conformational ensembles that provide a comprehensive coverage of conformational space are also required as input to NAMFIS. The required ensembles were generated by unrestrained Monte Carlo conformational searches using a variety of force fields and implicit solvent models within a 42 kJ/mol energy window. 20 Based on these inputs, the NAMFIS algorithm varies the probability of each conformer in the theoretical ensembles to find the best fit of the probability    20 i.e., by the addition of random noise to the experimental data, by the random removal of individual experimental restraints, and by comparison of the experimentally observed and back-calculated distances and scalar coupling constants. Just as most of the reported VHL-based PROTACs, 9,10 1 has a large number of rotatable bonds (NRotB = 27, Figure  1B) indicating a high flexibility. In spite of the potentially high flexibility, inspection of the proton−proton distances and dihedral angles revealed that the two domains of 1 were well constrained in all three solvents (Figure 2). Both the conformation of the VHL ligand domain and of the region linking the ERK5 binding protein of interest (POI) ligand to the first part of the flexible linker were constrained by multiple proton−proton distances and dihedral angles. In addition, a few long-range NOEs were observed for 1 in each of the three solvents between protons located in the POI and VHL ligands, or in either of the two ligands and a distant part of the linker. The distances that originated from these long-range NOEs indicated that folded conformations were found in the ensembles, which brought protons distant from each other in the structure of 1 into close spatial contact. To ensure that the long-range NOEs did not enforce distorted conformations of the POI-linker and VHL ligand domains, the ensembles

ACS Medicinal Chemistry Letters
pubs.acs.org/acsmedchemlett Letter obtained by NAMFIS analyses using all experimental restraints were validated against ensembles using restraints only from the two well-defined domains of 1, in addition to the standard validation described above.
The NAMFIS analysis revealed that each of the three solution ensembles of 1 consisted of a limited number of conformations, ranging from 6 to 10 in each of the three solutions (Table S10). The conformations in the DMSO and DMSO−water ensembles were structurally more diverse than those in chloroform (Table S12). In the two polar solutions, the maximum pairwise RMSD values between the most different conformations were 8.9 and 8.4 Å, while it was 6.6 Å in chloroform. No single (congruent) conformation was adopted in two solutions; the most similar conformations were number 2 (chloroform) and number 13 (DMSO) that had a pairwise RMSD value of 3.2 Å. Four or five major conformations represented approximately 70−80% of each ensemble (a major conformation was defined as having a population ≥10%). In chloroform, PROTAC 1 adopted highly folded conformations. In three of the four major conformations, the backbone of 1 made two turns, while the remaining major conformation had one turn ( Figure 3). The four minor conformations in chloroform were all folded with two turns ( Figure S2). In the polar solvents, more elongated conformations were observed, with two of the major ones in DMSO being linear and one being folded with one turn. In the DMSO−water solution, one linear conformation and three folded conformations with one turn were found. The minor conformations in the two polar solutions were also more elongated than those in chloroform ( Figure S2).
Passive cell permeability occurs via desolvation of the compound as it leaves the extracellular aqueous environment, followed by interactions with the negatively charged phospholipid head groups and subsequent passage across the hydrophobic membrane interior. The polarity and the size of the compound, when adopting the permeating conformation(s), are the key properties that determine the kinetics of the desolvation step and the diffusion across the membrane. 26 The compound's polarity is well described by its solvent accessible 3D polar surface area (SA 3D PSA) while its size is approximated by the radius of gyration (R gyr ). The R gyr is calculated as the root-mean-square distance between the atoms of compound and its center of mass, 27 and we calculated the SA 3D PSA of 1 based on its polar atoms (O, N, and attached H) as well as on additional partial charges as recently reported. 20 With the exception of one conformation (no. 4, 10%), the conformations adopted by PROTAC 1 in chloroform had a R gyr in the interval of 5.4−6.2 Å, i.e., at the very low end of the range (5.2−9.7 Å) displayed by all conformations in the three solutions ( Figure 4A and B, Table S11). The conformations in DMSO spanned a wider size (R gyr ) range than that in chloroform, whereas the size range in DMSO−water was closer to that in chloroform ( Figure 4B). As revealed by the population weighted mean values for R gyr , conformations were less compact in DMSO and DMSO−water than in chloroform. The SA 3D PSA of six of the conformations which represent 87% of the ensemble in chloroform were ≤207 Å 2 , i.e., in the lower third of the range of all 24 conformations (172−259 Å 2 ). The ensembles in the three solutions populated similar ranges for SA 3D PSA, but the ensembles in DMSO and DMSO− water were shifted toward higher polarity. Population weighted mean values for SA 3D PSA increased significantly, i.e., by just over 35 Å 2 , between the ensembles adopted in chloroform and DMSO−water, with the ensemble in DMSO having an intermediate value ( Figure 4C). The solution conformations of 1 show a weak correlation between their R gyr and SA 3D PSA ( Figure 4A) and also a weak inverse correlation between the number of intramolecular hydrogen bonds (IMHBs) and the SA 3D PSA ( Figure S4). Thus, compact conformations (low R gyr ) tend to have a lower SA 3D PSA and more IMHBs, all of which contribute to a higher permeability than if less compact and more polar conformations were adopted. Importantly, all but one of the conformations in chloroform, i.e., 90% of the ensemble, have a R gyr below 7 Å, the proposed upper cutoff for passive cell permeability of compounds in bRo5 space. 26 Similarly, the majority (87%) of the ensemble in chloroform has a SA 3D PSA in the range were cell permeability may be low-moderate (up to 200−210 Å 2 ), as indicated by a recent study. 20 In contrast, conformations in DMSO and DMSO−water were significantly larger, and those in DMSO−water were also more polar. We therefore conclude that most conformations populated in chloroform have values for R gyr and SA 3D PSA that should allow 1 to permeate cells and that folded conformations having two turns such as conformations 1−3 are likely be the permeating species.
Inspection of the proposed permeating conformations 1−3 provided detailed insight into what intramolecular interactions allow an environment dependent reduction of size and polarity for 1 ( Figure 5). In conformation 1, PROTAC 1 forms an IMHB between the hydroxyl group of hydroxyproline and the tertiary amide of the ERK5 ligand. The NH of two of the secondary amide bonds of 1 point toward the center of the folded conformation, thereby shielding them from the surrounding apolar environment, while the third amide NH may be partially shielded by the pyrimidine ring. In addition to the IMHB, a π−π interaction between the pyrimidine and thiazole rings may contribute to minimizing the R gyr of 1 in this conformation. Two IMHBs, which involve the NH of two of the three secondary amides of 1, are formed in conformation 2 that has a low R gyr and a very low SA 3D PSA. The third amide NH is partially shielded from the surrounding as it points to the center of the conformation. In addition, the hydroxyl group of the hydroxyproline moiety of 1 is shielded from solvent by the pyrimidine of the ERK5 ligand, potentially by formation of a nonclassical hydrogen bond to the pyrimidine. The SA 3D PSA of conformation 3 is reduced by IMHBs that involve the NH of two of the three secondary amides of 1 and by shielding of the remaining amide NH by the adjacent tert-butyl group. In this case, a π−π interaction between the pyrimidine ring and phenyl ring in the VHL domain could contribute to keeping the R gyr of conformation 3 low, similar to the π−π interaction of conformation 1. In summary, inspection of conformations 1−3 reveal that the SA 3D PSA of PROTAC 1 can be reduced in an apolar environment by the formation of IMHBs, shielding of amide NH groups from solvent and by formation of nonclassical hydrogen bonds. In addition to these intramolecular interactions, π−π interactions appear to contribute to minimization of R gyr . We suggest that the ability to adopt folded conformations stabilized by similar intramolecular interactions that minimize R gyr and SA 3D PSA is likely to be required also for other VHL PROTACs that have flexible PEG-based linkers to enter cells and induce target degradation.
Recent studies of drugs beyond the rule of 5 space, 11,28 most of which are semirigid macrocycles, have revealed that they benefit from behaving as molecular chameleons in order to display high aqueous solubility and cell permeability. 20,29,30 This study is the first to show that the more flexible PROTACs also can behave as molecular chameleons and that this property is important for their cell permeability. By using a validated NMR technique, we found that the VHL based PROTAC 1 adopts its shape, i.e., its radius of gyration (R gyr ) and its solvent accessible 3D polar surface area (SA 3D PSA) to the surrounding environment. Compound 1 populated conformations that were more elongated and polar in environments used as mimics of polar extra-and intracellular compartments. In contrast, most conformations were folded with a low R gyr and a low SA 3D PSA in chloroform, which has a polarity close to that of the center of a cell membrane. Importantly, the folded conformations adopted in chloroform had values for R gyr and SA 3D PSA compatible with satisfactory passive cell permeability, 20,26 even though properties such as MW, TPSA, and the number of rotatable bonds place 1 into chemical space far beyond the rule of 5. As the calculated properties of 1 are representative for reported VHL based PROTACs that have flexible, PEG-based linkers, chameleonicity may also be of general importance for the cell permeability of this type of PROTACs. However, this hypothesis needs to be confirmed by studies of additional PROTACs. Design of cell permeable PROTACs could capitalize on recent learnings from other drug candidates, where intramolecular hydrogen bonds 31 and shielding of amide