Controlling Structure and Dimensions of a Disordered Protein via Mutations

The dimensions of intrinsically disordered proteins (IDPs) are sensitive to small energetic-entropic differences between intramolecular and protein–solvent interactions. This is commonly observed on modulating solvent composition and temperature. However, the inherently heterogeneous conformational landscape of IDPs is also expected to be influenced by mutations that can (de)stabilize pockets of local and even global structure, native and non-native, and hence the average dimensions. Here, we show experimental evidence for the remarkably tunable landscape of IDPs by employing the DNA-binding domain of CytR, a high-sequence-complexity IDP, as a model system. CytR exhibits a range of structure and compactness upon introducing specific mutations that modulate microscopic terms, including main-chain entropy, hydrophobicity, and electrostatics. The degree of secondary structure, as monitored by far-UV circular dichroism (CD), is strongly correlated to average ensemble dimensions for 14 different mutants of CytR and is consistent with the Uversky–Fink relation. Our experiments highlight how average ensemble dimensions can be controlled via mutations even in the disordered regime, the prevalence of non-native interactions and provide testable controls for molecular simulations.

a huge challenge as small changes in dimensions are difficult to extract from conventional experiments.
Intrinsically disordered proteins (IDPs), on the other hand, provide a wonderful testing bed for probing mutation-driven structural perturbations. 13 High-sequence-complexity IDPs, i.e., those protein sequences that are disordered despite exhibiting little compositional bias, are more interesting as they are expected to populate pockets of local structure, rich in native or non-native interactions (deduced from the folded conformation), apart from fully disordered states. For example, this can be observed in studies of denatured states of folded proteins. [14][15][16] It should, therefore, be possible to tune the dimensions of natural IDPs through small perturbations of basic thermodynamic factors including backbone conformational entropy, hydrophobicity, and charge-charge interactions, that (de)stabilize pockets of structure.
The CytR DNA-binding domain (referred to as CytR hereon) is a high-sequence-complexity IDP that acquires a compact three-helix bundle structure in the presence of DNA ( Figure  1A). 17 The conformational ensemble of CytR is sensitive to temperature, 18 salt, 19 and DNA, 17,19 all of which arise from a combination of destabilizing electrostatics ( Figure 1B) and weak packing in the folded conformation. 20 In this work, we control the dimensions of the disordered CytR via a combination of folded structure and sequence-alignment-based expectations, and rational engineering. Though we observe nonintuitive effects, a likely manifestation of non-native interactions within the disordered ensemble, we find a strong correlation between secondary structure and the apparent Stokes radius (R s ), a first such observation in IDPs.
We have shown earlier that the P33A mutation in CytR reduces the secondary structure content by reducing the population of an excited folded-like conformation. 21 Main-chain entropic considerations 22 suggest that, if an amino acid with a higher flexibility is introduced at the same position, the secondary structure content is expected to be even lower, arising from the larger entropic destabilization of residual structure. True to this, the thermal dependence of the P33G mutant's CD signal is flat (blue in Figure 2A) unlike the weak structural transitions observed in the WT or the P33A mutant (red and green in Figure  2A, respectively). In addition to the secondary structure content, the mean dimensions as measured by R s (in a calibrated size-exclusion chromatography column 20 ), shows a trend where P33G is more expanded than the WT ( Figure 2B). Another interesting position is residue K35 located in the electrostatically frustrated binding site of the folded conformation with a number of positive charges around it ( Figure 1B). Eliminating this positive charge through the K35Q mutant not only enhances the secondary structure content (though only marginally) but also reduces the dimensions of the ensemble (cyan in Figure 2).
Similarly, we have shown recently that the double mutant A29V/A48M (DM; that introduces two large hydrophobic substitutions in the protein core) promotes collapse of the disordered ensemble into a compact folded ensemble while simultaneously increasing the secondarystructure content (mutations identified via sequence comparisons; magenta in Figure 2 and Supporting Figure S1). 20 Building on the folded-like conformation of the DM, we engineered two other mutants, Quad (A29V/A48M/R43A/K46A; black in Figure 2) and Pent (A29V/A48M/R43A/K46A/R28A; gray in Figure 2), that further eliminate the residual electrostatic frustration. These two mutants again promote structure and compaction while differentially affecting the secondary structure content. However, our attempts to rationally engineer structure or loss of structure were not always successful. For example, the A29V mutant enhances the structure-compactness only in the presence of the R28Q substitution and not otherwise (dark cyan in Figure 3A). Thus, collapse and structure acquisition in polymer chains is an emergent property arising from long-range many-body effects that can be engineered not only via hydrophobicity (as shown before for the DM) but also via a combination of hydrophobicity and electrostatics (A29V/R28Q mutant).
Interestingly, a number of single-point mutations (R28Q, A29V, D34S, R43N, R43E, K46A, and A48M) modulate ensemble dimensions despite being disordered ( Figure 3A,B). The average dimensions of CytR can be mutationally varied from ~22 Å in the P33G mutant to ~16 Å in the Pent mutant ( Figure 3B; Figure S2 for the elution profiles). If one excludes DM, Quad, and Pent that induce large structural changes, the maximal change in Stokes radius is ~3.5 Å, which is still significant as it corresponds to ~40% reduction in the volume of the protein chain compared to the fully unfolded P33G mutant.
Mutations that eliminate unfavorable interactions (R43N, R43E, K46A) as expected from the folded conformation still result in an expansion of the ensemble, suggestive of nonnative interactions in the disordered state ( Figure 3B). The only exceptions to this are the R28Q and K35Q mutations that promote slight compaction and the D34S mutation that promotes slight expansion, upon elimination of unfavorable and favorable interactions, respectively ( Figure 3B). Despite these varied outcomes on mutations, we find that it is possible to collapse all such variations into a single plot of ensemble dimensions versus secondary-structure content ( Figure 3C; r = 0.96 and p <10 −9 ). In fact, the dependence of relative CD signals on the relative molecular volumes agrees reasonably well with the Uversky-Fink relation 23 ( Figure S3). We also observe a similar correlation between dimensions and secondary-structure content at different salt concentrations for the CytR WT combining SEC and analytical ultracentrifugation data ( Figure S4). These observations are strong evidence that the magnitude of the far-UV CD signal at 222 nm (in absolute units) for a given protein can be an intimate indicator of the molecular dimension even in the disordered regime, 24 particularly for helical protein domains, and if appropriately calibrated.
Is the correlation between Stokes radius and secondary structure a manifestation of (de)population of only folded-like conformations upon mutation? To answer this, we modeled the conformational behavior of CytR and its mutants via the semiquantitative Wako-Saitô-Muñoz-Eaton (WSME) statistical mechanical model that considers an ensemble of 2 47 microstates (47 being the length of the folded region of CytR) 25,26 with contributions from packing interactions, electrostatics, and implicit solvation. 27,28 Since the model is Go-like, 29 it captures the effect of only those mutations that display nativestructure-derived expectations (Figure 2 and Table S1). The predicted unfolding curves are very similar to experimental observations, both in terms of the order of stability and structure ( Figure 4A), validating the model energetics and assumptions. All of the generated free-energy profiles display multiple minima whose relative populations are modulated on mutation ( Figure 4B,C). This is also consistent with both explicit and implicit-solvent simulations of WT CytR that highlight a flat conformational landscape with numerous competing sub-states. 18,30 In fact, even the folded-like DM displays broad, weakly cooperative, and probe-dependent unfolding. 20 Taken together, our observations suggest that the apparent Stokes radius represents an effective ensemble average of numerous partially structured states.
The solvent sensitivity of IDP dimensions arises from small imbalances between intrachain and chain-solvent interactions. 15,16,[31][32][33][34] In this work, we tune this relative balance by introducing mutations in a high-sequence-complexity IDP, and thus indirectly affecting the solvent quality (as seen by the polymer chain) without modulating solvent composition. Our results establish that the average dimensions of CytR can be modulated via mutations in a continuous manner through both rational engineering and non-native effects. CytR and its mutants can thus serve as excellent model systems for simulations, and to particularly validate force-fields and water models without resorting to extreme solvent conditions or temperature.

Supplementary Material
Refer to Web version on PubMed Central for supplementary material.   Biochemistry. Author manuscript; available in PMC 2020 August 14.

Europe PMC Funders Author Manuscripts
Europe PMC Funders Author Manuscripts (A) Thermal unfolding curves predicted by the WSME model for the mutants of CytR following calibration of model parameters with the WT unfolding curve as described before. 21,29 The color coding is the same as in Figure 3A. (B and C) Predicted one-dimensional free-energy profiles of WT CytR (thin red), single-point mutants (panel B), and multiplepoint mutants (panel C).